Your browser is out-of-date!

Update your browser to view this website correctly. Update my browser now

×

PTZOptics launches AI visual reasoning initiative

Camera manufacturer has partnered with Moondream, an open-source vision language model developer, to turn live video feeds into automated, actionable workflows 

PTZOptics has launched a visual reasoning initiative that combines its robotic PTZ camera systems with AI-powered vision models, partnering with Moondream, a developer of lightweight, open-source vision language models, to bring the concept to market.

The initiative is designed to enable integrators and developers to build systems that analyse live video and trigger automated workflows, from auto-tracking and event-based alerts to content indexing and defect detection on production lines. PTZOptics says the platform is built around an open ecosystem, allowing third parties to develop and deploy custom workflows without being tied to a single technology stack.

Moondream, whose lightweight open-source vision language models are central to the partnership, provides the visual inference layer that the company says can interpret camera feeds in real time and translate them into discrete actions or data outputs.

Supporting the launch are three industry partners demonstrating the technology in specific use cases. Axle AI is applying it to media asset management, using visual reasoning to index and search large video libraries. Detect-It is deploying it on factory floors for defect detection and assembly verification. LayerJot is developing workflows for surgical environments, including instrument tracking and automated compliance checks.

Paul Richards, chief revenue officer of PTZOptics, said: “We created this movement to help small teams deliver much bigger results, with fewer errors, whether producing local sports matches and corporate events or delivering critical monitoring work in factories and hospitals.”

Jay Allen, co-founder of Moondream, added: “PTZOptics has spent years making remote cameras reliable, controllable, and easy to deploy. The alignment of these cameras with our lightweight visual AI solutions makes it now possible to deliver automated, practical decision making.”

Etay Gafni, co-founder of LayerJot, added: “In high-stakes environments such as operating theatres, you need compliant, secure systems that can understand what’s on the operating table, what’s changed, and what action to take next.”

PTZOptics has published a set of Ten Principles of Visual Reasoning covering open integration, privacy, data transparency and education-led adoption. An open-source GitHub repository is available for developers, and a companion book, Visual Reasoning AI for Broadcast and Pro AV, has been released alongside the initiative.

You can subscribe to Installation magazine for free here and the daily newsletter here.