Tuesday, December 23, 2025

AI That Reads Traffic

Milestone Launches Vision Language Model (VLM)
Milestone Launches Vision Language Model (VLM)

Vision language models trained on traffic data help cities and transport networks move from reactive video monitoring to proactive insight generation.

As video surveillance systems expand across cities and transportation networks, the volume of visual data generated has increased sharply. While cameras capture valuable information, reviewing footage and extracting insights remains time consuming and largely manual. Operators often deal with long review cycles, false alarms, and reporting delays, creating demand for tools that can automatically interpret video content and reduce operational load.

- Advertisement -

To address this, Milestone Systems has introduced a vision language model focused on traffic understanding, supporting two new offerings which is a Video Summarization tool for XProtect Video Management Software and a Vision Language Model as a Service for third party integrations. Both are powered by NVIDIA Cosmos Reason and trained on real world traffic video data.

The Video Summarization tool integrates into the XProtect Smart Client and uses generative AI to analyze short video segments and generate text based summaries of events. Users can search video content using natural language instead of timestamps or manual tags, helping streamline investigations and reporting. Early usage indicates the tool can reduce operator false alarm fatigue by up to 30 percent.

The Vision Language Model as a Service provides developers and integrators with API access to production ready video intelligence. It allows traffic focused AI capabilities to be added to existing applications without building or managing custom AI systems. The service supports regional deployments and is designed to accelerate development and scaling.

- Advertisement -

Key features include:

  • Automatic video to text summaries within XProtect
  • Content based search and filtering of video events
  • Integration with existing rules and alerts
  • API based access to traffic optimized vision language models
  • Region specific models for the US and EU
  • Training based on responsibly sourced and auditable data

Andrew Burnett, Acting Chief Technology Officer at Milestone Systems says that the new offerings aim to reduce video overload by delivering faster insights for operators while enabling developers to deploy advanced video intelligence with lower effort and infrastructure complexity.

SHARE YOUR THOUGHTS & COMMENTS

EFY Prime

Unique DIY Projects

Electronics News

Truly Innovative Electronics

Latest DIY Videos

Electronics Components

Electronics Jobs

Calculators For Electronics

×