Updated 11 February 2026 at 23:56 IST

Sarvam AI Launches Saaras V3, Boosting Real-Time Speech Recognition in 22 Languages

Sarvam launches Saaras V3 with real-time speech recognition in 22 Indian languages, offering low-latency transcription, language detection, and speaker identification for media, enterprise, and government use.

Image: Pratyush Kumar/X

New Delhi: Sarvam AI has launched Saaras V3, the latest version of its speech recognition model, with a strong focus on mixed-language and noisy audio environments.

The new model supports all 22 scheduled languages of India and now offers real-time streaming, which the company says delivers low-latency transcription without a loss of accuracy. Saaras V3 also includes automatic language detection, word-level timestamps, and speaker identification for multi-speaker recordings.

According to the company, the model is designed for use in voice bots, subtitling, and large-scale analysis of call recordings. Sarvam says the update extends its lead in speech recognition for Indian languages.

The launch of Saaras V3 is part of a wider set of announcements made by the company in a series of social media posts.

From Budget Dubbing to a Million Daily Conversations

Earlier, Sarvam revealed that it had enabled live multi-language dubbing of the Union Budget speech on Republic TV, reaching millions of homes with under two minutes of delay. This was powered by its Sarvam Dub system, which focuses on retaining speaker voice similarity while delivering fast translations.

The company also highlighted the growth of its conversational platform, Samvaad, which now handles over one million minutes of interactions every day. These AI-powered voice agents are being used for customer service, sales, and large-scale outreach programmes.

Sarvam claimed that nearly 80 per cent of its automated calls are now difficult to distinguish from calls made by humans. The company says this has led to higher customer engagement and stronger sales interest.

Push for Sovereign AI and State Partnerships

In another announcement, Sarvam introduced Sarvam Vision, a 3-billion-parameter vision-language model aimed at improving digitisation in Indian languages. It also launched Bulbul V3, its latest text-to-speech system, which topped a third-party human listening study for preference and accuracy.

The company has also partnered with the governments of Odisha and Tamil Nadu to build state-level AI infrastructure and deploy AI across departments. These projects aim to develop sovereign models and expand public use of artificial intelligence.

Sarvam also unveiled Arya, a multi-agent orchestration platform for enterprise use. The company plans to open-source the system, which is designed to support large-scale, reliable AI workflows.

Throughout its posts, Sarvam stressed its focus on building a “sovereign AI” ecosystem for India. It argued that local data and models are key to long-term economic and technological growth.

With Saaras V3 at the centre of its latest rollout, Sarvam is positioning itself as a major player in Indian-language speech technology, targeting applications across media, government, and enterprise services.

Published By : Shruti Sneha

Published On: 11 February 2026 at 23:56 IST