Real-time, multilingual conversations with no input language required. Now with personal voice, full language coverage, and human-interpreter-level latency.
Today, we’re excited to introduce Live Interpreter, a breakthrough new capability in Azure Speech Translation that makes real-time, multilingual communication effortless. Live Interpreter continuously identifies the language being spoken without requiring you to set an input language, and delivers low-latency speech-to-speech translation in a natural voice that preserves the speaker’s style and tone. With coverage across 76 input languages and 143 locales, Live Interpreter helps people communicate clearly and inclusively in everyday scenarios such as Teams meetings, customer support centers, international classrooms, and global events.
Key features of the Live Interpreter API include:
- Automated & continuous Language Identification (LID): No need to set an input language; detect and translate automatically, even when speakers switch languages mid-session.
- Full language coverage: Supports all 76 input languages and 143 locales available in Azure Speech Translation.
- Significant latency improvements: Real-time speech-to-speech translation with substantially reduced latency, on par with human interpreters in natural conversations.
- Personal voice that preserves style and tone: Translations delivered in a voice that sounds like the original speaker, maintaining intonation and pacing, and backed by enterprise-grade consent controls.
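To make the continuous language identification idea concrete, here is a minimal sketch of how an application might keep track of the languages heard in a session and the points where a speaker switched. It does not call the Live Interpreter API. The `Segment` and `SessionLanguageTracker` names and their fields are hypothetical, chosen only for illustration, and the sample segments stand in for whatever results a real client would receive.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple


@dataclass
class Segment:
    """One recognized utterance (hypothetical shape, not a real SDK result type)."""
    detected_language: str  # locale reported by language identification, e.g. "en-US"
    text: str               # recognized source text


@dataclass
class SessionLanguageTracker:
    """Records the languages heard in a session and where the speaker switched."""
    languages: List[str] = field(default_factory=list)  # in order of first appearance
    switches: List[Tuple[int, str, str]] = field(default_factory=list)  # (index, from, to)
    _current: Optional[str] = None
    _count: int = 0

    def observe(self, segment: Segment) -> None:
        lang = segment.detected_language
        # Add the locale to the session language list the first time it is heard.
        if lang not in self.languages:
            self.languages.append(lang)
        # Record a switch when the detected language differs from the previous segment.
        if self._current is not None and lang != self._current:
            self.switches.append((self._count, self._current, lang))
        self._current = lang
        self._count += 1


# Illustrative input only: a caller who moves between English and Spanish.
tracker = SessionLanguageTracker()
for seg in [
    Segment("en-US", "Hello, I need help with my order."),
    Segment("es-ES", "Perdón, prefiero hablar en español."),
    Segment("es-ES", "Mi pedido no ha llegado."),
    Segment("en-US", "Thank you."),
]:
    tracker.observe(seg)

print(tracker.languages)  # ['en-US', 'es-ES']
print(tracker.switches)   # [(1, 'en-US', 'es-ES'), (3, 'es-ES', 'en-US')]
```

Because the language is detected per segment, the application never has to restart the session or ask the speaker to pick a language; it only needs to react to what each segment reports.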
“We’re excited to partner with Microsoft and to demonstrate what’s possible when AI meets everyday tech. Built on the Azure Speech Translation Live Interpreter capability, we’re able to deliver smarter, more intuitive, and truly immersive audiovisual experiences for users around the world.”
— Anker Innovations
Customer Scenarios enabled by the Live Interpreter API
The Live Interpreter API enables natural, real-time multilingual experiences without setting an input language, even when speakers switch languages mid-conversation. With automated and continuous LID, full language and locale coverage, interpreter-level latency in many scenarios, and personal voice that preserves the speaker’s style, you can unlock new user experiences across customer service, collaboration, education, social commerce, and more:
- Multilingual Contact Centers: Serve global customers without language menus or session restarts. Agents get real-time translations—even when callers switch languages—plus session language lists for compliance and analytics.
- Online Meetings & Events: Deliver inclusive meetings where participants choose their language. Live Interpreter provides low-latency translations with personal voice and seamless handling of language switches.
- Multilingual Classrooms: Students hear lectures in their native language through smart headphones, with translations that preserve the instructor’s tone and pacing for better comprehension.
- Social Commerce Live Streaming: Creators reach global audiences with real-time translations that keep their voice style, maintaining brand personality and engagement across markets.
These scenarios illustrate how Live Interpreter unlocks possibilities that were once tedious, highly inefficient, or even impossible.
Get Started Today
Start building multilingual speech translation into your products with our QuickStart Guide.