
AI - Azure AI services Blog

Enter a new era of enterprise communication with Microsoft Translator Pro & document image translation

SwethaMachanavajhala
Nov 20, 2024

Microsoft Translator Pro: standalone, native mobile experience

We are thrilled to unveil the gated public preview of Microsoft Translator Pro, our robust solution designed for enterprises seeking to dismantle language barriers in the workplace.

Available on iOS, Microsoft Translator Pro offers a standalone, native experience, enabling speech-to-speech translated conversations among coworkers, users, or clients within your enterprise ecosystem.

Watch how Microsoft Translator Pro transforms a hotel check-in experience by breaking down language barriers. In this video, a hotel receptionist speaks in English, and the app translates and plays the message aloud in Chinese for the traveler. The traveler responds in Chinese, and the app translates and plays the message aloud in English for the receptionist.

 

 

Key features of the public preview

Our enterprise version of the app is packed with features tailored to meet the stringent demands of enterprises:

  • Core feature - speech-to-speech translation:
    • Break language barriers: Real-time speech-to-speech translation allows you to have seamless communication with individuals speaking different languages.
    • Unified experience: View or hear both transcription and translation simultaneously on a single device, ensuring smooth and efficient conversations.
  • On-device translation: Use the app's speech-to-speech translation capability without an internet connection for a limited set of languages, so your productivity is not interrupted.
  • Full administrator control: Enterprise IT administrators have extensive control over the app's deployment and usage within your organization. They can fine-tune settings to manage conversation history, audit logs, and diagnostic logs, with the ability to disable history or configure automatic export of the history to cloud storage.
  • Uncompromised privacy and security: Microsoft Translator Pro provides enterprises with a high level of translation quality and robust security. We know that privacy and security are top priorities for you. Once granted access by your organization's admin, you can sign in to the app with your organizational credentials. Your conversational data remains strictly yours, safeguarded within your Azure tenant. Neither Microsoft nor any external entities have access to your data.

Join the Preview

To embark on this journey with us, please complete the gating form. Once your organization meets the criteria, we will grant it access to the paid version of the Microsoft Translator Pro app, which is now available in the US.

Learn more and get started: Microsoft Translator Pro documentation.

Document translation translates text embedded in images

Our commitment to advancing cross-language communication takes a major step forward with a new enhancement in Azure AI Translator’s Document Translation (DT) feature. Previously, Document Translation supported fully digital documents and scanned PDFs. Starting January 2025, with this latest update, the service can also process mixed-content documents, translating both digital text and text embedded within images.

Sample document translated from English to Spanish:

(Frames in order: Source document, translated output document (image not translated), translated output document with image translation)

How It Works

To enable this feature, the Document Translation service now uses the Microsoft Azure AI Vision API to detect and extract text from images within documents, and then translates that text. This capability is especially useful when documents contain a mix of digital text and image-based text, because it produces complete translations without manual intervention.

Getting Started

To take advantage of this feature, customers can use the new optional parameter when setting up a translation request:

Request:

A new parameter under "options" called "translateTextWithinImage" has been introduced. This parameter is of type Boolean, accepting "true" or "false." The default value is "false," so you’ll need to set it to "true" to activate the image text translation capability.
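
As a rough illustration of the request described above, here is a minimal Python sketch that assembles a request body with the new option switched on. Only the "options" object and the "translateTextWithinImage" name come from this post; the "inputs", "source", and "targets" layout, the helper name build_translation_request, and the example storage URLs are assumptions made for illustration, so check them against the Translator documentation before relying on them.

```python
import json


def build_translation_request(source_url, target_url, target_language,
                              translate_images=False):
    """Assemble a JSON-serializable body for a document translation request.

    Hypothetical helper: everything except the "options" entry is an
    assumed layout, not something stated in this post.
    """
    return {
        # Assumed layout: one source location and one translation target.
        "inputs": [
            {
                "source": {"sourceUrl": source_url},
                "targets": [
                    {"targetUrl": target_url, "language": target_language}
                ],
            }
        ],
        # From the post: a Boolean under "options" that defaults to false.
        "options": {"translateTextWithinImage": bool(translate_images)},
    }


# Placeholder URLs; replace with your own storage locations.
body = build_translation_request(
    "https://example.blob.core.windows.net/source",
    "https://example.blob.core.windows.net/target",
    "es",
    translate_images=True,
)
print(json.dumps(body, indent=2))
```

Because the option defaults to "false," leaving out translate_images in this sketch produces a body that keeps image text translation off, which mirrors the service default described above.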

Response:

When this feature is enabled, the response will include additional details for transparency on image processing:

totalImageScansSucceeded: The count of successfully translated image scans.
totalImageScansFailed: The count of image scans that encountered processing issues.
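
To show how these two counters might be used, here is a small Python sketch that turns them into a quick report. It assumes you have already parsed the response into a dictionary that holds both fields; where exactly they sit in the full response payload is not covered here, and the helper name summarize_image_scans and the sample values are made up for illustration.

```python
def summarize_image_scans(counts):
    """Summarize image-scan counters from a parsed response dictionary.

    Hypothetical helper: missing fields are treated as zero, which is an
    assumption for cases where image translation was not enabled.
    """
    succeeded = int(counts.get("totalImageScansSucceeded", 0))
    failed = int(counts.get("totalImageScansFailed", 0))
    total = succeeded + failed
    return {
        "succeeded": succeeded,
        "failed": failed,
        "total": total,
        # True only when at least one image was scanned and none failed.
        "all_succeeded": total > 0 and failed == 0,
    }


# Made-up sample values, not real service output.
sample = {"totalImageScansSucceeded": 3, "totalImageScansFailed": 1}
report = summarize_image_scans(sample)
print(report)
```

A non-zero failed count is a signal to review the output document, since some image text may have been left untranslated.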

Usage and cost

This feature requires an Azure AI Services resource, because it uses Azure AI Vision services along with Azure AI Translator. The OCR service incurs additional charges based on usage. For OCR rates, see: Pricing details

Learn more and get started (starting January 2025): Translator Documentation

These new advancements reflect our dedication to pushing boundaries in Document Translation, empowering enterprises to connect and collaborate more effectively, regardless of language. Stay tuned for more innovations as we continue to expand the reach and capabilities of Microsoft Azure AI Translator.

Updated Nov 21, 2024
Version 2.0