Group speech recognition

BryanSchacht · ‎Feb 10 2021

I'm wondering if there has been much headway in speech to text in a setting like a room with multiple speakers? The only way I know of is to have individual mics for each person to clearly separate the voices or directional mic integration with some processing prior to speech recognition. Users like the speech to text transcripts with attribution like in Teams but of course it is terrible if one side of a conference has multiple people contributing. Ends up being a jumble of words usually (but at least entertaining to read!). So I'm wondering if you know of any solutions or work in that area that might be on the roadmap?

Is it correct also that the Speech containers will run on Ubuntu Linux on ARM also? It seems that way from the descriptions.

Thanks,

Bryan

HeikoRa · ‎Feb 10 2021

@BryanSchacht for the room transcription take a look here: Conversation Transcription (Preview) - Speech service - Azure Cognitive Services | Microsoft Docs

Curious425 · ‎Feb 10 2021

We have experimented with it, and even released this preview of a Conversational Transcription Service. You can take a look at this and give us feedback. https://docs.microsoft.com/en-us/azure/cognitive-services/speech-service/conversation-transcription

Group speech recognition

Group speech recognition

Re: Group speech recognition

RE: Group speech recognition

Products (50)

Special Topics (27)

Video Hub (462)

Most Active Hubs

Most Active Hubs

Video Hub

Group speech recognition

Group speech recognition

Re: Group speech recognition

RE: Group speech recognition