User Profile
Quetzalcoatl
Former Employee
Joined 5 years ago
Recent Discussions
Speaker identification in transcripts?
Once a stream is completed and ready for playback, how can we:
1) Download the transcript easily (without a select-all-copy hack or multiple clicks in Settings), and
2) Potentially determine who spoke when?

If (2) is not available based on the audio, I could provide a quick prototype on GitHub if it may be useful to others, at least for playback of meetings with <=3 people actively speaking. Here is one way: https://stackoverflow.com/questions/20414667/cocktail-party-algorithm-svd-implementation-in-one-line-of-code

Here is what the experience could look like:
1. On the Stream page for the conversation, there is an obvious single-click button in the top-right that says "Download transcript" [no extra clicks or hacks required]
2. The transcript is downloaded as a simple text file [not "vtt"; e.g. "txt", so that Windows quickly opens it with something like Notepad]
3. The downloaded text file has lines like this: [time] <person> spoken text

So for example:
[12:20] Bob: good morning, everyone welcome to the meeting.
[12:21] Alice: today we will speak about a new tool on Teams

The names above could easily be determined from who is in the meeting, which is metadata that the Teams app already has during a meeting.

Let's make it happen! I'm happy to help. We don't need a breakthrough algorithm or much machine learning to do this with enough accuracy to be useful. It's mostly putting metadata together.

Here are some potential use-cases:
1. Determine how long each person spoke -- this could help derive which topics in the meeting may have been most important
2. Determine who asked the most questions and who answered the most questions -- this could help with follow-up discussions, e.g. if someone answered most questions about a given topic, they could be emailed for follow-up questions.

Re: Download Transcripts
Is there a potential ETA on when a simple "Download transcript" button will be enabled? The current workarounds suggested below are nice hacks and provide neat things like the "confidence" of the speech-to-text, but it would be nice to have a simple button just for this simple text.

I'd also like to separately use the audio clips to determine who spoke when, so that "Download transcript" would produce an easy-to-read file like:
[12:20] Bob: good morning, everyone welcome to the meeting.
[12:21] Alice: today we will speak about a new tool on Teams
etc...

Related: https://techcommunity.microsoft.com/t5/microsoft-stream-forum/how-do-i-download-the-transcript-and-determine-who-spoke-when/m-p/2279254
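
For anyone who wants to try the use-cases on a file in the "[time] Name: text" layout proposed above, here is a rough Python sketch. It is only an illustration, not anything Stream or Teams provides: the function names parse_transcript and speaker_stats are made up for this example, the timestamp is assumed to be hour:minute, a turn is assumed to last until the next line starts, and a question is simply counted by "?" characters.

```python
import re
from collections import defaultdict

# Matches lines such as "[12:20] Bob: good morning" (layout assumed from the post).
LINE_RE = re.compile(r"^\[(\d{1,2}):(\d{2})\]\s+([^:]+):\s*(.*)$")


def parse_transcript(text):
    """Return a list of (minutes_since_midnight, speaker, spoken_text) tuples."""
    entries = []
    for line in text.splitlines():
        match = LINE_RE.match(line.strip())
        if match is None:
            continue  # skip blank or malformed lines
        hours, minutes, speaker, spoken = match.groups()
        entries.append((int(hours) * 60 + int(minutes), speaker.strip(), spoken))
    return entries


def speaker_stats(entries):
    """Estimate minutes spoken, words, and question marks per speaker."""
    stats = defaultdict(lambda: {"minutes": 0, "words": 0, "questions": 0})
    for index, (start, speaker, spoken) in enumerate(entries):
        # Assumption: a turn lasts until the next line starts.
        # The last turn has no known end, so it adds 0 minutes.
        if index + 1 < len(entries):
            stats[speaker]["minutes"] += max(0, entries[index + 1][0] - start)
        stats[speaker]["words"] += len(spoken.split())
        stats[speaker]["questions"] += spoken.count("?")
    return dict(stats)


if __name__ == "__main__":
    sample = (
        "[12:20] Bob: good morning, everyone welcome to the meeting.\n"
        "[12:21] Alice: today we will speak about a new tool on Teams\n"
    )
    for name, values in speaker_stats(parse_transcript(sample)).items():
        print(name, values)
```

With minute-level timestamps the speaking-time estimate is coarse, so the word count is probably the more useful number for short meetings.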