How to extract audio from an MP4
In Teams personal chats, there’s no direct option to send an audio message. But Teams does have a “Record video clip” feature, where users can just speak and send a short video (which is actually an MP4 file).
Now if we want to use that voice for speech-to-text (like with Azure Speech Service), we’ll need to extract the audio from the MP4 first.
The best way is to convert that MP4 into an MP3 or WAV file. For Azure, WAV is the better choice, ideally mono with a 16 kHz sample rate, since 16-bit PCM WAV is the Speech service's default input format.
You can use ffmpeg to do that:
```bash
ffmpeg -i input.mp4 -ac 1 -ar 16000 output.wav
```
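If there are several clips to convert, the same command can be wrapped in a small helper. This is only a sketch: the function names `wav_name_for` and `extract_audio` are made up for illustration, and it assumes ffmpeg is on the PATH. It adds `-vn` to skip the video stream and `-c:a pcm_s16le` to ask for 16-bit PCM explicitly, which the original command leaves to ffmpeg's WAV default.

```shell
# Hypothetical helper: derive the .wav file name for a given clip
# by swapping the last extension (clip.mp4 -> clip.wav).
wav_name_for() {
  local input
  input="$1"
  printf '%s.wav\n' "${input%.*}"
}

# Hypothetical helper: convert one clip to mono, 16 kHz, 16-bit PCM WAV.
# Assumes ffmpeg is installed and on the PATH.
extract_audio() {
  local input
  local output
  input="$1"
  output="$(wav_name_for "$input")"
  # -nostdin: do not read from the terminal (safer inside loops)
  # -y:       overwrite an existing output file
  # -vn:      ignore the video stream
  # -ac 1 / -ar 16000: mono, 16 kHz (same as the command above)
  # -c:a pcm_s16le: 16-bit little-endian PCM
  ffmpeg -nostdin -y -i "$input" -vn -ac 1 -ar 16000 -c:a pcm_s16le "$output"
}
```

To convert every clip in a folder, something like `for f in *.mp4; do extract_audio "$f"; done` should do it.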
Once you've got the .wav file, just send it to the Azure Speech API to get the transcription.
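For short clips, one way to do that is the Speech-to-text REST API for short audio, called with curl. Treat this as a rough sketch rather than the definitive way: the helper names `speech_endpoint` and `transcribe_wav` are made up, and `SPEECH_KEY` and `SPEECH_REGION` are assumed environment variables holding your own resource key and region (for example `westus`). As far as I know this endpoint only accepts about 60 seconds of audio per request, so longer recordings would need the Speech SDK or batch transcription instead.

```shell
# Hypothetical helper: build the short-audio endpoint URL
# for a region and (optionally) a recognition language.
speech_endpoint() {
  local region
  local language
  region="$1"
  language="${2:-en-US}"
  printf 'https://%s.stt.speech.microsoft.com/speech/recognition/conversation/cognitiveservices/v1?language=%s\n' \
    "$region" "$language"
}

# Hypothetical helper: POST a 16 kHz mono PCM WAV file and print the JSON reply.
# Assumes SPEECH_KEY and SPEECH_REGION are set in the environment.
transcribe_wav() {
  local wav
  wav="$1"
  curl --silent --show-error --fail \
    -X POST "$(speech_endpoint "$SPEECH_REGION")" \
    -H "Ocp-Apim-Subscription-Key: $SPEECH_KEY" \
    -H "Content-Type: audio/wav; codecs=audio/pcm; samplerate=16000" \
    -H "Accept: application/json" \
    --data-binary "@$wav"
}
```

Calling `transcribe_wav output.wav` should print a JSON response; on success the recognized text is normally in the `DisplayText` field, next to a `RecognitionStatus` value.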
So yeah, even though Teams doesn’t support direct audio messages in personal chats, this is a pretty solid workaround. Users just record a quick video, we extract the audio, and we’re good to go.
Hello Lakshmi_145, could you please confirm whether your issue has been resolved with the above suggestion, or whether you are still looking for help?