Forum Discussion
mkcmichael
Feb 10, 2021Copper Contributor
Batch speech questions
For batch speech:
- Automatic Punctuation - this seems to be overly "greedy" on adding sentences - are there some refinement options here?
- How to tell which speech models accept Human Labeled transcript AND audio? What's the recommended hours of audio to include? I've seen conflicting amounts (20 - 1000 hours)
4 Replies
Sort By
- HeikoRa
Microsoft
There is currently no ability to adjust how the automatic punctuation works. Languages that specify "Acoustic Model" in the Language Support page here https://docs.microsoft.com/en-us/azure/cognitive-services/speech-service/language-support leverage the audio you provide for adaptation. The maximum amount of data is about 20 hours. I would suggest at least a few hours of audio data for adaptation.- mkcmichaelCopper Contributor
I am confused because there is no indication here on which supports audio:
From trial and error, I found that 20201019 does accept audio and 20200715 does not
- HeikoRa
Microsoft
mkcmichael sorry for the confusion. I appreciate your feedback and we will look into making this clearer in the future.