Aug 31 2023 09:19 PM - edited Sep 01 2023 04:59 PM
When the SaraNeural TTS reads "haha" it makes a laughing sound. However, it is not pitched up accordingly - I pitch the voice up by 25% in SSML, but the laugh sounds like the default pitch. This makes it sound jarring and out of place.
A solution I'm looking for would be to either pitch up the laughter, or for the laughter to instead just be read as "haha". I assume this issue is present with other sounds such as groans that the TTS may attempt to do.
As for my second problem, the tts reads text with emojis such as 😻 correctly, however the SpeechSynthesizer returns completely broken word-for-word subtitles for the text.
Thanks for any help!