SOLVED

"Simpler" or more basic GUI

Copper Contributor

Hey, 
is there or will there be a "simpler" or more basic GUI for users who just want to punch in some Text and call it a day?


5 Replies

@LeonhardBleicher if I understand your question, are you looking for some portal where you can just enter text and voice (along with your cognitive services token) and be able to download the audio file produced?

best response confirmed by LeonhardBleicher (Copper Contributor)
Solution

@LeonhardBleicher 

 

Are you asking about the Speech Portal for creating custom voices and language models, or just a simple way to try our speech?

If it is the later,  take a look at the Speech CLI.

https://docs.microsoft.com/en-us/azure/cognitive-services/speech-service/spx-basics?tabs=dockerinsta...

We have both a nuget tool package, and a docker container that are easy to setup.
It takes longer to get your speech keys setup, that to pull the tool packages and make your first calls to the service via the Speech CLI.

If this sounds like what you want, let me know and I can reply back with some more detailed instructions and an example or two.

Thanks,

Brian.

yes, at the moment there are alot of settings you can choose from. This can be overwhelming to some users. A broken down version like "Input"->choose Language/Speaker -> GO!" woud be great.
Just a way to create Speech from Text.
Thank you.
I will take a look at it :).

@LeonhardBleicher 

If you have .NetCore installed, and a speech key, try this.

dotnet tool install Microsoft.CognitiveServices.Speech.CLI --version 1.15.0
spx config synthesize @region --set westus2
spx config synthesize @key --set {YourWestUS2Key}

spx synthesize --text "Hello, would you like a glass of water?" --voice "en-AU-NatashaNeural" --audio output test.wav

That will allow you to quickly and easily create audio from your text, with either the standard or one of our pre-made Neural Voices.

It's not a GUI, but would that work for you?

1 best response

Accepted Solutions
best response confirmed by LeonhardBleicher (Copper Contributor)
Solution

@LeonhardBleicher 

 

Are you asking about the Speech Portal for creating custom voices and language models, or just a simple way to try our speech?

If it is the later,  take a look at the Speech CLI.

https://docs.microsoft.com/en-us/azure/cognitive-services/speech-service/spx-basics?tabs=dockerinsta...

We have both a nuget tool package, and a docker container that are easy to setup.
It takes longer to get your speech keys setup, that to pull the tool packages and make your first calls to the service via the Speech CLI.

If this sounds like what you want, let me know and I can reply back with some more detailed instructions and an example or two.

Thanks,

Brian.

View solution in original post