Problem: When using Azure cloud voices for TTS in AP, the neural voice sounds more mechanical than normal Azure samples. This happens both with and without SSML tags.
Is there a way to get less mechanical and more natural sounding voice through TSS?
Target audio quality should be the same as the sample available here: https://azure.microsoft.com/en-us/services/cognitive-services/text-to-speech/#features
With the following settings:
In the AP I tried to produce the audio with the following settings, but did not get even close to the required quality:
<mstts:express-as style=“customerservice”>You can replace this text with any text you wish. You can either write in this text box or paste your own text here.
I would love to attach some audio samples to this message but I am currently unable to do so as a new user.
ActivePresenter version: 8.5.4
OS: Win 10
It would save me and my colleagues a lot of time, because we are required to use the “less mechanical voice” in our videos which we currently have to generate via and external app and put into our videos one by one manually.