All versions of the TTS I see on the web are hosted on sites that connect to their own external API (I think?) to generate the voice lines, i.e. this site hosted on tetyys.com
Is there any way I could locally host Microsoft Sam on my own machine? It’s possible with the Software Automatic Mouth TTS, so I’ve been wondering if I could do the same with this one.
I was looking for something similar back at the start of summer and the best I could find at the time was a Microsoft model on hugging face - https://huggingface.co/microsoft/speecht5_tts. It’s a bit robotic, but its pretty versatile and since it outputs a .wav file it’s easy to integrate it into any system you might be working on/with.
Only thing that’s difficult about it is that you need to understand sampling rates to make sure the voice is created correctly, but I think the example on the hugging face page works as is.