I am targeting not only Japanese, but also Chinese. I might try to use Forvo API, but that is not always possible for random words and sentences.
I know there is Windows’ built-in and macOS’s built-in; where you can access easily with Anki’s AwesomeTTS. But, is there a decent version for Linux / Docker? espeak sounds bad for Chinese…
I plan to make a web service, where I shouldn’t rely on Google TTS hacks. It should be free and non-dependent forever.
I see that there is TensorflowTTS, but what about the training data?
BTW, for those who just want to make Anki TTS, and want to save time,
I’ve only used IBM Watson and Microsoft Azure TTS so far. They both have some great neural voices that sound quite realistic, with Microsoft having the much larger selection of available languages and voices. You can try it out here, the Japanese sounds pretty realistic to me. It’s only free up to a monthly limit though.
The macOS TTS for Japanese is pretty decent, but it doesn’t work with the Siri voices as Apple restricts the use of it. You can easily output the audio with the “say” command in the terminal with the -o flag with the filename. Obviously, the Microsoft Azure voice still better, which I have added to my SRS study app I am currently developing.
I renamed the URL a little. It will try to find in WaniKani API / Forvo, or synthesize from Azure, or gTTS. In addition, I stored the generated files in Object storage, so it won’t cost too much Forvo or TTS resources.
Also, I parsed {{ExampleSentences}} more complex than that.