New TTS engines

lolodomo · January 18, 2014, 5:16pm

I open a new topic after a short discussion in private with one of you.

Apparently it exists alternatives to Google TTS engine like Cepstral or the OSX TTS engine that we could easily use in the plugin (in place of Google).

First I changed a little the code to make it more easy to add an alternative TTS engine (mainly put the specific code for Google in a function).

I hope we can have good news soon.

flyveleder · January 18, 2014, 6:20pm

One of the best engines I have come across is this one : http://demo.acapela-group.com/

Sounds perfect in my language (Danish)

/Martin

lolodomo · January 18, 2014, 6:28pm

Of course, we need an engine offering an API and if possible through a simple HTTP request.

lolodomo · January 18, 2014, 6:37pm

[quote=“flyveleder, post:2, topic:178940”]One of the best engines I have come across is this one : http://demo.acapela-group.com/

Sounds perfect in my language (Danish)[/quote]

Oh yes, not too bad in French too 8)

lolodomo · January 18, 2014, 6:40pm

Google TTS is a free service available on the Internet.

We could imagine to use a TTS engine not directly on the Internet but running on a server in our LAN network.
I think this is what will be explained for example with the OSX TTS engine.
What we need here is an HTTP API to request the engine to produce the audio file.

flyveleder · January 18, 2014, 6:46pm

Vera being Linux - maybe installed straight onto the device itself ? …

lolodomo · January 18, 2014, 6:48pm

A cloud service seems to be available.
I imagine it is not free ?

ericonvera · January 19, 2014, 6:43pm

I’ve succeeded in implementing this using the TTS Server provided here → [url=http://wolfpaulus.com/jounal/mac/ttsserver/]http://wolfpaulus.com/jounal/mac/ttsserver/[/url]

The big benefits of using this are that it doesn’t require an internet connection, it will always exist (meaning google can’t just pull the plug), it’s customizeable (you can select from many voices and adjust the speech rate), and it’s just cool.

I have an implementation file for @lolodomo to review as I’m not sure of the best way to implement this in a way without my hardcoded values pointing at my server.

lolodomo · January 19, 2014, 7:26pm

That’s really a great news. 8)

The big benefits of using this are that it doesn't require an internet connection, it will always exist (meaning google can't just pull the plug), it's customizeable (you can select from many voices and adjust the speech rate), and it's just cool.

I am curious to know if it exists available TTS engine, free if possible, that could be run for example on a raspberry Pi ?

I have an implementation file for @lolodomo to review as I'm not sure of the best way to implement this in a way without my hardcoded values pointing at my server.

I have to see your code but I think a simple variable to define the URL of the TTS server could probably solve this problem.

ericonvera · January 19, 2014, 7:43pm

Cepstral works on a raspberry pi. They have a page on their website talking about this. It’s quite inexpensive from what I understand. The API could probably be put together in a few lines of python.

Sent from my SCH-I545 using Tapatalk

Edit: here’s the link → [url=http://www.cepstral.com/en/raspberrypi]http://www.cepstral.com/en/raspberrypi[/url]

lolodomo · January 19, 2014, 8:09pm

I tried the demo voices from Cepstral and I am not fully convinced by the quality ! Plus there is no French voices of France

lolodomo · January 19, 2014, 8:57pm

I just tested patched version by @ericonvera and it is working very well … but you don’t have the choice of the language.

The way to request TTS is absolutely similar to what we do with Google, but in this case you point to a personal TTS server rather than Google.

I will make the required changes and then commit that very soon. 8)

Integration of other TTS engines is of course welcome.

lolodomo · January 19, 2014, 9:06pm

This article has a link to LumenVox: LumenVox Product Overview | Modular, Speech-Enabling Technology
You can try TTS and it is working pretty well. But prices seem to be very high. Maybe not a solution for personal use.

ericonvera · January 19, 2014, 9:22pm

I just downloaded a French voice on my mac. If you choose “Audrey” from the list of available voices in that admin panel I linked you, you’ll get French support. I’ve set it up as the default just so you can hear it. Let me know if it works for you.

SM2k · January 20, 2014, 1:58am

Some time ago, I built a TTS replacement using python’s SimpleHTTPServer and OSX’s say command, just in case google’s service ever went away. If anybody’s interested, I can find the code and post it.

ericonvera · January 20, 2014, 2:10am

I’m pretty sure that’s the same idea behind the one that I got working from this site → [url=http://wolfpaulus.com/jounal/mac/ttsserver/]http://wolfpaulus.com/jounal/mac/ttsserver/[/url] but I’m not sure exactly how he implemented it.

lolodomo · January 20, 2014, 8:01pm

I just downloaded a French voice on my mac. If you choose “Audrey” from the list of available voices in that admin panel I linked you, you’ll get French support. I’ve set it up as the default just so you can hear it. Let me know if it works for you.[/quote]

Yes, it works and the quality is really excellent, maybe the best quality I have heard ! But the speech is just too fast and I cannot use any special characters (french accents). Difficult to find a long sentence without that Maybe there is a charset encoding for the text to setup on server side ?

Even if it is not a real problem (generally you need one language), can you confirm that with this solution, we can have only one language supported at a time. Because now, if I use en or fr for the language, I always get Audrey’s voice.

ericonvera · January 20, 2014, 8:46pm

This can be changed through the admin panel (I think).

I’m not sure on the charset what our options are. Ideally, I’d like to get the TTSServer code on github and modify it to allow the voice to be set from the URL as well as make adjustments for speech speed and see what needs to be done for encoding. I’ve emailed the original author letting him know about our application here. If I hear back from him, I’ll ask about forking the project - otherwise I’ll look into writing from scratch or using a start from @SM2k.

lolodomo · January 20, 2014, 10:17pm

Read chapter 4.4, you can apparently choose your voice but you have to use a HTTP POST request.

lolodomo · January 20, 2014, 10:50pm

What about the Microsoft translator service ? A HTTP API exists with a speak feature ? Any idea of the audio quality?
The service is free.