Cleanup client.speaker and add additional tts speakers #155

Holzhaus · 2014-09-06T09:24:16Z

This is the second chunk of split up pull request #124.

I've added svox-pico-tts and google-tts. Pico TTS is the old Google OpenSource TTS Engine. Try it, it's much better than espeak!

Also, get rid of aplay and use pyaudio instead (pyaudio is a dependency anyway, so why rely on an external tool that might not be installed)

charliermarsh · 2014-09-08T21:04:08Z

client/speaker.py

@@ -56,7 +195,7 @@ def newSpeaker():
        ValueError if no speaker implementation is supported on this platform
    """

-    for cls in [eSpeakSpeaker, saySpeaker]:
+    for cls in [googleSpeaker, picoSpeaker, eSpeakSpeaker, saySpeaker]:


Do we want to add a profile option to configure the speaker?

Holzhaus · 2014-09-09T16:55:44Z

OK. I'll do that tomorrow.
What if the selected speaker is not available? Fallback to another one or just sys.exit(1) and display an error message?

charliermarsh · 2014-09-09T18:13:02Z

Error message, IMO. I don't think that will happen very often and, when it does, you probably want to know that the speaker failed.

Holzhaus · 2014-09-10T19:19:47Z

Parsing profile.yml and getting the config value for tts_engine will be added tomorrow.

Holzhaus · 2014-09-11T12:29:02Z

Done.

Holzhaus · 2014-09-12T16:20:46Z

Can I merge this in?

charliermarsh · 2014-09-12T21:44:02Z

I will test this shortly. Hopefully we can merge it in today.

charliermarsh · 2014-09-12T22:05:22Z

On the Pi, for whatever reason, the output is incredibly choppy. Must be something to do with using pyaudio vs. aplay. Will dig a little deeper. (Interesting, I see this elsewhere around the web, e.g., http://stackoverflow.com/questions/21903597/pyaudio-sound-quality-when-playing-a-file.)

charliermarsh · 2014-09-12T22:06:46Z

client/speaker.py

 """
 import os
+import platform
+import re
+import sys
 import json


Unused import.

charliermarsh · 2014-09-12T22:22:21Z

I've tried a bunch of different chunk sizes (12000, 44100) to no avail.

Holzhaus · 2014-09-13T11:17:51Z

Have You tried the example from documentation?

Holzhaus · 2014-09-15T08:38:11Z

The PyAudio example code runs without problems. I've added testing code to speaker.py. Just run speaker.py directly and it tries to say "This is a test" with all available speakers.

This also works fine on my Raspberry Pi (running ArchLinuxARM). Can you confirm that?

Holzhaus · 2014-09-15T10:51:03Z

If that works for you, too, the problem could be caused by:

Slow SD card (simultaneous reading of wave file and writing to stream)
High CPU load
Misconfigured Sound System (likely, because you needed to specify the sound device explicitly when using aplay)
We're initializing PyAudio/Portaudio multiple times (in mic.py, in speaker.py) simultaneously. Maybe we're allowed to do this only once?

If the latter is true, we should create a dedicated audio class as a wrapper for pyaudio (which is probably a good idea anyway, so that we can easily switch from pyaudio to something else, if we need to.)

charliermarsh · 2014-09-15T14:29:54Z

Thanks @Holzhaus. Will test again tonight and look into a few of these suggestions.

charliermarsh · 2014-09-16T04:56:38Z

I still get the same poor audio quality. I tried flushing the file as well, but to no avail. Not nearly familiar enough with the Pi audio settings to figure out what's wrong with my config--I'm using the primary disk image that we ship. I suspect that it's a buffering problem, perhaps related to the SD card.

Is there a reason why we need to do this play indirection for the eSpeakSpeaker? Why not just call eSpeak directly, like we did in the past?

Holzhaus · 2014-09-16T17:46:43Z

@crm416 What tests did you run exactly? Please do these test in that order to isolate the problem.

1. ALSA config

Does playing a wave file with aplay work without any other arguments (i.e. aplay /path/to/file.wav)?
If that works fine, continue with 2. If not, your alsa config needs to be fixed. Try putting something like this in your ~/.asoundrc (or /etc/asound.conf):

pcm.rpiaudio
{
    type hw
    card 0
}
pcm.usbmic
{
    type hw
    card 1
}

pcm.!default
{
    type asym
    playback.pcm
    {
        type plug
        slave.pcm "rpiaudio"
    }
    capture.pcm 
    {
        type plug
        slave.pcm "usbmic"
    }
}

You may need to edit the card numbers. You can look them up by using cat /proc/asound/cards.

2. PyAudio Init

Did you execute the speaker.py module directly (without running jasper)? If that works fine, the Problem is probably with PyAudio being initialized multiple times. If not, continue with 3.

3. Slow SD card

Replace lines 52-55 with this:

frame_num = f.getnframes()
data = f.readframes(frame_num)
# The whole file has now been read into memory
stream.write(data)

If that works fine, the problem is probably a slow SD card.
If not, it might be some other strange issue. Maybe a bug in PyAudio/Portaudio (which version are you using anyway?)

Holzhaus · 2014-09-16T18:02:54Z

@crm416

Is there a reason why we need to do this play indirection for the eSpeakSpeaker? Why not just call eSpeak directly, like we did in the past?

The original code was also using the play() method:

def say(self, phrase, OPTIONS=" -vdefault+m3 -p 40 -s 160 --stdout > say.wav"):
        os.system("espeak " + json.dumps(phrase) + OPTIONS)
        self.play("say.wav")

def play(self, filename):
        os.system("aplay -D hw:1,0 " + filename)

Anyway, we can't rely on espeak detecting the correct audio setup. Furthermore, we might want to switch to the python-espeak module in the future.

Also, we want to use platform-independent output, so using plain aplay is not an option, either.

And last but not least: We need code to play wave files anyway, e.g. the beeps in Mic.activeListen() and the output of pico2wave (if the user does not like the sound of espeak).

Holzhaus · 2014-09-16T18:04:59Z

If we fail to fix the problem, I can live with changing the play() method to use aplay for the moment, but this is something we should definitely get rid of.

charliermarsh · 2014-09-19T01:48:07Z

Okay. aplay /path/to/file.wav sounds great. espeak "some text" sounds okay too. Solutions 2 and 3 didn't seem to have any affect. I'm on PyAudio version 0.2.8.

When I do run python speaker.py, I get:

ALSA lib pcm_dmix.c:1018:(snd_pcm_dmix_open) unable to open slave
ALSA lib pcm.c:2217:(snd_pcm_open_noupdate) Unknown PCM cards.pcm.rear
ALSA lib pcm.c:2217:(snd_pcm_open_noupdate) Unknown PCM cards.pcm.center_lfe
ALSA lib pcm.c:2217:(snd_pcm_open_noupdate) Unknown PCM cards.pcm.side
ALSA lib pcm.c:2217:(snd_pcm_open_noupdate) Unknown PCM cards.pcm.hdmi
ALSA lib pcm.c:2217:(snd_pcm_open_noupdate) Unknown PCM cards.pcm.hdmi
ALSA lib pcm.c:2217:(snd_pcm_open_noupdate) Unknown PCM cards.pcm.modem
ALSA lib pcm.c:2217:(snd_pcm_open_noupdate) Unknown PCM cards.pcm.modem
ALSA lib pcm.c:2217:(snd_pcm_open_noupdate) Unknown PCM cards.pcm.phoneline
ALSA lib pcm.c:2217:(snd_pcm_open_noupdate) Unknown PCM cards.pcm.phoneline
ALSA lib pcm_dmix.c:957:(snd_pcm_dmix_open) The dmix plugin supports only playback stream
ALSA lib pcm_dmix.c:1018:(snd_pcm_dmix_open) unable to open slave

I do not get these errors when I run alsa /path/to/file.

Holzhaus · 2014-09-19T06:47:01Z

This is kind of normal, as PyAudio likes to spam stderr.

Ill revert the play nethod to aplay for now.

… playback see jasperproject#188 for more info

…r.py

…ss names

Holzhaus · 2014-09-26T11:35:33Z

I rebased to latest upstream version to make this mergeable again.

Holzhaus · 2014-09-26T14:52:22Z

If nobody has comments or finds a problem, I'll merge this tomorrow.

Cleanup client.speaker and add additional tts speakers

charliermarsh reviewed Sep 8, 2014
View reviewed changes

Holzhaus added the enhancement label Sep 11, 2014

Holzhaus self-assigned this Sep 11, 2014

charliermarsh reviewed Sep 12, 2014
View reviewed changes

client/speaker.py

"""

import os

import platform

import re

import sys

import json

Copy link

charliermarsh Sep 12, 2014

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Unused import.

Holzhaus assigned charliermarsh and Holzhaus and unassigned Holzhaus and charliermarsh Sep 17, 2014

Holzhaus mentioned this pull request Sep 19, 2014

Use Platform-independent audio output #188

Closed

Holzhaus force-pushed the new-speakers branch from 1f2e8a0 to be241c8 Compare September 19, 2014 14:56

Holzhaus added 22 commits September 26, 2014 13:34

Add yaml import to speaker.py

7adb200

Use slug in Exception

cd3367c

Added __main__ testing code to client/speaker.py

367a073

Simplify is_available() method of osx-speaker

98ac755

Fix missing brackets in speaker.py testing code

b390603

Change AbstractSpeaker.play() to use aplay due to issues with PyAudio…

c00663d

… playback see jasperproject#188 for more info

Change AbstractMP3Speaker.play_mp3() to use AbstractSpeaker.play()

d0ab112

Add better TTS engine detection/testing code

af10754

Use distutils.spawn.find_executable instead of which

f73cf30

Use os.devnull instead of hardcoded /dev/null

16bea34

Added logging system to client/speaker.py

570e3bd

Added festival TTS engine

407e393

Added dummy tts engine (for testing purposes)

b194e99

Check for aplay in AbstractSpeaker (because of af31dc5)

7427780

Remove TTS_ENGINES constant from client/speaker.py

6f4d1c6

Move tts_engine_slug to jasper.py, improve module functions of speake…

dd9e464

…r.py

Move tts engine options from say method to __init__

2d36db2

Rename speaker.py to tts.py to match stt.py and also change cla…

645f91f

…ss names

Reuse speaker from mic in musicmode

5aeddb1

Added testcase for tts with DummyTTS engine

4a95d86

Get rid of pyaudio in tts.py

321925e

Delete tempfiles in play_mp3() method of AbstractMp3TTSEngine

bbc688e

Holzhaus force-pushed the new-speakers branch from 8d4c62a to bbc688e Compare September 26, 2014 11:34

Readd hardcoded alsa playback device (should be removed later)

49037ca

Holzhaus added a commit that referenced this pull request Sep 27, 2014

Merge pull request #155 from Holzhaus/new-speakers

5167f9f

Cleanup client.speaker and add additional tts speakers

Holzhaus merged commit 5167f9f into jasperproject:master Sep 27, 2014

Holzhaus removed the needstesting label Sep 27, 2014

Holzhaus deleted the new-speakers branch October 1, 2014 17:41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cleanup client.speaker and add additional tts speakers #155

Cleanup client.speaker and add additional tts speakers #155

Holzhaus commented Sep 6, 2014

charliermarsh Sep 8, 2014

Holzhaus commented Sep 9, 2014

charliermarsh commented Sep 9, 2014

Holzhaus commented Sep 10, 2014

Holzhaus commented Sep 11, 2014

Holzhaus commented Sep 12, 2014

charliermarsh commented Sep 12, 2014

charliermarsh commented Sep 12, 2014

charliermarsh Sep 12, 2014

charliermarsh commented Sep 12, 2014

Holzhaus commented Sep 13, 2014

Holzhaus commented Sep 15, 2014

Holzhaus commented Sep 15, 2014

charliermarsh commented Sep 15, 2014

charliermarsh commented Sep 16, 2014

Holzhaus commented Sep 16, 2014

Holzhaus commented Sep 16, 2014

Holzhaus commented Sep 16, 2014

charliermarsh commented Sep 19, 2014

Holzhaus commented Sep 19, 2014

Holzhaus commented Sep 26, 2014

Holzhaus commented Sep 26, 2014

Cleanup client.speaker and add additional tts speakers #155

Cleanup client.speaker and add additional tts speakers #155

Conversation

Holzhaus commented Sep 6, 2014

charliermarsh Sep 8, 2014

Choose a reason for hiding this comment

Holzhaus commented Sep 9, 2014

charliermarsh commented Sep 9, 2014

Holzhaus commented Sep 10, 2014

Holzhaus commented Sep 11, 2014

Holzhaus commented Sep 12, 2014

charliermarsh commented Sep 12, 2014

charliermarsh commented Sep 12, 2014

charliermarsh Sep 12, 2014

Choose a reason for hiding this comment

charliermarsh commented Sep 12, 2014

Holzhaus commented Sep 13, 2014

Holzhaus commented Sep 15, 2014

Holzhaus commented Sep 15, 2014

charliermarsh commented Sep 15, 2014

charliermarsh commented Sep 16, 2014

Holzhaus commented Sep 16, 2014

1. ALSA config

2. PyAudio Init

3. Slow SD card

Holzhaus commented Sep 16, 2014

Holzhaus commented Sep 16, 2014

charliermarsh commented Sep 19, 2014

Holzhaus commented Sep 19, 2014

Holzhaus commented Sep 26, 2014

Holzhaus commented Sep 26, 2014