Replies: 4 comments 4 replies
-
I've had a go at Dockerising openWakeWord, still a WIP (and I need to build and upload images onto Dockerhub), but the basic UDP audio receiving from Rhasspy, and triggering wake word via MQTT(Hermes protocol) is working. |
Beta Was this translation helpful? Give feedback.
-
Making a Docker image for openWakeWord was definitely somewhere on the backlog, and I certainly agree with your line of thinking regarding integrating with other libraries and frameworks (Rhasspy, OVOS, etc.). Thanks for creating an initial Docker file and associated serving code, it looks quite good! Once there are some features in openWakeWord that will benefit from a web UI it should be an simple extension to add that to the Docker file so it's easily accessible. Would you mind if I linked to your repo in the openWakeWord Readme as an example of other deployment options? |
Beta Was this translation helpful? Give feedback.
-
Sure! Next I want to add ability to have multiple input audio threads, one for each satellite. Possibly also be able to receive audio over MQTT Hermes... although I am less sure of its use, maybe it's easier for users to set up because they don't need to fiddle with UDP ports, and using MQTT we could auto-discover satellites. I haven't looked into OVOS, or had a solid explore of Rhasspy 3 yet. I'm more than happy to take input on what would be useful to add to this "wrapper". |
Beta Was this translation helpful? Give feedback.
-
Quick update:
|
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
I run Rhasspy in Base/Satellite mode. Currently each Satellite captures audio, does the wake word detection and streams audio to the Base which does everything else. My Pi4 satellite runs the Rhasspy Docker container, launched with compose. My Base Rhasspy container is runon a much more pwerful i7 with loads of RAM (also runs other home automation software.)
I've been thinking of Docerising OWW, and then using Rhasspy 2.5's Hermes MQTT to send data from the Rhasspy container to OWW container, do the detection and send the result back on MQTT again. For Rhasspy 3 we could build an adapter that sends audio via e.g. a UDP or websocket stream.
Other API options could also be provided for those who don't use Rhasspy.
Running OWW in Docker eases distribution and setup (Python dependencies), allows OWW to develop at a separate pace to Rhasspy (instead of being bundled and released with Rhasspy). The single instance of OWW container could be run on a more powerful base station PC which all satellites use for wake word detection, allowing lower power satellites (ESP32) richer wake word options.
OWW could present a web UI for configuration: which words to detect, thresholds, custom verifier models and maybe speaker identification
It could also have a live demo built in for testing and diagnostics.
@dscripka what have your thoughts been on integrating OWW and Rhasspy more easily?
Beta Was this translation helpful? Give feedback.
All reactions