Overview
Themoorcheh-edge voice commands run speech-to-text (Whisper), text-to-speech (Piper), and a voice RAG loop on the same machine as Moorcheh Edge - for example an Arduino UNO Q with a USB mic and speaker.
Voice commands require Linux with ALSA audio. They are not supported on Windows or macOS. On a PC demo, use browser-based voice in your app instead.
Prerequisites
- Moorcheh Edge running with RAG:
moorcheh-edge up --with-llm --warm-llm -y - One-time voice setup:
moorcheh-edge voice setup - Optional:
moorcheh-edge voice cache-holdingfor kiosk welcome audio
Command groups
| Command | Purpose |
|---|---|
setup | Install Whisper, Piper, whisper.cpp, and audio deps |
cache-holding | Pre-generate the fixed kiosk holding WAV |
check | List ALSA devices and probe mic/speaker |
listen | Record from mic and print transcribed text |
speak | Synthesize and play text on the speaker |
transcribe | Transcribe an existing WAV file |
ask | Record → RAG answer → speak (local kiosk loop) |
serve | HTTP server on :8766 for remote UIs (mic/speaker on device) |
Typical workflow (UNO Q)
up).
Terminal 2 - voice HTTP server for a remote display PC:
HTTP API
When you runvoice serve, the CLI starts a small HTTP server (default http://0.0.0.0:8766) documented in Voice server API.