Gemini Live Edition
Built on Pollen Robotics' conversation app, this fork builds upon the base app's Gemini Live support and extends it to include persistent per-user memory, MCP tool servers, Home Assistant home control, and an offline wake word โ all managed from a browser.
All the motion and personality of the base app โ plus memory, tools, and a wake word.
Fork additions
Everything in pollen-robotics/reachy_mini_conversation_app still works. These are the new layers on top.
Vertex AI Agent Engine stores facts about each person across sessions. Reachy greets you by name, picks up mid-topic, and keeps each household member's context private. Recency-weighted retrieval surfaces what matters now; Reachy can call recall mid-conversation, or remember(fact, lifetime_days=โฆ) for ephemeral facts that auto-expire.
One-click Compact memories in the Personality Studio runs an LLM-driven dedup pass over the active user's facts: deletes duplicates, merges overlaps, drops expired ephemerals. Preview (red = delete, teal = merge) before anything is touched, then click Apply to execute exactly what you reviewed.
Per-profile toggle. When on, Gemini retrieves live search results before answering time-sensitive questions โ weather, news, sports scores, flight status.
โ Gemini grounding docsAdded MCP server support, extending the tool ecosystem beyond built-in tools. Each profile defines its own servers with secure credential injection. Home Assistant, weather, calendar โ swap tools per persona without touching code.
โ Model Context ProtocolLights, climate, media, locks, shades, sprinklers, Google Calendar โ all via natural voice through the HA MCP server. No entity IDs exposed to the user.
โ HA MCP ServerSay "Hey Reachy" to wake from sleep โ a rooster crow chirps when detected. Tell Reachy "good night" or "you can rest now" and it ends the session itself. Auto-sleeps after a configurable idle window (1โ60 min, default 10) โ Gemini billing stops, Edge Impulse model detects the wake word offline on-device.
โ luisomoreau/hey_reachy_wake_word_detection1fps camera feed, real-time transcript stream, activity log, and wake/sleep controls โ all in the browser at http://ROBOT_IP:7860. Not in the base app.
Headless web profile editor at http://ROBOT_IP:7860 โ edit system prompt, voice (30 Gemini voices with audio preview), tools, MCP servers, and per-profile feature flags from your phone. No SSH, no file editing. Save startup profile, apply live without restart.
The identify_user tool uses the camera to recognize who's speaking and switches the active memory scope mid-conversation. Each household member gets their own private context โ preferences, ongoing topics, calendar, all kept separate. One robot, every person remembered individually.
Memory architecture
Three-stage pipeline: semantic recall at session start, fact capture mid-conversation, transcript ingestion at session end. Pattern from Shubhamsaboo/openclaw-vertexai-memorybank.
Multi-user family assistant
Built on Pollen Robotics' foundation
The base app's voice loop, dance library, emotion system, camera tools, and profile UI are all here. This fork adds Gemini Live as the sole backend, Vertex AI for memory, and MCP for extensible home control.