Reachy Mini โ€” Gemini Live Edition
Fork of pollen-robotics/reachy_mini_conversation_app ยท Gemini Live ยท Persistent memory ยท MCP tools

Gemini Live Edition

Reachy Mini that remembers you.

Built on Pollen Robotics' conversation app, this fork builds upon the base app's Gemini Live support and extends it to include persistent per-user memory, MCP tool servers, Home Assistant home control, and an offline wake word โ€” all managed from a browser.

Gemini Live API Vertex AI Agent Engine memory Hey Reachy wake word Home Assistant control
Reachy Mini dancing

All the motion and personality of the base app โ€” plus memory, tools, and a wake word.

Fork additions

What this adds over the base app

Everything in pollen-robotics/reachy_mini_conversation_app still works. These are the new layers on top.

๐Ÿง 

Persistent memory per user

Vertex AI Agent Engine stores facts about each person across sessions. Reachy greets you by name, picks up mid-topic, and keeps each household member's context private. Recency-weighted retrieval surfaces what matters now; Reachy can call recall mid-conversation, or remember(fact, lifetime_days=โ€ฆ) for ephemeral facts that auto-expire.

โ†— Vertex AI Agent Engine โ†— Memory bank pattern
๐Ÿงน

Memory hygiene

One-click Compact memories in the Personality Studio runs an LLM-driven dedup pass over the active user's facts: deletes duplicates, merges overlaps, drops expired ephemerals. Preview (red = delete, teal = merge) before anything is touched, then click Apply to execute exactly what you reviewed.

๐Ÿ”

Google Search grounding

Per-profile toggle. When on, Gemini retrieves live search results before answering time-sensitive questions โ€” weather, news, sports scores, flight status.

โ†— Gemini grounding docs
๐Ÿ”Œ

MCP servers โ€” extended tool support

Added MCP server support, extending the tool ecosystem beyond built-in tools. Each profile defines its own servers with secure credential injection. Home Assistant, weather, calendar โ€” swap tools per persona without touching code.

โ†— Model Context Protocol
๐Ÿ 

Home Assistant control

Lights, climate, media, locks, shades, sprinklers, Google Calendar โ€” all via natural voice through the HA MCP server. No entity IDs exposed to the user.

โ†— HA MCP Server
๐Ÿ˜ด

Wake word + sleep modes

Say "Hey Reachy" to wake from sleep โ€” a rooster crow chirps when detected. Tell Reachy "good night" or "you can rest now" and it ends the session itself. Auto-sleeps after a configurable idle window (1โ€“60 min, default 10) โ€” Gemini billing stops, Edge Impulse model detects the wake word offline on-device.

โ†— luisomoreau/hey_reachy_wake_word_detection
๐Ÿ“บ

Live browser monitor

1fps camera feed, real-time transcript stream, activity log, and wake/sleep controls โ€” all in the browser at http://ROBOT_IP:7860. Not in the base app.

โ†— Gemini Live API
๐ŸŽญ

Personality Studio

Headless web profile editor at http://ROBOT_IP:7860 โ€” edit system prompt, voice (30 Gemini voices with audio preview), tools, MCP servers, and per-profile feature flags from your phone. No SSH, no file editing. Save startup profile, apply live without restart.

๐Ÿ‘ค

Multi-user awareness

The identify_user tool uses the camera to recognize who's speaking and switches the active memory scope mid-conversation. Each household member gets their own private context โ€” preferences, ongoing topics, calendar, all kept separate. One robot, every person remembered individually.

Memory architecture

How Vertex AI Agent Engine works

Three-stage pipeline: semantic recall at session start, fact capture mid-conversation, transcript ingestion at session end. Pattern from Shubhamsaboo/openclaw-vertexai-memorybank.

Vertex AI Agent Engine memory architecture

Multi-user family assistant

One robot, every household member remembered

  • ๐Ÿ“ธ Camera identifies who's there at session start.
  • ๐Ÿง  Loads that person's memories โ€” preferences, projects, ongoing topics.
  • ๐Ÿ”„ Switches context mid-conversation when someone else walks up.
  • ๐Ÿ”’ Each person's memories are scoped privately by user + profile.

Built on Pollen Robotics' foundation

All the original features, plus a Google Cloud layer

The base app's voice loop, dance library, emotion system, camera tools, and profile UI are all here. This fork adds Gemini Live as the sole backend, Vertex AI for memory, and MCP for extensible home control.

Gemini Live API Agent Engine memory MCP servers Edge Impulse wake word HA integration Live monitor