Eyes, ears, and a voice: building Reachy Mini's media stack
Blog post from HuggingFace
Reachy Mini, a robot developed by Pollen Robotics, interacts with its surroundings using audio and video processing capabilities facilitated by its Raspberry Pi Camera 3 Wide and custom microphone array. The design allows seamless local and remote usage, with a consistent API for both the Lite and Wireless versions. GStreamer is employed for media handling, enabling direct AI app development using the robot's audio and video streams on various platforms, including laptops and Hugging Face Spaces. WebRTC facilitates low-latency streaming and control, while the SDK simplifies installation and usage. This setup supports advanced applications like speech recognition and object tracking, offering flexibility between local and remote processing. The design ensures minimal latency for real-time interactions, with open-source code available for further development.