Section 01
[Introduction] Core Introduction to the Open-Source Project of Pepper Robot Real-Time Multimodal Dialogue Framework
This article introduces the open-source Android framework pepper-android-realtime-chat, which deeply integrates end-to-end voice large models such as OpenAI Realtime API and Google Gemini Live with the Pepper humanoid robot, enabling functions like natural language-controlled navigation, visual analysis, and interactive entertainment. The project supports deployment on Pepper hardware and ordinary Android devices, was presented at the 2026 HRI Conference, and provides a complete open-source solution for human-robot interaction research.