Build a Voice-Controlled GitHub Agent in Python (MCP + Vision Agents)
Blog post from Stream
This integration turns any GitHub repository into a voice-controlled assistant using OpenAI's Realtime API, GitHub's Model Context Protocol (MCP), and Vision Agents. Users can query branches, manage pull requests, and list contributors through natural conversation.

The assistant authenticates to GitHub with a personal access token, uses the OpenAI Realtime API for low-latency speech processing, and relies on Vision Agents for orchestration and real-time function calling. In the demo, the assistant recognizes spoken repository names and answers questions about repository details.

Setup is hands-on but simple: configure a few environment variables and run a single Python script. The resulting stack delivers sub-second voice response times and uses GitHub MCP for structured repository access without custom API wrappers, all inside a browser-based video-call interface.
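Since the setup boils down to exporting credentials before launching the script, a minimal sketch of loading and validating them in Python may help. The specific variable names here are assumptions for illustration (the post does not list them); only `os` from the standard library is used:

```python
import os

# Illustrative names: an OpenAI key for the Realtime API, a GitHub
# personal access token for MCP, and a Stream key for the video call.
REQUIRED_VARS = [
    "OPENAI_API_KEY",
    "GITHUB_PERSONAL_ACCESS_TOKEN",
    "STREAM_API_KEY",
]

def load_config(env=None):
    """Collect the credentials the agent needs, failing fast if any are missing."""
    env = os.environ if env is None else env
    missing = [name for name in REQUIRED_VARS if not env.get(name)]
    if missing:
        raise RuntimeError(
            "Missing environment variables: " + ", ".join(missing)
        )
    return {name: env[name] for name in REQUIRED_VARS}
```

Failing fast like this before the agent starts avoids a confusing mid-call error when the first GitHub MCP tool call or Realtime API request is rejected for bad credentials.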