Home / Companies / AssemblyAI / Blog / Post Details
Content Deep Dive

Build a Daily.co voice agent with AssemblyAI's Voice Agent API

Blog post from AssemblyAI

Post Details
Company
Date Published
Author
Kelsey Foster
Word Count
1,615
Language
English
Hacker News Points
-
Summary

This tutorial demonstrates how to create a server-side voice agent using Daily.co and AssemblyAI's Voice Agent API, enabling a bot to join a WebRTC room, listen to participants, and respond with a real voice through a single WebSocket connection. Leveraging the daily-python SDK, this setup simplifies the typical voice-agent stack by integrating Daily.co's WebRTC infrastructure for managing rooms and participants with AssemblyAI's comprehensive AI capabilities for speech recognition, language model processing, and text-to-speech conversion. The process involves configuring a Daily.co room and AssemblyAI API keys, setting up a virtual microphone for publishing audio, and ensuring proper audio resampling between Daily.co’s 16 kHz and AssemblyAI's 24 kHz formats. The system supports multi-participant interactions, telephony integration, and includes options for tuning voice settings and handling interruptions, with troubleshooting guidance provided for common setup issues.