Home / Companies / AssemblyAI / Blog / Post Details
Content Deep Dive

Build a Daily.co voice agent with AssemblyAI's Voice Agent API

Blog post from AssemblyAI

Post Details
Company
Date Published
Author
Kelsey Foster
Word Count
1,615
Company Posts That Month
40
Language
English
Hacker News Points
-
Summary

This tutorial demonstrates how to create a server-side voice agent using Daily.co and AssemblyAI's Voice Agent API, enabling a bot to join a WebRTC room, listen to participants, and respond with a real voice through a single WebSocket connection. Leveraging the daily-python SDK, this setup simplifies the typical voice-agent stack by integrating Daily.co's WebRTC infrastructure for managing rooms and participants with AssemblyAI's comprehensive AI capabilities for speech recognition, language model processing, and text-to-speech conversion. The process involves configuring a Daily.co room and AssemblyAI API keys, setting up a virtual microphone for publishing audio, and ensuring proper audio resampling between Daily.co’s 16 kHz and AssemblyAI's 24 kHz formats. The system supports multi-participant interactions, telephony integration, and includes options for tuning voice settings and handling interruptions, with troubleshooting guidance provided for common setup issues.

Trends Found in this Post
Trend Post Mentions Total Month Mentions Posts Companies MoM
Voice AI 28 3,462 242 43 +46%
LLM 6 9,074 1,640 224 +53%
Real-time 3 5,735 1,391 247 -9%