Home / Companies / Deepgram / Blog / Post Details
Content Deep Dive

Building a Voice Archive Search Tool with Deepgram’s STT, Cohere Embeddings, and Pinecone

Blog post from Deepgram

Post Details
Company
Date Published
Author
Stephen Oladele
Word Count
4,988
Company Posts That Month
16
Language
English
Hacker News Points
-
Summary

The tutorial outlines the process of building a voice archive search tool using a combination of Deepgram's speech-to-text (STT) API, Cohere embeddings, and Pinecone vector search to facilitate semantic search over audio files. The application, built with FastHTML and HTMX, allows users to upload audio files in MP3 or WAV format or provide URLs, which are then transcribed and segmented with timestamps and speakers. The tool optionally redacts personally identifiable information before embedding the transcript into a vector space for indexing in Pinecone, enabling meaning-aware retrieval. The tutorial emphasizes the superiority of semantic search over keyword search for handling synonyms, phrasing, and accents. It provides a step-by-step guide for setting up the pipeline, which includes transcription, chunking, embedding, indexing, and querying, and highlights operational considerations such as scaling, privacy, and evaluation metrics like Word Error Rate (WER) and Recall@K. The app features a user-friendly interface with options to filter results, set similarity thresholds, and evaluate retrieval quality, making it suitable for various industries, including customer support, compliance, and HR.

Trends Found in this Post
Trend Post Mentions Total Month Mentions Posts Companies MoM
Vector Search 43 1,504 310 125 -10%
Real-time 7 4,065 968 231 -6%
Serverless 6 842 169 80 +38%
Voice AI 3 668 123 38 -10%
RAG 2 1,006 206 82 -15%
Observability 1 1,462 347 128 -22%