Gladia has launched its enterprise-grade Speech-to-Text API, which emphasizes accuracy and speed: it can transcribe one hour of audio in 60 seconds. Supported features include speaker diarization, word-level timestamps, code-switching, and beta translation in 99 languages. The API is designed for scalability and versatility, processing files of various sizes without restrictions, and it is offered at competitive, transparent prices under a pay-as-you-go model. Privacy and data security are prioritized, with full GDPR compliance and support for cloud, on-premise, and air-gapped hosting. The API is developer-friendly, compatible with all tech stacks, and comes with a dedicated playground for testing. Gladia plans to expand its offering with multilingual Audio Intelligence add-ons such as summarization and sentiment analysis, and it is also working on a proprietary large language model (LLM) to further enhance its AI capabilities.
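
To put the speed claim in concrete terms, one hour of audio in 60 seconds corresponds to processing roughly 60 times faster than real time. The sketch below only does that arithmetic. The function names are hypothetical and not part of Gladia's API, and the extrapolation to other durations assumes throughput scales linearly with audio length, which the announcement does not state.

```python
def real_time_factor(audio_seconds: float, processing_seconds: float) -> float:
    """Seconds of audio transcribed per second of wall-clock time."""
    if processing_seconds <= 0:
        raise ValueError("processing_seconds must be positive")
    return audio_seconds / processing_seconds


def estimated_processing_seconds(audio_seconds: float, speedup: float) -> float:
    """Estimate transcription time for a file.

    Assumption: throughput is constant regardless of file length.
    """
    if speedup <= 0:
        raise ValueError("speedup must be positive")
    return audio_seconds / speedup


# Figures from the announcement: 1 hour of audio in 60 seconds.
speedup = real_time_factor(audio_seconds=3600, processing_seconds=60)

# Hypothetical extrapolation to a 30-minute recording.
half_hour_estimate = estimated_processing_seconds(audio_seconds=1800, speedup=speedup)

print(f"Speed-up over real time: {speedup:.0f}x")
print(f"Estimated time for 30 minutes of audio: {half_hour_estimate:.0f} s")
```

Actual latency would also depend on upload time, queueing, and which features are enabled, so the estimate is best read as a lower bound.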