Speechmatics, an AI company, aims to build a seamless AI stack called Speech Intelligence that connects the latest AI technologies to the spoken world. Its focus is on improving speech recognition accuracy, on the grounds that gains at the recognition layer compound through the rest of the stack. The company reports a 50% reduction in Word Error Rate (WER) for its best-performing model over the last two years. To get there, Speechmatics is investing in self-supervised learning to tackle global transcription, where the main obstacle is the scarcity of labeled data, especially for diverse languages and speakers. The company believes that superb accuracy in any language will always be the preferred choice for conversational AI stacks, and it plans to pursue this goal with its next-generation self-supervised models and with capabilities that operate over one or more transcripts. Speechmatics also envisions a future in which people interact with technology seamlessly by voice, without latency or misheard words, and says it is committed to investing in the paradigm changes needed to make that vision a reality.
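For readers unfamiliar with the metric, WER is conventionally defined as the number of word substitutions, deletions, and insertions needed to turn the recognized text into the reference transcript, divided by the number of words in the reference. The sketch below illustrates that standard definition only; it is not Speechmatics' evaluation code, and the function names `word_error_rate` and `relative_reduction` are hypothetical. Real evaluations usually normalize text (casing, punctuation, numerals) before scoring, which this sketch does not attempt.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


def relative_reduction(old_wer: float, new_wer: float) -> float:
    """Fractional drop in WER between two models (0.5 means a 50% reduction)."""
    if old_wer <= 0:
        raise ValueError("old_wer must be positive")
    return (old_wer - new_wer) / old_wer


if __name__ == "__main__":
    # One dropped word out of six reference words.
    print(word_error_rate("the cat sat on the mat", "the cat sat on mat"))
    # Hypothetical figures, chosen only to show what a 50% reduction means.
    print(relative_reduction(0.10, 0.05))
```

Under this definition, a 50% reduction means the newer model makes half as many word-level errors per reference word as the older one on the same test set. It says nothing about the absolute error level.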