AI Gateway production index
Blog post from Vercel
In the rapidly evolving AI industry, different models excel in varied use cases, as evidenced by data from Vercel's AI Gateway, which serves tens of trillions of tokens across numerous models. Anthropic leads in spending due to high-stakes applications, while Google dominates in token volume with its consumer-focused, cost-efficient models. OpenAI's share is increasing, driven by recent model updates, highlighting the dynamic nature of model adoption. The data reveals that AI workloads are becoming more agentic, with tool-using requests carrying a significantly higher token volume. Teams handling large-scale requests utilize an average of 35 models, reflecting a shift towards flexible, multi-model architectures that allow rapid adaptation to new releases and circumvent provider outages. This approach underscores the importance of designing AI systems based on workload efficiency and reliability rather than allegiance to specific providers, drawing parallels to early cloud computing strategies.