Home / Companies / Vercel / Blog / Post Details
Content Deep Dive

AI Gateway production index

Blog post from Vercel

Post Details
Company
Date Published
Author
Harpreet Arora
Word Count
1,915
Language
English
Hacker News Points
-
Summary

In the rapidly evolving AI industry, different models excel in varied use cases, as evidenced by data from Vercel's AI Gateway, which serves tens of trillions of tokens across numerous models. Anthropic leads in spending due to high-stakes applications, while Google dominates in token volume with its consumer-focused, cost-efficient models. OpenAI's share is increasing, driven by recent model updates, highlighting the dynamic nature of model adoption. The data reveals that AI workloads are becoming more agentic, with tool-using requests carrying a significantly higher token volume. Teams handling large-scale requests utilize an average of 35 models, reflecting a shift towards flexible, multi-model architectures that allow rapid adaptation to new releases and circumvent provider outages. This approach underscores the importance of designing AI systems based on workload efficiency and reliability rather than allegiance to specific providers, drawing parallels to early cloud computing strategies.