The Anthropic API has made several updates to optimize throughput and reduce token usage with the release of Claude 3.7 Sonnet, including cache-aware rate limits, simpler prompt caching, and token-efficient tool use. These updates aim to help developers process more requests within their existing rate limits while reducing costs. The new features include prompt caching, which reduces costs by up to 90% and latency by up to 85%, as well as a text_editor tool designed for collaborative document editing workflows. Early users such as Cognition are already leveraging these updates to improve token efficiency and response quality. These features are available today with minimal code changes, allowing developers to take advantage of the new capabilities.