Gemini 2.5 Flash API - Pricing, Quickstart & Provider Comparison
Blog post from OpenRouter
Gemini 2.5 Flash is a versatile model developed by Google for high-volume, latency-sensitive tasks requiring reasoning, with capabilities to process text, code, images, audio, video, and documents. It introduces a unique feature called "thinking," allowing users to control the model's reasoning depth through a thinking budget parameter, which can be adjusted to balance response quality, speed, and cost. The model is accessible via Google AI Studio, Vertex AI, and OpenRouter, each offering different pricing structures and functionalities. OpenRouter provides a seamless integration experience by routing requests through multiple Google providers, ensuring high availability and enabling easy model switching without code changes. While it supports a wide range of inputs, Gemini 2.5 Flash is limited to text output and lacks capabilities for audio and image generation, with a separate model required for the latter. Scheduled for discontinuation in October 2026, users are advised to plan migrations to successor models for long-term projects.