

Google has released Gemini 3 Flash, the latest model in the Gemini 3 series, joining Gemini 3 Pro and Gemini 3 Deep Think. Designed for speed, efficiency, and lower token costs, the new AI model is aimed at both users and developers. Google claims Gemini 3 Flash can outperform Gemini 3 Pro in coding-related tasks and delivers better performance than the entire Gemini 2.5 series.
According to Google, Gemini 3 Flash is more capable while maintaining low latency and affordability, a key trait of Flash models. It is globally available via the Gemini app and website, AI Mode in Search, and for developers through the Gemini API in Google AI Studio, Gemini CLI, Antigravity, Vertex AI, and Gemini Enterprise. Internal benchmarks show strong results across reasoning, coding, and multimodal tasks, with performance rivaling or exceeding Gemini 3 Pro in several areas.
The model can handle complex queries with longer reasoning while using about 30 percent fewer tokens than Gemini 2.5 Pro, improving efficiency and cost-effectiveness. Pricing is set at $0.50 per million input tokens and $3 per million output tokens, with audio input priced at $1 per million tokens. While it is significantly cheaper than Gemini 3 Pro, Gemini 3 Flash is slightly more expensive than the 2.5 Pro variant.












Comments (0)
No comments yet
Be the first to comment!