✨Google Launches Gemini 3.1 Flash-Lite at $0.25 per Million Input Tokens
Google Launches Gemini 3.1 Flash-Lite at $0.25 per Million …
TL;DR
Google introduced Gemini 3.1 Flash-Lite, an efficiency-focused model delivering 2.5x faster response times and 45% faster output generation compared with earli…
Google introduced Gemini 3.1 Flash-Lite, an efficiency-focused model delivering 2.5x faster response times and 45% faster output generation compared with earlier Gemini versions. The model is priced at just $0.25 per million input tokens, undercutting comparable Western models.

Key Points
2.5x faster response times
45% faster output generation
Priced at $0.25 / 1M input tokens
Why It Matters
Aggressive low-cost pricing pushes the price floor for high-volume inference workloads and intensifies pressure on OpenAI and Anthropic to match on cost-efficient tiers.
Frequently Asked Questions
Why does this matter?
Aggressive low-cost pricing pushes the price floor for high-volume inference workloads and intensifies pressure on OpenAI and Anthropic to match on cost-efficient tiers.
What happened?
Google introduced Gemini 3.1 Flash-Lite, an efficiency-focused model delivering 2.5x faster response times and 45% faster output generation compared with earli…
Comments
Be the first to comment
Enjoyed this article?
Get it daily. 7am. Free. Reads in 5 minutes.