Google Blog·

Google Launches Gemini 3.1 Flash-Lite at $0.25 per Million Input Tokens

Google Launches Gemini 3.1 Flash-Lite at $0.25 per Million …

TL;DR

Google introduced Gemini 3.1 Flash-Lite, an efficiency-focused model delivering 2.5x faster response times and 45% faster output generation compared with earli…

Google introduced Gemini 3.1 Flash-Lite, an efficiency-focused model delivering 2.5x faster response times and 45% faster output generation compared with earlier Gemini versions. The model is priced at just $0.25 per million input tokens, undercutting comparable Western models.

Google Launches Gemini 3.1 Flash-Lite at $0.25 per Million Input Tokens — Google Blog

Key Points

1

2.5x faster response times

2

45% faster output generation

3

Priced at $0.25 / 1M input tokens

Why It Matters

Aggressive low-cost pricing pushes the price floor for high-volume inference workloads and intensifies pressure on OpenAI and Anthropic to match on cost-efficient tiers.

GoogleGeminipricinginference

Frequently Asked Questions

Why does this matter?

Aggressive low-cost pricing pushes the price floor for high-volume inference workloads and intensifies pressure on OpenAI and Anthropic to match on cost-efficient tiers.

What happened?

Google introduced Gemini 3.1 Flash-Lite, an efficiency-focused model delivering 2.5x faster response times and 45% faster output generation compared with earli…

Comments

Subscribe to join the conversation...

Be the first to comment

Enjoyed this article?

Get it daily. 7am. Free. Reads in 5 minutes.