Skip to content
daily-hour-news·

🤖MiniMax M3 Open Model Beats GPT-5.5 at 5-10% of the Cost

TL;DR

China's MiniMax released M3, an open-weight model that tops GPT-5.5 and Gemini 3.1 Pro on key benchmarks while costing 5-10% as much to run. It pairs a 1M-token context with sparse attention that cuts per-token compute to a twentieth of the prior generation.

China's MiniMax released M3, an open-weight model that tops GPT-5.5 and Gemini 3.1 Pro on key benchmarks while costing 5-10% as much to run. It pairs a 1M-token context with sparse attention that cuts per-token compute to a twentieth of the prior generation.

MiniMax M3 Open Model Beats GPT-5.5 at 5-10% of the Cost — daily-hour-news

Key Points

1

Scores 59.0% on open-weight SWE-Bench Pro, rivaling proprietary leaders

2

1M-token context; sparse attention cuts per-token compute about 20x

3

Roughly 9x faster prefill and 15x faster decode vs the prior MiniMax model

4

Natively multimodal; pretraining corpus exceeds 100 trillion tokens

5

Open weights, aimed at making long-context agents economical to deploy

Why It Matters

Cheap, open, long-context models out of China keep dropping the price floor that US labs have to price against.

Quick Facts

MiniMaxopen weightsSWE-Benchlong contextChina AIsparse attentionLLM

Frequently Asked Questions

Why does this matter?

Cheap, open, long-context models out of China keep dropping the price floor that US labs have to price against.

What happened?

China's MiniMax released M3, an open-weight model that tops GPT-5.5 and Gemini 3.1 Pro on key benchmarks while costing 5-10% as much to run. It pairs a 1M-token context with sparse attention that cuts per-token compute to a twentieth of the prior generation.

Comments

Subscribe to join the conversation...

Be the first to comment

Enjoyed this article?

Get it daily. 7am. Free. Reads in 5 minutes.

Join 1,970 builders reading daily.