MiniMax M3 Open Model Beats GPT-5.5 at 5-10% of the Cost

ContentBuffer

daily-hour-news·Jun 9, 2026

🤖MiniMax M3 Open Model Beats GPT-5.5 at 5-10% of the Cost

TL;DR

China's MiniMax released M3, an open-weight model that tops GPT-5.5 and Gemini 3.1 Pro on key benchmarks while costing 5-10% as much to run. It pairs a 1M-token context with sparse attention that cuts per-token compute to a twentieth of the prior generation.

MiniMax M3 Open Model Beats GPT-5.5 at 5-10% of the Cost — daily-hour-news

Key Points

1

Scores 59.0% on open-weight SWE-Bench Pro, rivaling proprietary leaders

2

1M-token context; sparse attention cuts per-token compute about 20x

3

Roughly 9x faster prefill and 15x faster decode vs the prior MiniMax model

4

Natively multimodal; pretraining corpus exceeds 100 trillion tokens

5

Open weights, aimed at making long-context agents economical to deploy

Why It Matters

Cheap, open, long-context models out of China keep dropping the price floor that US labs have to price against.

Quick Facts

MiniMaxopen weightsSWE-Benchlong contextChina AIsparse attentionLLM

Frequently Asked Questions

Why does this matter?

Cheap, open, long-context models out of China keep dropping the price floor that US labs have to price against.

What happened?

China's MiniMax released M3, an open-weight model that tops GPT-5.5 and Gemini 3.1 Pro on key benchmarks while costing 5-10% as much to run. It pairs a 1M-token context with sparse attention that cuts per-token compute to a twentieth of the prior generation.

🤖MiniMax M3 Open Model Beats GPT-5.5 at 5-10% of the Cost

Key Points

Why It Matters

Quick Facts

Frequently Asked Questions

Comments

Enjoyed this article?