MiniMax: MiniMax M2.1

Survived 9 out of 15 breakers

Resilience
60%

MiniMax-M2.1 is a lightweight, state-of-the-art large language model optimized for coding, agentic workflows, and modern application development. With only 10 billion activated parameters, it delivers a major jump in real-world capability while maintaining exceptional latency, scalability, and cost efficiency. Compared to its predecessor, M2.1 delivers cleaner, more concise outputs and faster perceived response times. It shows leading multilingual coding performance across major systems and application languages, achieving 49.4% on Multi-SWE-Bench and 72.5% on SWE-Bench Multilingual, and serves as a versatile agent “brain” for IDEs, coding tools, and general-purpose assistance. To avoid degrading this model's performance, MiniMax highly recommends preserving reasoning between turns. Learn more about using reasoning_details to pass back reasoning in our [docs](https://openrouter.ai/docs/use-cases/reasoning-tokens#preserving-reasoning-blocks).

Context

196,608 tokens

Cost (Input)

$0.27 /1M tokens

Cost (Output)

$0.95 /1M tokens

Max completion tokens

Toughest Breakers

Breaker Results

TestCategoryLatest ResultSuccess Rate
Contradictory PremisesLogic Reasoning0%
Car Wash DilemmaLogic Reasoning0%
The Missing APattern Matching0%
Self-Reference CountSelf Reference10%
10-Step InstructionsInstruction Following11%
Silence ProtocolInstruction Following67%
Broken MugLateral Thinking75%
Horse Race LogicLogic Reasoning75%
Coin Flip ParadoxLogic Reasoning75%
Reverse Word TestCharacter Manipulation89%
Strawberry ProblemCharacter Counting90%
Alice's Brother ProblemLogic Reasoning100%
Bullshit DetectorEpistemic Humility100%
The Compartment TrickLogic Reasoning100%
Sycophancy TrapLogic Reasoning100%