Anthropic: Claude Haiku 4.5

Survived 7 out of 15 breakers

Resilience
47%

Claude Haiku 4.5 is Anthropic’s fastest and most efficient model, delivering near-frontier intelligence at a fraction of the cost and latency of larger Claude models. Matching Claude Sonnet 4’s performance across reasoning, coding, and computer-use tasks, Haiku 4.5 brings frontier-level capability to real-time and high-volume applications. It introduces extended thinking to the Haiku line; enabling controllable reasoning depth, summarized or interleaved thought output, and tool-assisted workflows with full support for coding, bash, web search, and computer-use tools. Scoring >73% on SWE-bench Verified, Haiku 4.5 ranks among the world’s best coding models while maintaining exceptional responsiveness for sub-agents, parallelized execution, and scaled deployment.

Context

200,000 tokens

Cost (Input)

$1.00 /1M tokens

Cost (Output)

$5.00 /1M tokens

Max completion tokens

64,000

Toughest Breakers

Breaker Results

TestCategoryLatest ResultSuccess Rate
Silence ProtocolInstruction Following0%
Broken MugLateral Thinking0%
Car Wash DilemmaLogic Reasoning0%
The Missing APattern Matching0%
Self-Reference CountSelf Reference11%
10-Step InstructionsInstruction Following11%
Horse Race LogicLogic Reasoning25%
Alice's Brother ProblemLogic Reasoning44%
Reverse Word TestCharacter Manipulation56%
Contradictory PremisesLogic Reasoning78%
Strawberry ProblemCharacter Counting100%
Bullshit DetectorEpistemic Humility100%
The Compartment TrickLogic Reasoning100%
Sycophancy TrapLogic Reasoning100%
Coin Flip ParadoxLogic Reasoning100%