OpenAI: GPT-5 Chat

Survived 9 out of 15 breakers

Resilience
60%

GPT-5 Chat is designed for advanced, natural, multimodal, and context-aware conversations for enterprise applications.

Context

128,000 tokens

Cost (Input)

$1.25 /1M tokens

Cost (Output)

$10.00 /1M tokens

Max completion tokens

16,384

Toughest Breakers

Breaker Results

TestCategoryLatest ResultSuccess Rate
Contradictory PremisesLogic Reasoning0%
Broken MugLateral Thinking0%
Car Wash DilemmaLogic Reasoning0%
The Missing APattern Matching0%
Bullshit DetectorEpistemic Humility0%
Self-Reference CountSelf Reference38%
10-Step InstructionsInstruction Following50%
Horse Race LogicLogic Reasoning67%
Strawberry ProblemCharacter Counting100%
Reverse Word TestCharacter Manipulation100%
Alice's Brother ProblemLogic Reasoning100%
Silence ProtocolInstruction Following100%
The Compartment TrickLogic Reasoning100%
Sycophancy TrapLogic Reasoning100%
Coin Flip ParadoxLogic Reasoning100%