Re
AI
ty Check
Models
Challenges
Benchmarks
About
Submit Challenge
Models
Challenges
Benchmarks
About
Submit Challenge
mistralai
Mistralai
1 model tracked
Average resilience
51%
Tests Survived
71
Tests Failed
67
Toughest Breakers
Self-Reference Count
Self Reference
#1
Pass rate (provider)
0%
10-Step Instructions
Instruction Following
#2
Pass rate (provider)
0%
Silence Protocol
Instruction Following
#3
Pass rate (provider)
0%
Models
MM
Mistral: Mistral Large 3 2512
mistralai
#1
Survived
51%
Failure Rate
49%