Re
AI
ty Check
Models
Challenges
Benchmarks
About
Submit Challenge
Models
Challenges
Benchmarks
About
Submit Challenge
qwen
Qwen
1 model tracked
Average resilience
82%
Tests Survived
98
Tests Failed
22
Toughest Breakers
Self-Reference Count
Self Reference
#1
Pass rate (provider)
0%
10-Step Instructions
Instruction Following
#2
Pass rate (provider)
0%
The Missing A
Pattern Matching
#3
Pass rate (provider)
0%
Models
QQ
Qwen: Qwen3.5 397B A17B
qwen
#1
Survived
82%
Failure Rate
18%