Re
AI
ty Check
Models
Challenges
Benchmarks
About
Submit Challenge
Models
Challenges
Benchmarks
About
Submit Challenge
nvidia
Nvidia
1 model tracked
Average resilience
59%
Tests Survived
218
Tests Failed
153
Toughest Breakers
Self-Reference Count
Self Reference
#1
Pass rate (provider)
0%
10-Step Instructions
Instruction Following
#2
Pass rate (provider)
0%
Reverse Word Test
Character Manipulation
#3
Pass rate (provider)
0%
Models
NN
NVIDIA: Nemotron 3 Nano 30B A3B (free)
nvidia
#1
Survived
59%
Failure Rate
41%