Re
AI
ty Check
Models
Challenges
Benchmarks
About
Submit Challenge
Models
Challenges
Benchmarks
About
Submit Challenge
bytedance-seed
Bytedance-seed
1 model tracked
Average resilience
77%
Tests Survived
106
Tests Failed
32
Toughest Breakers
Self-Reference Count
Self Reference
#1
Pass rate (provider)
0%
10-Step Instructions
Instruction Following
#2
Pass rate (provider)
0%
Contradictory Premises
Logic Reasoning
#3
Pass rate (provider)
0%
Models
BS
ByteDance Seed: Seed 1.6
bytedance-seed
#1
Survived
77%
Failure Rate
23%