WorldOlympiad: Can Your World Model Survive a Triathlon?
WorldOlympiad: Can Your World Model Survive a Triathlon?
要約
We introduce WorldOlympiad, a benchmark for diagnosing video-based world models across physical faithfulness, geometric consistency, and interaction fidelity. While existing benchmarks often focus on visual quality, semantic alignment, or short-term temporal coherence, they provide limited insight i…