DS1 spectrogram: RoboChallenge: Large-scale Real-robot Evaluation of Embodied Policies

RoboChallenge: Large-scale Real-robot Evaluation of Embodied Policies

2510.17950

Authors

Tiezhen Wang,Yajun Wei,Youqiang Gui,Yunchao Ma,Bin Xie

Abstract

Testing on real machines is indispensable for robotic control algorithms. In the context of learning-based algorithms, especially VLA models, demand for large-scale evaluation, i.e.

testing a large number of models on a large number of tasks, is becoming increasingly urgent. However, doing this right is highly non-trivial, especially when scalability and reproducibility is taken into account.

In this report, we describe our methodology for constructing RoboChallenge, an online evaluation system to test robotic control algorithms, and our survey of recent state-of-the-art VLA models using our initial benchmark Table30.

Resources

Stay in the loop

Every AI paper that matters, free in your inbox daily.

Details

  • © 2026 takara.ai Ltd
  • Content is sourced from third-party publications.