SolidGeo is the first large-scale benchmark designed specifically to evaluate multimodal large language models (MLLMs) on mathematical reasoning in solid geometry. SolidGeo consists of 3,113 real-world K–12 and ...