MetaVQA
A Benchmark for Embodied Scene Understanding of Vision-Language Models
Please see MetaVQA official Website for details!
A Benchmark for Embodied Scene Understanding of Vision-Language Models
Please see MetaVQA official Website for details!