Ovis2.5 Collection Our next-generation MLLMs for native-resolution vision and advanced reasoning • 5 items • Updated Aug 19 • 57
Time Travel: A Comprehensive Benchmark to Evaluate LMMs on Historical and Cultural Artifacts Paper • 2502.14865 • Published Feb 20 • 1
Fann or Flop: A Multigenre, Multiera Benchmark for Arabic Poetry Understanding in LLMs Paper • 2505.18152 • Published May 23 • 1
ARB: A Comprehensive Arabic Multimodal Reasoning Benchmark Paper • 2505.17021 • Published May 22 • 1
VideoMathQA: Benchmarking Mathematical Reasoning via Multimodal Understanding in Videos Paper • 2506.05349 • Published Jun 5 • 24