MM-EVAL - an Oliver2021 Collection

updated Jun 11

MMRA: A Benchmark for Multi-granularity Multi-image Relational Association

Paper • 2407.17379 • Published Jul 24, 2024 • 3

MMSearch: Benchmarking the Potential of Large Models as Multi-modal Search Engines

Paper • 2409.12959 • Published Sep 19, 2024 • 37

MMMR: Benchmarking Massive Multi-Modal Reasoning Tasks

Paper • 2505.16459 • Published May 22, 2025 • 45

VideoReasonBench: Can MLLMs Perform Vision-Centric Complex Video Reasoning?

Paper • 2505.23359 • Published May 29, 2025 • 39

MORSE-500: A Programmatically Controllable Video Benchmark to Stress-Test Multimodal Reasoning

Paper • 2506.05523 • Published Jun 5, 2025 • 34