13.6k
Open LLM Leaderboard
🏆
Track, rank and evaluate open LLMs and chatbots
Track, rank and evaluate open LLMs and chatbots
Embedding Leaderboard
Display LMArena Leaderboard
Explore and analyze code completion benchmarks
VLMEvalKit Evaluation Results Collection
Explore visual document retrieval benchmark results
View and request speech recognition model benchmarks
Display Berkeley Function-Calling Leaderboard
Generate a leaderboard for evaluating language models