Spaces: fair-forward/evals-for-every-language
David Pomerenke committed on Mar 9

Commit be99aac

1 Parent(s): 4973af4

Update README

Files changed (1)

README.md CHANGED

@@ -8,12 +8,14 @@ license: cc-by-sa-4.0
 short_description: Evaluating LLM performance across all human languages.
 datasets:
 - openlanguagedata/flores_plus
+- google/fleurs
 - mozilla-foundation/common_voice_1_0
 models:
 - meta-llama/Llama-3.3-70B-Instruct
 - mistralai/Mistral-Small-24B-Instruct-2501
 - deepseek-ai/DeepSeek-V3
 - microsoft/phi-4
+- openai/whisper-large-v3
 tags:
 - leaderboard
 - submission:manual
@@ -36,11 +38,3 @@ For tag meaning, see https://huggingface.co/spaces/leaderboards/LeaderboardsExpl
 # AI Language Monitor 🌍
 Benchmarking all big AI models on all benchmarkable languages.
-Sources:
-1. For AI models: [OpenRouter](https://openrouter.ai/)
-2. For language benchmarks: [FLORES+](https://github.com/openlanguagedata/flores)
-3. For language statistics: [Wikidata](https://gist.github.com/unhammer/3e8f2e0f79972bf5008a4c970081502d), [Ethnologue](https://www.ethnologue.com/browse/names/)
-[UI sketch](https://www.tldraw.com/ro/5YkWi9dfBixOkQ4FV23zA?d=v192.-1.2090.1569.page)