Spaces: fair-forward/evals-for-every-language
f046407
evals-for-every-language/evals/tasks.py
Commit History
0384b92  Shorter classification prompt + error handling (David Pomerenke, committed 17 days ago)
a683732  Implement MMLU task (David Pomerenke, committed 27 days ago)
47170a5  MMLU data loader for 3 parallel datasets (David Pomerenke, committed 27 days ago)
ce2acb0  Add Global MMLU benchmark (David Pomerenke, committed 27 days ago)
731eddd  Translation both from and to (David Pomerenke, committed Apr 13)
8274634  Run on 100 languages, adjust display (David Pomerenke, committed Apr 6)
eaf2d97  spBLEU tokenizer, run on more languages (David Pomerenke, committed Mar 25)
da6e1bc  Refactor eval code into files (David Pomerenke, committed Mar 15)