Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
fair-forward
/
evals-for-every-language
like
2
Running
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
d15babf
evals-for-every-language
Commit History
Separate requirements_dev.txt file
d15babf
David Pomerenke
commited on
9 days ago
Add dev dependencies to requirements.txt
6485aff
David Pomerenke
commited on
9 days ago
Add missing tqdm dependency
b7bd747
David Pomerenke
commited on
9 days ago
Fix Python version
35d713b
David Pomerenke
commited on
9 days ago
Simplify HF upload call
44748ce
David Pomerenke
commited on
9 days ago
Add GH Action for nightly evals
3dc9ba2
David Pomerenke
commited on
9 days ago
Use most popular current + historical models
9983b5f
David Pomerenke
commited on
9 days ago
Only run tasks for which there is no result yet
2f9dee1
David Pomerenke
commited on
9 days ago
Add commit message to HF push
019cada
David Pomerenke
commited on
9 days ago
Add GH action for pushing to HF
f9431d1
David Pomerenke
commited on
9 days ago
Add symbols for progress plot
68e918f
David Pomerenke
commited on
16 days ago
Display more language names
de40d0a
David Pomerenke
commited on
16 days ago
Run on 40 languages, additional models
260c1a3
David Pomerenke
commited on
16 days ago
Add scores to world map hover title
3680a5f
David Pomerenke
commited on
16 days ago
Change frontend text
f046407
David Pomerenke
commited on
16 days ago
Shorter classification prompt + error handling
0384b92
David Pomerenke
commited on
16 days ago
Run evals
b0c61ed
David Pomerenke
commited on
16 days ago
Move functions for sharing them
55406ba
David Pomerenke
commited on
16 days ago
Add Babel-670
7283eaa
David Pomerenke
commited on
16 days ago
Fix response when no evals data is available
c856043
David Pomerenke
commited on
18 days ago
Fix response when no evals data is available
32d50b0
David Pomerenke
commited on
18 days ago
Remove unnecessary function
a5cf2d9
David Pomerenke
commited on
18 days ago
Add WIP disclaimer
37ec45a
David Pomerenke
commited on
18 days ago
Fix: don't cache model metadata forever
c29b8da
David Pomerenke
commited on
18 days ago
Fix: sort copy, not in place
2eeba23
David Pomerenke
commited on
18 days ago
Change title and add blurb
58de179
David Pomerenke
commited on
18 days ago
test push - updated gitignore
c34b267
jonas
commited on
20 days ago
Run on 15 languages
f8a3dad
David Pomerenke
commited on
25 days ago
Improve plots and dataset table
a9e6b9b
David Pomerenke
commited on
25 days ago
Reorder datasets
603effe
David Pomerenke
commited on
25 days ago
Update models
8941a67
David Pomerenke
commited on
25 days ago
Add model history plot
f52ec6e
David Pomerenke
commited on
25 days ago
Add nice cumulative language population plot
b54f543
David Pomerenke
commited on
25 days ago
Implement MMLU task
a683732
David Pomerenke
commited on
25 days ago
MMLU data loader for 3 parallel datasets
47170a5
David Pomerenke
commited on
25 days ago
Add visual QA, reorder datasets
276ec94
David Pomerenke
commited on
25 days ago
Add dataset metadata about human/machine translation
d8f2dee
David Pomerenke
commited on
25 days ago
Analyze MMLU datasets
031925d
David Pomerenke
commited on
26 days ago
Refactor score columns
4106f13
David Pomerenke
commited on
26 days ago
Add Global MMLU benchmark
ce2acb0
David Pomerenke
commited on
26 days ago
Add rich dependency
9e3bc4f
David Pomerenke
commited on
26 days ago
Translation both from and to
731eddd
David Pomerenke
commited on
about 1 month ago
Add language lists for MMLU
60d1364
David Pomerenke
commited on
about 1 month ago
Get popular models from OpenRouter
a32a92f
David Pomerenke
commited on
Apr 11
Datasets: add OpenGPT-X icon and reorder
a0679b4
David Pomerenke
commited on
Apr 11
Add OpenRouter metadata to models
9002fc2
David Pomerenke
commited on
Apr 11
Run on 100 languages, adjust display
8274634
David Pomerenke
commited on
Apr 6
Dataset table grouping
9051509
David Pomerenke
commited on
Apr 6
Adjust font sizes
51cb38c
David Pomerenke
commited on
Apr 6
Re-add dataset logos
003fe33
David Pomerenke
commited on
Apr 6
Previous
1
2
3
4
Next