Commit History
Separate requirements_dev.txt file
d15babf
David Pomerenke
commited on
Add dev dependencies to requirements.txt
6485aff
David Pomerenke
commited on
Add missing tqdm dependency
b7bd747
David Pomerenke
commited on
Fix Python version
35d713b
David Pomerenke
commited on
Simplify HF upload call
44748ce
David Pomerenke
commited on
Add GH Action for nightly evals
3dc9ba2
David Pomerenke
commited on
Use most popular current + historical models
9983b5f
David Pomerenke
commited on
Only run tasks for which there is no result yet
2f9dee1
David Pomerenke
commited on
Add commit message to HF push
019cada
David Pomerenke
commited on
Add GH action for pushing to HF
f9431d1
David Pomerenke
commited on
Add symbols for progress plot
68e918f
David Pomerenke
commited on
Display more language names
de40d0a
David Pomerenke
commited on
Run on 40 languages, additional models
260c1a3
David Pomerenke
commited on
Add scores to world map hover title
3680a5f
David Pomerenke
commited on
Change frontend text
f046407
David Pomerenke
commited on
Shorter classification prompt + error handling
0384b92
David Pomerenke
commited on
Run evals
b0c61ed
David Pomerenke
commited on
Move functions for sharing them
55406ba
David Pomerenke
commited on
Add Babel-670
7283eaa
David Pomerenke
commited on
Fix response when no evals data is available
c856043
David Pomerenke
commited on
Fix response when no evals data is available
32d50b0
David Pomerenke
commited on
Remove unnecessary function
a5cf2d9
David Pomerenke
commited on
Add WIP disclaimer
37ec45a
David Pomerenke
commited on
Fix: don't cache model metadata forever
c29b8da
David Pomerenke
commited on
Fix: sort copy, not in place
2eeba23
David Pomerenke
commited on
Change title and add blurb
58de179
David Pomerenke
commited on
test push - updated gitignore
c34b267
Run on 15 languages
f8a3dad
David Pomerenke
commited on
Improve plots and dataset table
a9e6b9b
David Pomerenke
commited on
Reorder datasets
603effe
David Pomerenke
commited on
Update models
8941a67
David Pomerenke
commited on
Add model history plot
f52ec6e
David Pomerenke
commited on
Add nice cumulative language population plot
b54f543
David Pomerenke
commited on
Implement MMLU task
a683732
David Pomerenke
commited on
MMLU data loader for 3 parallel datasets
47170a5
David Pomerenke
commited on
Add visual QA, reorder datasets
276ec94
David Pomerenke
commited on
Add dataset metadata about human/machine translation
d8f2dee
David Pomerenke
commited on
Analyze MMLU datasets
031925d
David Pomerenke
commited on
Refactor score columns
4106f13
David Pomerenke
commited on
Add Global MMLU benchmark
ce2acb0
David Pomerenke
commited on
Add rich dependency
9e3bc4f
David Pomerenke
commited on
Translation both from and to
731eddd
David Pomerenke
commited on
Add language lists for MMLU
60d1364
David Pomerenke
commited on
Get popular models from OpenRouter
a32a92f
David Pomerenke
commited on
Datasets: add OpenGPT-X icon and reorder
a0679b4
David Pomerenke
commited on
Add OpenRouter metadata to models
9002fc2
David Pomerenke
commited on
Run on 100 languages, adjust display
8274634
David Pomerenke
commited on
Dataset table grouping
9051509
David Pomerenke
commited on
Adjust font sizes
51cb38c
David Pomerenke
commited on