Commit History

uv sync
7ba92ec

David Pomerenke commited on

Revert pyproject.toml
b5d0dc6

David Pomerenke commited on

Fix path and dev group declaration
1614427

David Pomerenke commited on

GH Action pipeline: Huggingface CLI login
decacfd

David Pomerenke commited on

GH Action pipeline: install dev dependencies
d694f87

David Pomerenke commited on

Fix paths
01ca19b

David Pomerenke commited on

Use uv in GH action pipeline
ddde856

David Pomerenke commited on

Fix import paths
c567aee

David Pomerenke commited on

Download data in pipeline
bed0f1b

David Pomerenke commited on

added download function and edited INFO
f529b7b

jonas commited on

Separate requirements_dev.txt file
d15babf

David Pomerenke commited on

Add dev dependencies to requirements.txt
6485aff

David Pomerenke commited on

Add missing tqdm dependency
b7bd747

David Pomerenke commited on

Fix Python version
35d713b

David Pomerenke commited on

Simplify HF upload call
44748ce

David Pomerenke commited on

Add GH Action for nightly evals
3dc9ba2

David Pomerenke commited on

Use most popular current + historical models
9983b5f

David Pomerenke commited on

Only run tasks for which there is no result yet
2f9dee1

David Pomerenke commited on

Add commit message to HF push
019cada

David Pomerenke commited on

Add GH action for pushing to HF
f9431d1

David Pomerenke commited on

Add symbols for progress plot
68e918f

David Pomerenke commited on

Display more language names
de40d0a

David Pomerenke commited on

Run on 40 languages, additional models
260c1a3

David Pomerenke commited on

Add scores to world map hover title
3680a5f

David Pomerenke commited on

Change frontend text
f046407

David Pomerenke commited on

Shorter classification prompt + error handling
0384b92

David Pomerenke commited on

Run evals
b0c61ed

David Pomerenke commited on

Move functions for sharing them
55406ba

David Pomerenke commited on

Add Babel-670
7283eaa

David Pomerenke commited on

Fix response when no evals data is available
c856043

David Pomerenke commited on

Fix response when no evals data is available
32d50b0

David Pomerenke commited on

Remove unnecessary function
a5cf2d9

David Pomerenke commited on

Add WIP disclaimer
37ec45a

David Pomerenke commited on

Fix: don't cache model metadata forever
c29b8da

David Pomerenke commited on

Fix: sort copy, not in place
2eeba23

David Pomerenke commited on

Change title and add blurb
58de179

David Pomerenke commited on

test push - updated gitignore
c34b267

jonas commited on

Run on 15 languages
f8a3dad

David Pomerenke commited on

Improve plots and dataset table
a9e6b9b

David Pomerenke commited on

Reorder datasets
603effe

David Pomerenke commited on

Update models
8941a67

David Pomerenke commited on

Add model history plot
f52ec6e

David Pomerenke commited on

Add nice cumulative language population plot
b54f543

David Pomerenke commited on

Implement MMLU task
a683732

David Pomerenke commited on

MMLU data loader for 3 parallel datasets
47170a5

David Pomerenke commited on

Add visual QA, reorder datasets
276ec94

David Pomerenke commited on

Add dataset metadata about human/machine translation
d8f2dee

David Pomerenke commited on

Analyze MMLU datasets
031925d

David Pomerenke commited on

Refactor score columns
4106f13

David Pomerenke commited on

Add Global MMLU benchmark
ce2acb0

David Pomerenke commited on