Commit History
Upload from nightly evaluation run
47bcf10
verified
Upload from GitHub Actions: Fix vibecoding
75010c2
verified
Upload from GitHub Actions: Ugly fix for CI errors
adc94d7
verified
Upload from GitHub Actions: Use Python 3.12 in all environments (uv, Docker, CI) and upgrade packages
a61d2b3
verified
Upload from GitHub Actions: Try moving `cache` calls that cause CI issues
bc4afa0
verified
Upload from GitHub Actions: Exclude free models from evals
c9e9db6
verified
Upload from GitHub Actions: Change HF repo URL
3409596
verified
Upload from nightly evaluation run
dcb356d
verified
Upload from nightly evaluation run
d2c1cb4
verified
Upload from GitHub Actions: Display N/A scores as such
1e8952a
verified
Upload from GitHub Actions: Copy data files to Docker image
6f68367
verified
Only upload relevant files
b258643
David Pomerenke
commited on
Use uv for pushing to HF
c144fd8
David Pomerenke
commited on
Update evaluation results [skip ci]
53c941c
github-actions[bot]
commited on
Try unsetting stuff
006f88d
David Pomerenke
commited on
Fix pushing
ce9de0c
David Pomerenke
commited on
Use GH_PAT
9851df9
David Pomerenke
commited on
Use GITHUB_TOKEN
6dd85c2
David Pomerenke
commited on
Block gemini-2.5-pro-exp-03-25
092c06a
David Pomerenke
commited on
Pass through kwargs
5fa433f
David Pomerenke
commited on
Fix dataset loading
c990cb9
David Pomerenke
commited on
Temporarily disable classification task
a48ff53
David Pomerenke
commited on
uv sync
7ba92ec
David Pomerenke
commited on
Revert pyproject.toml
b5d0dc6
David Pomerenke
commited on
Fix path and dev group declaration
1614427
David Pomerenke
commited on
GH Action pipeline: Huggingface CLI login
decacfd
David Pomerenke
commited on
GH Action pipeline: install dev dependencies
d694f87
David Pomerenke
commited on
Fix paths
01ca19b
David Pomerenke
commited on
Use uv in GH action pipeline
ddde856
David Pomerenke
commited on
Fix import paths
c567aee
David Pomerenke
commited on
Download data in pipeline
bed0f1b
David Pomerenke
commited on
added download function and edited INFO
f529b7b
Separate requirements_dev.txt file
d15babf
David Pomerenke
commited on
Add dev dependencies to requirements.txt
6485aff
David Pomerenke
commited on
Add missing tqdm dependency
b7bd747
David Pomerenke
commited on
Fix Python version
35d713b
David Pomerenke
commited on
Simplify HF upload call
44748ce
David Pomerenke
commited on
Add GH Action for nightly evals
3dc9ba2
David Pomerenke
commited on
Use most popular current + historical models
9983b5f
David Pomerenke
commited on
Only run tasks for which there is no result yet
2f9dee1
David Pomerenke
commited on
Add commit message to HF push
019cada
David Pomerenke
commited on
Add GH action for pushing to HF
f9431d1
David Pomerenke
commited on
Add symbols for progress plot
68e918f
David Pomerenke
commited on
Display more language names
de40d0a
David Pomerenke
commited on
Run on 40 languages, additional models
260c1a3
David Pomerenke
commited on
Add scores to world map hover title
3680a5f
David Pomerenke
commited on
Change frontend text
f046407
David Pomerenke
commited on
Shorter classification prompt + error handling
0384b92
David Pomerenke
commited on
Run evals
b0c61ed
David Pomerenke
commited on