Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
fair-forward
/
evals-for-every-language
like
2
Running
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
7ba92ec
evals-for-every-language
Commit History
uv sync
7ba92ec
David Pomerenke
commited on
8 days ago
Revert pyproject.toml
b5d0dc6
David Pomerenke
commited on
8 days ago
Fix path and dev group declaration
1614427
David Pomerenke
commited on
8 days ago
GH Action pipeline: Huggingface CLI login
decacfd
David Pomerenke
commited on
8 days ago
GH Action pipeline: install dev dependencies
d694f87
David Pomerenke
commited on
8 days ago
Fix paths
01ca19b
David Pomerenke
commited on
9 days ago
Use uv in GH action pipeline
ddde856
David Pomerenke
commited on
9 days ago
Fix import paths
c567aee
David Pomerenke
commited on
9 days ago
Download data in pipeline
bed0f1b
David Pomerenke
commited on
9 days ago
added download function and edited INFO
f529b7b
jonas
commited on
19 days ago
Separate requirements_dev.txt file
d15babf
David Pomerenke
commited on
9 days ago
Add dev dependencies to requirements.txt
6485aff
David Pomerenke
commited on
9 days ago
Add missing tqdm dependency
b7bd747
David Pomerenke
commited on
9 days ago
Fix Python version
35d713b
David Pomerenke
commited on
9 days ago
Simplify HF upload call
44748ce
David Pomerenke
commited on
9 days ago
Add GH Action for nightly evals
3dc9ba2
David Pomerenke
commited on
9 days ago
Use most popular current + historical models
9983b5f
David Pomerenke
commited on
9 days ago
Only run tasks for which there is no result yet
2f9dee1
David Pomerenke
commited on
9 days ago
Add commit message to HF push
019cada
David Pomerenke
commited on
9 days ago
Add GH action for pushing to HF
f9431d1
David Pomerenke
commited on
9 days ago
Add symbols for progress plot
68e918f
David Pomerenke
commited on
16 days ago
Display more language names
de40d0a
David Pomerenke
commited on
16 days ago
Run on 40 languages, additional models
260c1a3
David Pomerenke
commited on
16 days ago
Add scores to world map hover title
3680a5f
David Pomerenke
commited on
16 days ago
Change frontend text
f046407
David Pomerenke
commited on
16 days ago
Shorter classification prompt + error handling
0384b92
David Pomerenke
commited on
16 days ago
Run evals
b0c61ed
David Pomerenke
commited on
16 days ago
Move functions for sharing them
55406ba
David Pomerenke
commited on
16 days ago
Add Babel-670
7283eaa
David Pomerenke
commited on
16 days ago
Fix response when no evals data is available
c856043
David Pomerenke
commited on
18 days ago
Fix response when no evals data is available
32d50b0
David Pomerenke
commited on
18 days ago
Remove unnecessary function
a5cf2d9
David Pomerenke
commited on
18 days ago
Add WIP disclaimer
37ec45a
David Pomerenke
commited on
18 days ago
Fix: don't cache model metadata forever
c29b8da
David Pomerenke
commited on
18 days ago
Fix: sort copy, not in place
2eeba23
David Pomerenke
commited on
18 days ago
Change title and add blurb
58de179
David Pomerenke
commited on
18 days ago
test push - updated gitignore
c34b267
jonas
commited on
21 days ago
Run on 15 languages
f8a3dad
David Pomerenke
commited on
25 days ago
Improve plots and dataset table
a9e6b9b
David Pomerenke
commited on
25 days ago
Reorder datasets
603effe
David Pomerenke
commited on
25 days ago
Update models
8941a67
David Pomerenke
commited on
25 days ago
Add model history plot
f52ec6e
David Pomerenke
commited on
25 days ago
Add nice cumulative language population plot
b54f543
David Pomerenke
commited on
25 days ago
Implement MMLU task
a683732
David Pomerenke
commited on
25 days ago
MMLU data loader for 3 parallel datasets
47170a5
David Pomerenke
commited on
25 days ago
Add visual QA, reorder datasets
276ec94
David Pomerenke
commited on
25 days ago
Add dataset metadata about human/machine translation
d8f2dee
David Pomerenke
commited on
25 days ago
Analyze MMLU datasets
031925d
David Pomerenke
commited on
26 days ago
Refactor score columns
4106f13
David Pomerenke
commited on
26 days ago
Add Global MMLU benchmark
ce2acb0
David Pomerenke
commited on
26 days ago
Previous
1
2
3
4
Next