Spaces: fair-forward/evals-for-every-language
Branch: main
Path: evals-for-every-language/evals
5 contributors
History: 69 commits
Latest commit: 3fbff09 (verified) by davidpomerenke, 3 days ago
Upload from GitHub Actions: Exclude TruthfulQA from proficiency score
| Name | Scan | Size | Last commit | Updated |
| --- | --- | --- | --- | --- |
| datasets_ | | | Upload from GitHub Actions: TruthfulQA translation WIP | 3 days ago |
| __init__.py | Safe | 1 Byte | Refactor eval code into files | 4 months ago |
| backend.py | Safe | 5.11 kB | Upload from GitHub Actions: Exclude TruthfulQA from proficiency score | 3 days ago |
| countries.py | Safe | 1.42 kB | Add Dockerfile | 3 months ago |
| download_data.py | Safe | 8.44 kB | Upload from GitHub Actions: Use FLORES+ via Huggingface | about 2 months ago |
| languages.py | Safe | 2.08 kB | Upload from GitHub Actions: More results | about 2 months ago |
| main.py | Safe | 2.5 kB | Upload from GitHub Actions: Get more results, compute average based on all tasks | 5 days ago |
| models.py | Safe | 9.5 kB | Upload from GitHub Actions: Get more results, compute average based on all tasks | 5 days ago |
| plots.py | Safe | 4.8 kB | Upload from GitHub Actions: TruthfulQA translation WIP | 3 days ago |
| tasks.py | Safe | 14.4 kB | Upload from GitHub Actions: Get more results, compute average based on all tasks | 5 days ago |
| translate.py | | 272 Bytes | Upload from GitHub Actions: Translate MMLU and evaluate | 7 days ago |