Commit History
Only run tasks for which there is no result yet
2f9dee1
David Pomerenke
commited on
Run on 40 languages, additional models
260c1a3
David Pomerenke
commited on
Move functions for sharing them
55406ba
David Pomerenke
commited on
Implement MMLU task
a683732
David Pomerenke
commited on
MMLU data loader for 3 parallel datasets
47170a5
David Pomerenke
commited on
Analyze MMLU datasets
031925d
David Pomerenke
commited on
Refactor eval code into files
da6e1bc
David Pomerenke
commited on