Commit History

Only run tasks for which there is no result yet
2f9dee1

David Pomerenke committed on

Run on 40 languages, additional models
260c1a3

David Pomerenke committed on

Move functions for sharing them
55406ba

David Pomerenke committed on

Implement MMLU task
a683732

David Pomerenke committed on

MMLU data loader for 3 parallel datasets
47170a5

David Pomerenke committed on

Analyze MMLU datasets
031925d

David Pomerenke committed on

Refactor eval code into files
da6e1bc

David Pomerenke committed on