open-r1-eval-leaderboard / eval_results

Commit History

Upload eval_results/Qwen/Qwen2-1.5B-Instruct/main/mixeval/results_2024-08-26T12-08-12.json with huggingface_hub
c6428d5
verified

lewtun HF Staff committed on

Upload eval_results/Qwen/Qwen2-1.5B-Instruct/main/mixeval_hard/results_2024-08-26T12-03-21.json with huggingface_hub
ae6297b
verified

lewtun HF Staff committed on

Upload eval_results/Qwen/Qwen2-0.5B-Instruct/main/mixeval/results_2024-08-26T12-03-10.json with huggingface_hub
fa8b181
verified

lewtun HF Staff committed on

Upload eval_results/Qwen/Qwen2-0.5B-Instruct/main/mixeval_hard/results_2024-08-26T12-03-05.json with huggingface_hub
787f8cc
verified

lewtun HF Staff committed on

Upload eval_results/lewtun/qwen2-0.5B-lr-3e-6/main/mixeval/results_2024-08-26T12-02-09.json with huggingface_hub
c29763a
verified

lewtun HF Staff committed on

Upload eval_results/lewtun/qwen2-0.5B-lr-3e-6/main/mixeval_hard/results_2024-08-26T11-57-26.json with huggingface_hub
de3bca4
verified

lewtun HF Staff committed on

Upload eval_results/lewtun/qwen2-1.5B-lr-3e-6/main/mixeval/results_2024-08-26T11-57-10.json with huggingface_hub
fc852a7
verified

lewtun HF Staff committed on

Upload eval_results/lewtun/qwen2-1.5B-lr-3e-6/main/mixeval_hard/results_2024-08-26T11-51-39.json with huggingface_hub
0d9b31d
verified

lewtun HF Staff committed on

Upload eval_results/mistralai/Mistral-7B-Instruct-v0.3/main/mixeval/results_2024-08-26T09-35-18.json with huggingface_hub
1f73ae0
verified

lewtun HF Staff committed on

Upload eval_results/lewtun/qwen2-0.5B-lr-3e-6/main/mixeval/results_2024-08-26T09-33-44.json with huggingface_hub
6311c27
verified

lewtun HF Staff committed on

Upload eval_results/Qwen/Qwen2-1.5B-Instruct/main/alpaca_eval/results_2024-08-25T19-01-46.json with huggingface_hub
0f37962
verified

lewtun HF Staff committed on

Upload eval_results/lewtun/qwen2-1.5B-lr-3e-6/main/ifeval/results_2024-08-25T18-59-32.625679.json with huggingface_hub
918fc27
verified

lewtun HF Staff committed on

Upload eval_results/Qwen/Qwen2-0.5B-Instruct/main/alpaca_eval/results_2024-08-25T18-43-56.json with huggingface_hub
9419480
verified

lewtun HF Staff committed on

Upload eval_results/lewtun/qwen2-1.5B-lr-3e-6/main/alpaca_eval/results_2024-08-25T18-42-56.json with huggingface_hub
276b2a5
verified

lewtun HF Staff committed on

Upload eval_results/lewtun/qwen2-0.5B-lr-3e-6/main/alpaca_eval/results_2024-08-25T18-27-08.json with huggingface_hub
fdba288
verified

lewtun HF Staff committed on

Upload eval_results/lewtun/qwen2-0.5B-lr-3e-6/main/ifeval/results_2024-08-25T09-49-23.074510.json with huggingface_hub
82ea927
verified

lewtun HF Staff committed on

Upload eval_results/lewtun/qwen2-1.5B-lr-3e-6/175f7047474e2d7d1bc3b935ffdf8b7548784110/ifeval/results_2024-08-24T20-29-18.026712.json with huggingface_hub
e1bcf61
verified

lewtun HF Staff committed on

Upload eval_results/lewtun/qwen2-0.5B-lr-3e-6/72e44bab27d4dc1a6573f534de36fa4bb914ba00/ifeval/results_2024-08-24T20-28-36.811969.json with huggingface_hub
659bf1f
verified

lewtun HF Staff committed on

Upload eval_results/lewtun/qwen2-1.5B-lr-3e-6/c4a64a56b026810bf8e55db0ca9f27b393fdc99a/ifeval/results_2024-08-24T10-05-38.205425.json with huggingface_hub
7b8278d
verified

lewtun HF Staff committed on

Upload eval_results/lewtun/qwen2-0.5B-lr-3e-6/145d72f20bac0ee1fc46e8531b0db8e91c69cfec/ifeval/results_2024-08-24T09-51-30.540181.json with huggingface_hub
c152bf4
verified

lewtun HF Staff committed on

Upload eval_results/lewtun/qwen2-0.5B-lr-3e-6/145d72f20bac0ee1fc46e8531b0db8e91c69cfec/ifeval/results_2024-08-24T09-51-33.331680.json with huggingface_hub
31bd918
verified

lewtun HF Staff committed on

Upload eval_results/lewtun/qwen2-0.5B-lr-3e-6/46351da80b089312fb70cca21b24589a52853bbf/ifeval/results_2024-08-24T09-51-24.253482.json with huggingface_hub
96008bc
verified

lewtun HF Staff committed on

Upload eval_results/lewtun/qwen2-0.5B-lr-3e-6/main/ifeval/results_2024-08-24T09-51-15.396849.json with huggingface_hub
06e5e6a
verified

lewtun HF Staff committed on

Upload eval_results/HuggingFaceTB/SmolLM-360M-Instruct/main/ifeval/results_2024-08-22T20-36-28.226057.json with huggingface_hub
7038b16
verified

lewtun HF Staff committed on

Upload eval_results/HuggingFaceTB/SmolLM-135M-Instruct/main/ifeval/results_2024-08-22T20-33-35.886928.json with huggingface_hub
94fe8bd
verified

lewtun HF Staff committed on

Upload eval_results/HuggingFaceTB/SmolLM-1.7B-Instruct/main/ifeval/results_2024-08-22T20-33-11.064056.json with huggingface_hub
a4a6e36
verified

lewtun HF Staff committed on

Upload eval_results/Qwen/Qwen2-7B-Instruct/main/ifeval/results_2024-08-22T20-23-16.600685.json with huggingface_hub
a53913f
verified

lewtun HF Staff committed on

Upload eval_results/Qwen/Qwen2-1.5B-Instruct/main/ifeval/results_2024-08-22T19-32-51.212658.json with huggingface_hub
a689427
verified

lewtun HF Staff committed on

Upload eval_results/Qwen/Qwen2-0.5B-Instruct/main/ifeval/results_2024-08-22T19-11-14.012305.json with huggingface_hub
55b99b4
verified

lewtun HF Staff committed on

Upload eval_results/AI-MO/NuminaMath-7B-TIR/main/amc_2023/results_2024-07-18T08-08-42.916465.json with huggingface_hub
f13f526
verified

edbeeching HF Staff committed on

Upload eval_results/AI-MO/qwen2-72b-sft/aimo_v02.01/math_v3/results_2024-07-17T21-19-51.995991.json with huggingface_hub
7d0e882
verified

lewtun HF Staff committed on

Upload eval_results/AI-MO/NuminaMath-70B-TIR-release-candidate-1/main/aime_2024/results_2024-07-17T21-14-23.207355.json with huggingface_hub
645fb7b
verified

edbeeching HF Staff committed on

Upload eval_results/AI-MO/NuminaMath-70B-TIR-release-candidate-1/main/aime_2024/results_2024-07-17T21-12-39.631679.json with huggingface_hub
2f28154
verified

edbeeching HF Staff committed on

Upload eval_results/AI-MO/NuminaMath-70B-TIR-release-candidate-1/main/aime_2024/results_2024-07-17T21-12-20.880527.json with huggingface_hub
108bb94
verified

edbeeching HF Staff committed on

Upload eval_results/AI-MO/NuminaMath-70B-TIR-release-candidate-1/main/aime_2024/results_2024-07-17T21-12-16.953012.json with huggingface_hub
783550a
verified

edbeeching HF Staff committed on

Upload eval_results/AI-MO/NuminaMath-70B-TIR-release-candidate-1/main/aime_2024/results_2024-07-17T21-11-33.396613.json with huggingface_hub
5aae388
verified

edbeeching HF Staff committed on

Upload eval_results/AI-MO/qwen2-72b-sft/aimo_v02.01-step-20379/math_v3/results_2024-07-17T21-01-43.760071.json with huggingface_hub
9deb8b3
verified

lewtun HF Staff committed on

Upload eval_results/AI-MO/NuminaMath-7B-TIR-release-candidate-2/main/aime_2024/results_2024-07-17T20-51-30.764608.json with huggingface_hub
9d15677
verified

edbeeching HF Staff committed on

Upload eval_results/AI-MO/NuminaMath-7B-TIR/main/amc_2023/results_2024-07-17T20-47-09.745938.json with huggingface_hub
8646abf
verified

edbeeching HF Staff committed on

Upload eval_results/AI-MO/NuminaMath-7B-TIR/main/aime_2024/results_2024-07-17T20-45-20.008010.json with huggingface_hub
5d142bc
verified

edbeeching HF Staff committed on

Upload eval_results/AI-MO/NuminaMath-7B-TIR/main/amc_2023/results_2024-07-17T20-43-55.375512.json with huggingface_hub
516e143
verified

edbeeching HF Staff committed on

Upload eval_results/AI-MO/NuminaMath-7B-TIR/main/amc_2023/results_2024-07-17T20-43-53.094982.json with huggingface_hub
54df3b9
verified

edbeeching HF Staff committed on

Upload eval_results/AI-MO/NuminaMath-7B-TIR/main/amc_2023/results_2024-07-17T20-43-40.529273.json with huggingface_hub
10e0c43
verified

edbeeching HF Staff committed on

Upload eval_results/AI-MO/NuminaMath-7B-TIR/main/amc_2023/results_2024-07-17T20-43-18.013572.json with huggingface_hub
d8d5f8e
verified

edbeeching HF Staff committed on

Upload eval_results/AI-MO/NuminaMath-7B-TIR/main/aime_2024/results_2024-07-17T20-42-47.641308.json with huggingface_hub
285a9fe
verified

edbeeching HF Staff committed on

Upload eval_results/AI-MO/NuminaMath-7B-TIR/main/aime_2024/results_2024-07-17T20-42-37.945722.json with huggingface_hub
65b6663
verified

edbeeching HF Staff committed on

Upload eval_results/AI-MO/NuminaMath-7B-TIR/main/aime_2024/results_2024-07-17T20-42-32.896430.json with huggingface_hub
e6fffcd
verified

edbeeching HF Staff committed on

Upload eval_results/AI-MO/NuminaMath-7B-TIR/main/aime_2024/results_2024-07-17T20-42-10.456496.json with huggingface_hub
c70f6d7
verified

edbeeching HF Staff committed on

Upload eval_results/AI-MO/NuminaMath-7B-TIR/main/amc_2023/results_2024-07-17T20-34-38.796280.json with huggingface_hub
5d06390
verified

edbeeching HF Staff committed on

Upload eval_results/AI-MO/NuminaMath-7B-TIR/main/amc_2023/results_2024-07-17T20-34-15.074966.json with huggingface_hub
60f797e
verified

edbeeching HF Staff committed on