Update README.md
Browse files
README.md
CHANGED
@@ -29,6 +29,18 @@ You cloud test this model with [Colab](https://colab.research.google.com/drive/1
|
|
29 |
|
30 |
I published mix-instruction alpaca-style dataset '[xiaodongguaAIGC/alpaca_en_zh_ruozhiba](https://huggingface.co/datasets/xiaodongguaAIGC/alpaca_en_zh_ruozhiba)'
|
31 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
32 |
# test
|
33 |
|
34 |
## generation like
|
|
|
29 |
|
30 |
I published mix-instruction alpaca-style dataset '[xiaodongguaAIGC/alpaca_en_zh_ruozhiba](https://huggingface.co/datasets/xiaodongguaAIGC/alpaca_en_zh_ruozhiba)'
|
31 |
|
32 |
+
# evaluation
|
33 |
+
|
34 |
+
Result:
|
35 |
+
|
36 |
+
| Model | MMLU | C-EVAL | C-MMLU |
|
37 |
+
| ------------------- | ----- | ------ | ------ |
|
38 |
+
| Llama-3-8B | 66.6 | 49.5 | 50.8 |
|
39 |
+
| Llama-3-8B-Instruct | 68.4 | 45.9 | / |
|
40 |
+
| Llama-3-8B-xdg | 56.71 | 42.83 | 45.04 |
|
41 |
+
|
42 |
+
- Llama-3-8B evaluation result from [Qwen2](https://huggingface.co/Qwen/Qwen2-7B-Instruct)
|
43 |
+
|
44 |
# test
|
45 |
|
46 |
## generation like
|