HachiML
/

Llama-2-13b-hf-qlora-dolly-ja-2ep

Model card Files Files and versions Community

HachiML commited on Aug 8, 2023

Commit

7d2571d

·

1 Parent(s): 4500e0d

Update README.md

Files changed (1) hide show

README.md +9 -8

README.md CHANGED Viewed

@@ -7,17 +7,18 @@ language:
 - ja
 ---
 ## JGLUE Score
-We evaluated our model using the following JGLUE tasks. Here are the scores:
-| Task                | Score     |
-|---------------------|----------:|
-| JCOMMONSENSEQA(acc) | 75.78     |
-| JNLI(acc)           | 50.69     |
-| MARC_JA(acc)        | 79.64     |
-| JSQUAD(exact_match) | 62.83     |
-| **Average**         | **67.23** |
 - Note: Use v0.3 prompt template
 - The JGLUE scores were measured using the following script:
 [Stability-AI/lm-evaluation-harness](https://github.com/Stability-AI/lm-evaluation-harness/tree/jp-stable)
 ## How to use

 - ja
 ---
 ## JGLUE Score
+I evaluated this model using the following JGLUE tasks. Here are the scores:
+| Task                | Llama-2-7b-hf (*) | This Model |
+|---------------------|:-----------------:|:----------:|
+| JCOMMONSENSEQA(acc) | 51.56             | 75.78      |
+| JNLI(acc)           | 29.74             | 50.69      |
+| MARC_JA(acc)        | 85.72             | 79.64      |
+| JSQUAD(exact_match) | 64.16             | 62.83      |
+| **Average**         | **57.79**         | **67.23**  |
 - Note: Use v0.3 prompt template
 - The JGLUE scores were measured using the following script:
 [Stability-AI/lm-evaluation-harness](https://github.com/Stability-AI/lm-evaluation-harness/tree/jp-stable)
+- (*) Refer to the following article: [Google Colab での JP Language Model Evaluation Harness による日本語LLMの評価手順](https://note.com/npaka/n/nedf4dacd4037)
 ## How to use