Commit
·
189873c
1
Parent(s):
3ff1322
Update README.md
Browse files
README.md
CHANGED
@@ -12,13 +12,17 @@ tags:
|
|
12 |
## Model Architecture
|
13 |
The Mistral-7B-v0.1 model is a transformer-based model that can handle a variety of tasks including but not limited to translation, summarization, and text completion. It's particularly designed for the Italian language and can be fine-tuned for specific tasks.
|
14 |
|
15 |
-
|
16 |
[Leaderboard Ita LLM](https://huggingface.co/spaces/FinancialSupport/open_ita_llm_leaderboard)
|
17 |
|
18 |
| hellaswag_it acc_norm | arc_it acc_norm | m_mmlu_it 5-shot acc | Average |
|
19 |
|:----------------------| :--------------- | :-------------------- | :------- |
|
20 |
| 0.6734 | 0.5466 | 0.5334 | 0,5844 |
|
21 |
|
|
|
|
|
|
|
|
|
22 |
## How to Use
|
23 |
How to utilize my Mistral for Italian text generation
|
24 |
|
|
|
12 |
## Model Architecture
|
13 |
The Mistral-7B-v0.1 model is a transformer-based model that can handle a variety of tasks including but not limited to translation, summarization, and text completion. It's particularly designed for the Italian language and can be fine-tuned for specific tasks.
|
14 |
|
15 |
+
## Evaluation
|
16 |
[Leaderboard Ita LLM](https://huggingface.co/spaces/FinancialSupport/open_ita_llm_leaderboard)
|
17 |
|
18 |
| hellaswag_it acc_norm | arc_it acc_norm | m_mmlu_it 5-shot acc | Average |
|
19 |
|:----------------------| :--------------- | :-------------------- | :------- |
|
20 |
| 0.6734 | 0.5466 | 0.5334 | 0,5844 |
|
21 |
|
22 |
+
## Quantized Version
|
23 |
+
|
24 |
+
[4_bit_version](https://huggingface.co/DeepMount00/Mistral-Ita-7b-GGUF)
|
25 |
+
|
26 |
## How to Use
|
27 |
How to utilize my Mistral for Italian text generation
|
28 |
|