Fix remote differ branch
README.md
CHANGED
```diff
@@ -997,6 +997,7 @@ metrics:
 | Update Date | Model Version | Key Changes |
 |--------------|-----------------------|-------------------------------------|
 | 2025/01/01 | v2025.01.01 | Fine-tuning is based on the [foundation model](https://huggingface.co/lianghsun/Llama-3.2-Taiwan-3B) version v2024.12.28, and it uses self-prepared instruction datasets for this round of fine-tuning. |
+| 2024/12/13 | v2024.12.13 | Completed 1st round of DPO training (10/10 epochs). Preparing for the next round of DPO training. |
 | 2024/11/27 | v2024.11.27 | Completed SFT training (5/5 epochs). Preparing for multi-round DPO training. |
 | 2024/11/25 | v2024.11.25 | Updated model version to v2024.11.25; training progressed to (3/5) epochs. Still in the SFT stage; DPO training remains pending. |
 | 2024/11/22 | v2024.11.22 | Initial upload: model version v2024.11.22, training completed up to (1/5) epochs. Trained only with SFT so far; DPO training not yet performed. |
```