OmniAICreator
/

Galgame-Llasa-1B-v3

OmniAICreator committed 25 days ago

Commit e3f797a · verified · 1 Parent(s): e0394c9

Create README.md

Files changed (1)

README.md +39 -0

README.md ADDED

	@@ -0,0 +1,39 @@

+---
+license: cc-by-nc-4.0
+datasets:
+- OOPPEENN/56697375616C4E6F76656C5F44617461736574
+- amphion/Emilia-Dataset
+- litagin/ehehe-corpus
+- joujiboi/japanese-anime-speech
+language:
+- ja
+base_model:
+- HKUSTAudio/Llasa-1B-Multilingual
+pipeline_tag: text-to-speech
+---
+# Galgame-Llasa-1B-v3
+## Overview
+This is version 3 of Galgame-Llasa-1B, a Text-to-Speech (TTS) model fine-tuned for Japanese. It is based on [HKUSTAudio/Llasa-1B-Multilingual](https://huggingface.co/HKUSTAudio/Llasa-1B-Multilingual).
+## What's New in v3?
+The primary change in v3 is a **modified text normalization process** during training.
+This update yields more consistent and accurate speech synthesis, building on the improvements made in v2.
+## What's New in v2 (from v1)?
+Version 2 was trained on a larger and more diverse dataset, including the original Galgame dataset and other sources.
+As a result, v2 offered several key improvements over the original version:
+- **Improved Kanji Reading:** The model handled the reading of Kanji characters more accurately.
+- **Enhanced Prosody:** The generated speech had more natural intonation and expressiveness.
+- **Greater Voice Diversity:** The model could produce a wider range of voice styles than the previous version.
+## License
+This model is licensed under the **CC-BY-NC-4.0** license.
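
The README names text normalization as the main v3 change but does not describe the rules that were modified. As a rough illustration of what normalizing Japanese input before TTS can look like, the sketch below applies Unicode NFKC folding and whitespace cleanup. The function name `normalize_text` and both rules are assumptions made for this example; they are not taken from the model's actual training pipeline.

```python
import re
import unicodedata


def normalize_text(text: str) -> str:
    """Hypothetical normalizer for illustration; not the author's pipeline."""
    # NFKC folds full-width ASCII into plain ASCII and half-width katakana
    # into full-width katakana, so equivalent inputs share one spelling.
    text = unicodedata.normalize("NFKC", text)
    # Collapse runs of whitespace (NFKC has already mapped the ideographic
    # space U+3000 to a regular space) and trim both ends.
    return re.sub(r"\s+", " ", text).strip()


if __name__ == "__main__":
    print(normalize_text("ｶﾞﾙｹﾞｰ　ＴＴＳ"))
```

Applying the same normalization at inference time that was used in training generally keeps inputs in the form the model has seen, which is one plausible reason a normalization change could affect synthesis consistency.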