Pendrokar committed on
Commit 24ca692 · verified
1 Parent(s): be6c16d
Files changed (1)
  1. README.md +10 -1
README.md CHANGED
@@ -39,7 +39,10 @@ tags:
  pipeline_tag: text-to-speech
  ---
 
- GitHub project: https://github.com/DanRuta/xVA-Synth
+ GitHub project, inference Windows/Electron app: https://github.com/DanRuta/xVA-Synth
+
+ Fine-tuning app: https://github.com/DanRuta/xva-trainer
+
 
  The base model for training other [🤗 xVASynth's](https://huggingface.co/spaces/Pendrokar/xVASynth-TTS) "xVAPitch" type models (v3). Model itself is used by the xVATrainer TTS model training app and not for inference. All created by Dan ["@dr00392"](https://huggingface.co/dr00392) Ruta.
 
@@ -54,6 +57,12 @@ xVAPitch_5820651 model sample: <audio controls>
 
  There are hundreds of fine-tuned models on the web. But most of them use non-permissive datasets.
 
+ ## xVASynth Editor v3 walkthrough video ▶:
+ [![Video](https://img.youtube.com/vi/5u4xpI-cAd8/hqdefault.jpg)](https://www.youtube.com/watch?v=5u4xpI-cAd8)
+
+ ## xVATrainer v1 walkthrough video ▶:
+ [![Video](https://img.youtube.com/vi/PXv_SeTWk2M/hqdefault.jpg)](https://www.youtube.com/watch?v=PXv_SeTWk2M)
+
  Papers:
  - VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech - https://arxiv.org/abs/2106.06103
  - YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for Everyone - https://arxiv.org/abs/2112.02418