Pendrokar
/

xvapitch

speech-to-speech

voice conversion

Model card Files Files and versions Community

Pendrokar commited on May 13

Commit

24ca692

·

verified ·

1 Parent(s): be6c16d

videos

Files changed (1) hide show

README.md +10 -1

README.md CHANGED Viewed

@@ -39,7 +39,10 @@ tags:
 pipeline_tag: text-to-speech
 ---
-GitHub project: https://github.com/DanRuta/xVA-Synth
 The base model for training other [🤗 xVASynth's](https://huggingface.co/spaces/Pendrokar/xVASynth-TTS) "xVAPitch" type models (v3). Model itself is used by the xVATrainer TTS model training app and not for inference. All created by Dan ["@dr00392"](https://huggingface.co/dr00392) Ruta.
@@ -54,6 +57,12 @@ xVAPitch_5820651 model sample: <audio controls>
 There are hundreds of fine-tuned models on the web. But most of them use non-permissive datasets.
 Papers:
 - VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech - https://arxiv.org/abs/2106.06103
 - YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for Everyone - https://arxiv.org/abs/2112.02418

 pipeline_tag: text-to-speech
 ---
+GitHub project, inference Windows/Electron app: https://github.com/DanRuta/xVA-Synth
+Fine-tuning app: https://github.com/DanRuta/xva-trainer
 The base model for training other [🤗 xVASynth's](https://huggingface.co/spaces/Pendrokar/xVASynth-TTS) "xVAPitch" type models (v3). Model itself is used by the xVATrainer TTS model training app and not for inference. All created by Dan ["@dr00392"](https://huggingface.co/dr00392) Ruta.
 There are hundreds of fine-tuned models on the web. But most of them use non-permissive datasets.
+## xVASynth Editor v3 walkthrough video ▶:
+[![Video](https://img.youtube.com/vi/5u4xpI-cAd8/hqdefault.jpg)](https://www.youtube.com/watch?v=5u4xpI-cAd8)
+## xVATrainer v1 walkthrough video ▶:
+[![Video](https://img.youtube.com/vi/PXv_SeTWk2M/hqdefault.jpg)](https://www.youtube.com/watch?v=PXv_SeTWk2M)
 Papers:
 - VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech - https://arxiv.org/abs/2106.06103
 - YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for Everyone - https://arxiv.org/abs/2112.02418