ggmbr commited on
Commit
bbef05c
·
1 Parent(s): 4affcfc

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +17 -3
README.md CHANGED
@@ -8,8 +8,22 @@ language:
8
  - en
9
  base_model:
10
  - microsoft/wavlm-large
 
 
 
11
  ---
12
 
13
- This model has been pushed to the Hub using the [PytorchModelHubMixin](https://huggingface.co/docs/huggingface_hub/package_reference/mixins#huggingface_hub.PyTorchModelHubMixin) integration:
14
- - Library: ggmbr/wnt
15
- - Docs: [More Information Needed]
 
 
 
 
 
 
 
 
 
 
 
 
8
  - en
9
  base_model:
10
  - microsoft/wavlm-large
11
+ datasets:
12
+ - VCTK
13
+ - VoxCeleb
14
  ---
15
 
16
+ # Non-timbral Embeddings extractor
17
+ This model has been derived from the self-supervised pretrained model WavLM-large [lien]. It produces embeddings that represent the non-timbral traits (prosody, accent, ...) of a speaker,
18
+ which can be used the same way as for a classical ASV (automatic speaker verification) embeddings, except that only the non-timbral traits are compared.
19
+
20
+ See section below for an eplanation on how to use these embeddings.
21
+
22
+ # Citation
23
+ paper
24
+
25
+ # Usage
26
+ code
27
+
28
+ # Limitations
29
+ The fine tuning data used to produce this model (VoxCeleb, VCTK) are mostly in english, which may affect the performance on other languages.