@ -4,11 +4,31 @@ I am organizing and uploading the codes. It will be public in one day.
https://www.bilibili.com/video/BV12g4y1m7Uw/
https://www.bilibili.com/video/BV12g4y1m7Uw/
todo
features:
1、input 5s vocal, zero shot TTS
2、1min training dataset, fine tune (few shot TTS. The TTS model trained using few-shot techniques exhibits significantly better similarity and realism in the speaker's voice compared to zero-shot.)
3、Cross lingual (inference another language that is different from the training dataset language), now support English, Japanese and Chinese
4、This WebUI integrates tools such as voice accompaniment separation, automatic segmentation of training sets, Chinese ASR, text labeling, etc., to help beginners quickly create their own training datasets and GPT/SoVITS models.