GPT-SoVITS service for livestreaming digital humans
RVC-Boss 377ceafb7c
Merge pull request #1 from ricecakey06/main
Add requirements.txt
2 years ago
GPT_SoVITS Create init 2 years ago
i18n Add files via upload 2 years ago
tools Add files via upload 2 years ago
LICENSE Initial commit 2 years ago
README.md Update README.md 2 years ago
config.py Add files via upload 2 years ago
go-webui.bat Add files via upload 2 years ago
requirements.txt Update requirements.txt 2 years ago
webui.py Add files via upload 2 years ago

README.md

I am organizing and uploading the code. It will be public within a day.

demo video and features

https://www.bilibili.com/video/BV12g4y1m7Uw/

todo

todolist

todo

Requirements (How to install)

Python and PyTorch version

Tested with Python 3.9, PyTorch 2.0.1, and CUDA 11 (py39+pytorch2.0.1+cu11).

pip packages

pip install torch numpy scipy tensorboard librosa==0.9.2 numba==0.56.4 pytorch-lightning gradio==3.14.0 ffmpeg-python onnxruntime tqdm==4.59.0 cn2an pypinyin pyopenjtalk g2p_en

Optional

If you need the Chinese ASR feature (provided by funasr), also run

pip install modelscope sentencepiece funasr
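
Since several packages above are pinned to exact versions, it can help to confirm that the environment matches before launching the web UI. The following is a minimal sketch, not part of the repository: the `PINNED` table simply copies the pins from the pip command above, and `installed_version` and `find_mismatches` are hypothetical helper names. It relies only on the standard-library `importlib.metadata` module.

```python
from importlib import metadata

# Version pins copied from the pip command above; unpinned packages are omitted.
PINNED = {
    "librosa": "0.9.2",
    "numba": "0.56.4",
    "gradio": "3.14.0",
    "tqdm": "4.59.0",
}


def installed_version(package):
    """Return the installed version string, or None if the package is missing."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


def find_mismatches(pins, lookup=installed_version):
    """Return {package: (expected, found)} for every pin that is not satisfied."""
    mismatches = {}
    for package, expected in pins.items():
        found = lookup(package)
        if found != expected:
            mismatches[package] = (expected, found)
    return mismatches


if __name__ == "__main__":
    for package, (expected, found) in find_mismatches(PINNED).items():
        print(f"{package}: expected {expected}, found {found or 'not installed'}")
```

Running the script prints one line per pinned package that is missing or at a different version, and prints nothing when every pin is satisfied.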

You need ffmpeg.

Ubuntu/Debian users

sudo apt install ffmpeg

MacOS users

brew install ffmpeg

Windows users

Download the ffmpeg executables and put them in the GPT-SoVITS root directory.
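
To check that ffmpeg can actually be found after following the steps above, a small lookup like the one below may be useful. This is a sketch under assumptions: `find_tool` is a hypothetical helper, not something the repository provides, and it assumes the Windows binaries are named with an `.exe` suffix and sit directly in the repository root.

```python
import shutil
from pathlib import Path


def find_tool(name, repo_root="."):
    """Return the path of an executable found on PATH or in repo_root, else None."""
    on_path = shutil.which(name)
    if on_path:
        return on_path
    # On Windows the binary may sit in the repository root instead of on PATH.
    for candidate in (Path(repo_root) / name, Path(repo_root) / f"{name}.exe"):
        if candidate.is_file():
            return str(candidate)
    return None


if __name__ == "__main__":
    location = find_tool("ffmpeg")
    print(location if location else "ffmpeg not found on PATH or in the repository root")
```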

You need to download some pretrained models

pretrained GPT-SoVITS models/SSL feature model/Chinese BERT model

Download the files from

https://huggingface.co/lj1995/GPT-SoVITS

and put them in

GPT_SoVITS/pretrained_models

Chinese ASR (optional)

Download the files from

https://modelscope.cn/models/damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch/files

https://modelscope.cn/models/damo/speech_fsmn_vad_zh-cn-16k-common-pytorch/files

https://modelscope.cn/models/damo/punc_ct-transformer_zh-cn-common-vocab272727-pytorch/files

and put them in

tools/damo_asr/models


UVR5 (vocals/accompaniment separation and reverberation removal, optional)

Download the models you need from

https://huggingface.co/lj1995/VoiceConversionWebUI/tree/main/uvr5_weights

and put them in

tools/uvr5/uvr5_weights
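
Because the model files are copied by hand into three separate directories, a quick check that each directory exists and is not empty can catch a misplaced download early. The sketch below is illustrative only: `MODEL_DIRS` lists the target directories named in this README, and `empty_or_missing` is a hypothetical helper. It does not verify that the files themselves are the right ones.

```python
from pathlib import Path

# Target directories named in this README, relative to the repository root.
MODEL_DIRS = [
    "GPT_SoVITS/pretrained_models",  # required pretrained models
    "tools/damo_asr/models",         # optional: Chinese ASR
    "tools/uvr5/uvr5_weights",       # optional: UVR5
]


def empty_or_missing(repo_root, relative_dirs=MODEL_DIRS):
    """Return the directories that do not exist or contain no entries."""
    root = Path(repo_root)
    problems = []
    for rel in relative_dirs:
        directory = root / rel
        if not directory.is_dir() or not any(directory.iterdir()):
            problems.append(rel)
    return problems


if __name__ == "__main__":
    for rel in empty_or_missing("."):
        print(f"missing or empty: {rel}")
```

Run it from the repository root. The optional directories only matter if you plan to use the corresponding feature.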

Credits

todo