mikeyang01/Digital_Life_Server_custom

 
 

Digital Life Server

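This is the server-side code for "Digital Life". It covers communication with the front end, speech recognition, ChatGPT integration, and speech synthesis.
For the other parts of the project, please refer to:
Launcher: a graphical interface for starting this server.
UE Client: the front end that renders the character animation, records audio, and plays sound.
For the detailed configuration process, see readme_detail.md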

Getting things ready to roll:

Clone this repo

git clone https://github.com/zixiiu/Digital_Life_Server.git --recursive

Install prerequisites

  1. Install PyTorch

    pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118
  2. Install other requirements

    pip install -r requirements.txt
  3. Build monotonic_align
    This step may not work smoothly on every setup, but the intent is to compile the extension and copy the built file into this directory.

    cd "TTS/vits/monotonic_align"
    mkdir monotonic_align
    python setup.py build_ext --inplace

    On Windows, run

    cp monotonic_align/*.pyd .
    

    On Linux, run

    cp monotonic_align/*.so .
    
  4. Download models
    Baidu Netdisk
    HuggingFace

    ASR Model:
    to /ASR/resources/models
    Sentiment Model:
    to /SentimentEngine/models
    TTS Model:
    to /TTS/models

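  5. (For computers without an Nvidia GPU, where everything runs on the CPU) one extra step is required:

    On line 36 of Digital_Life_Server\TTS\TTService.py, change

    self.net_g = SynthesizerTrn(...).cuda()
    to
    self.net_g = SynthesizerTrn(...).cpu()
    

    At this point the project is fully set up 🥰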
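
As an alternative to editing the line by hand in step 5, the device could be chosen at runtime. The following is only a sketch: `pick_device` is a hypothetical helper that is not part of this repository, and it assumes the model object is a regular PyTorch module, so that `.to(device)` can stand in for `.cuda()` / `.cpu()`.

```python
import importlib.util


def pick_device() -> str:
    """Return "cuda" if PyTorch is installed and sees a GPU, otherwise "cpu"."""
    # Hypothetical helper, not part of the repository.
    if importlib.util.find_spec("torch") is None:
        return "cpu"  # PyTorch is not installed, so there is nothing to query
    import torch

    return "cuda" if torch.cuda.is_available() else "cpu"


device = pick_device()
print("Selected device:", device)

# Assumed change in TTS/TTService.py (the constructor arguments stay as they are):
#   self.net_g = SynthesizerTrn(...).to(device)
```

With this approach the same source file would run on machines with and without an Nvidia GPU, at the cost of one extra import-time check.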

Start the server

run-gpt3.5-api.bat
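
Before starting the server, it may help to confirm that the setup steps above actually produced the expected files. The sketch below is not part of the repository; `preflight` is a hypothetical helper. It assumes the three model directories and the monotonic_align build directory listed in the steps above, taken relative to the repository root, and it only checks that files are present, not that they are the correct models.

```python
from pathlib import Path
from typing import List

# Directories named in the "Download models" step, relative to the repo root.
MODEL_DIRS = ["ASR/resources/models", "SentimentEngine/models", "TTS/models"]
# Directory used in the "Build monotonic_align" step.
ALIGN_DIR = "TTS/vits/monotonic_align"


def preflight(repo_root: str = ".") -> List[str]:
    """Return a list of problems found; an empty list means every check passed."""
    root = Path(repo_root)
    problems = []

    # Each model directory must exist and contain at least one entry.
    for rel in MODEL_DIRS:
        directory = root / rel
        if not directory.is_dir() or not any(directory.iterdir()):
            problems.append(f"missing or empty model directory: {rel}")

    # The compiled extension (.pyd on Windows, .so on Linux) must have been
    # copied into the monotonic_align directory itself.
    align = root / ALIGN_DIR
    built = []
    if align.is_dir():
        built = list(align.glob("*.pyd")) + list(align.glob("*.so"))
    if not built:
        problems.append(f"no compiled extension (.pyd/.so) found in {ALIGN_DIR}")

    return problems


# Run the checks against the current directory and report anything missing.
for problem in preflight():
    print("WARNING:", problem)
```

Running this from the repository root prints one warning per missing item and prints nothing when all checks pass.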

About

Yet another voice assistant, but alive.
