Coder Social home page Coder Social logo

dsi's Introduction

dsi's People

Contributors

jack-cherish avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

dsi's Issues

训练集、验证集区分问题

up主你好,你的工作对我很有帮助。在finetune_webui.py代码中,您似乎使得训练集与验证集相同,请问这种方法是否合适?是否在训练中会出现过拟合的情况?非常感谢
final_annos = cleaned_new_annos
# save annotation file
with open("final_annotation_train.txt", 'w', encoding='utf-8') as f:
for line in final_annos:
f.write(line)
# save annotation file for validation
with open("final_annotation_val.txt", 'w', encoding='utf-8') as f:
for line in cleaned_new_annos:
f.write(line)

开源模型和数据

大神你好,我是新手小白,从 b 站过来的,b 站上合成的视频太牛了,想问一下能不能开源模型和数据,想学习~

关于https://github.com/Jack-Cherish/dsi/blob/main/vits_script/finetune_webui.py

wav, sr = torchaudio.load(os.path.join("separated", "htdemucs", fname, "vocals.wav"), frame_offset=0, num_frames=-1, normalize=True, channels_first=True)
这一行代码在读取VITS_fast_finetune\separated\htdemucs{fname}\vocals.wav这个文件,但是我的separated目录里是空的。而且我看到代码里没有任何地方在separated目录里创建这些文件,请问这是怎么回事呢?

建议为 finetune_webui.py 增加运行环境说明

尝试使用 anaconda + python 3.9.17 / 3.11.4 运行该脚本,持续出现报错:

  1. whisper 安装应该使用 openai-whisper
  2. commons 包安装失败

建议说明需要使用的 python 版本,以及增加 requirements.txt

继续训练的逻辑问题

在前端界面。您声称:“重新训练选择是,接着已经保存的模型继续训练选择否”

然而,当选择“是”时:
continue_train=“是”

这导致get_hparams()函数中,cont = True
if continue_train == "是":
drop_speaker_embed = False
cont = True

在run()函数中的,hps.cont == True,会导致加载G_latest.pth、D_latest.pth。
if hps.cont:
try:
_, _, _, epoch_str = utils.load_checkpoint(utils.latest_checkpoint_path(hps.model_dir, "G_latest.pth"), net_g, None)
_, _, _, epoch_str = utils.load_checkpoint(utils.latest_checkpoint_path(hps.model_dir, "D_latest.pth"), net_d, None)
global_step = (epoch_str - 1) * len(train_loader)

因此,我认为当选择“是”时,继续训练,而选择”否“,重新训练。

音频文件识别失败

2023-09-09 00:44:33 【已完成】音频文件命名修改
2023-09-09 00:44:40 音频文件识别失败
2023-09-09 00:44:40 【已完成】音频对应txt文本生成
2023-09-09 00:44:40 【已完成】数据预处理,训练集、验证集切分
2023-09-09 00:44:40 【进行中】开始训练,训练进度请看后台

请问,这个识别失败是对音频有什么要求吗

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.