
Comments (11)

mushanwei avatar mushanwei commented on August 16, 2024

I know how to train on custom data; just follow the README steps.

from dinet.

Inferencer avatar Inferencer commented on August 16, 2024

It's a bit more difficult than that: loss convergence, etc. I tried to eyeball the results before moving on to each stage, and the results did not match the work I had put into collecting datasets. There are plenty of issues in this repo about similar problems, which is why OP has rather cleverly tried to avoid the headache and learning curve.


mushanwei avatar mushanwei commented on August 16, 2024

> It's a bit more difficult than that: loss convergence, etc. I tried to eyeball the results before moving on to each stage, and the results did not match the work I had put into collecting datasets. There are plenty of issues in this repo about similar problems, which is why OP has rather cleverly tried to avoid the headache and learning curve.

I don't know what your data looks like, but on my data it seems to work. There is some loss convergence issue, but the results don't seem bad.


Inferencer avatar Inferencer commented on August 16, 2024

> I don't know what your data looks like, but on my data it seems to work. There is some loss convergence issue, but the results don't seem bad.

Mine was a single-person dataset, so perhaps there was some overfitting. The dataset itself was high quality: about 3 hours of front-facing footage with studio lighting. The results were higher chin and cheek fidelity and less of a visible bounding box, but jittery lips. Perhaps a more diverse dataset would have resolved this. I saw a lot of people asking about SyncNet training and wasn't 100% sure what that was.


NaMoCv avatar NaMoCv commented on August 16, 2024

> I don't know what your data looks like, but on my data it seems to work. There is some loss convergence issue, but the results don't seem bad.

Have you tried how well it works with Chinese?


Inferencer avatar Inferencer commented on August 16, 2024

> Have you tried how well it works with Chinese?

No, but others have. DeepSpeech wasn't trained on Chinese, though, so the lip movements won't be fully accurate for Chinese audio.


NaMoCv avatar NaMoCv commented on August 16, 2024

> No, but others have. DeepSpeech wasn't trained on Chinese, though, so the lip movements won't be fully accurate for Chinese audio.

I tried training and inference with the examples in the README, but the lips trembled severely, and the mouth kept its original movements when there was no sound. So I think this project is not very good.


 avatar commented on August 16, 2024

I'm not working for money; I'm working for fun. You can try my project below; it will include the full DINet training pipeline in a few days.
https://github.com/primepake/better_wav2lip


Chechgroup avatar Chechgroup commented on August 16, 2024

> I know how to train custom data, just follow the readme steps.

How can I contact you? I would pay; I need your help.


Chechgroup avatar Chechgroup commented on August 16, 2024

> Mine was a single-person dataset, so perhaps there was some overfitting. The dataset itself was high quality: about 3 hours of front-facing footage with studio lighting. The results were higher chin and cheek fidelity and less of a visible bounding box, but jittery lips. Perhaps a more diverse dataset would have resolved this. I saw a lot of people asking about SyncNet training and wasn't 100% sure what that was.

How can I contact you? I would pay; I need your help.


tailangjun avatar tailangjun commented on August 16, 2024

> ...dinet full pipeline in several days..

#New Features: DINet full pipeline training
Very much looking forward to it!

