Coder Social home page Coder Social logo

关于增强后的语音问题 about sednn HOT 15 OPEN

lymiou avatar lymiou commented on June 3, 2024
关于增强后的语音问题

from sednn.

Comments (15)

yongxuUSTC avatar yongxuUSTC commented on June 3, 2024

from sednn.

lymioumm avatar lymioumm commented on June 3, 2024

0dB的时候 首先 任务比较难,确实 有吱吱的声音,主要是模型不够强大,带来的非线性失真

On Fri, 15 May 2020 at 19:37, lymiou @.***> wrote: 您好,我想问下为什么增强后的语音发出来的只有吱吱吱的声音呢? 下面是它对应的语谱图,请问下是出了什么问题呢?期待您的解答,谢谢! [image: image] https://user-images.githubusercontent.com/46339102/82108425-e69e2580-9760-11ea-9ed4-871079d63e87.png — You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub <#57>, or unsubscribe https://github.com/notifications/unsubscribe-auth/ABJGHUXL7VRVOZSQ3G2OFRLRRX35HANCNFSM4NCWY3CQ .

谢谢您的解答,还有一点想麻烦您解答一下,项目里的特征提取采用的是什么方法呢?好像不是MFCC,再次感谢您!

from sednn.

qiuqiangkong avatar qiuqiangkong commented on June 3, 2024

from sednn.

lymioumm avatar lymioumm commented on June 3, 2024

您好, 分离的结果是对的,是否使用了全部数据训练?、

On Sat, 16 May 2020 at 10:37, lymiou @.***> wrote: 您好,我想问下为什么增强后的语音发出来的只有吱吱吱的声音呢? 下面是它对应的语谱图,请问下是出了什么问题呢?期待您的解答,谢谢! [image: image] https://user-images.githubusercontent.com/46339102/82108425-e69e2580-9760-11ea-9ed4-871079d63e87.png — You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub <#57>, or unsubscribe https://github.com/notifications/unsubscribe-auth/ADFXTSMCYXPOHXIGFN3PMQDRRX35HANCNFSM4NCWY3CQ .

您好,是的,使用了mini_data文件夹下的数据,另外train_speech中还另加了一些语音数据

from sednn.

qiuqiangkong avatar qiuqiangkong commented on June 3, 2024

from sednn.

lymioumm avatar lymioumm commented on June 3, 2024

数据太少了,需要使用全部timit数据训练

On Sun, 17 May 2020 at 09:56, lymioumm @.> wrote: 您好, 分离的结果是对的,是否使用了全部数据训练?、 … <#m_-3999892883733078355_> On Sat, 16 May 2020 at 10:37, lymiou @.> wrote: 您好,我想问下为什么增强后的语音发出来的只有吱吱吱的声音呢? 下面是它对应的语谱图,请问下是出了什么问题呢?期待您的解答,谢谢! [image: image] https://user-images.githubusercontent.com/46339102/82108425-e69e2580-9760-11ea-9ed4-871079d63e87.png — You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub <#57 <#57>>, or unsubscribe https://github.com/notifications/unsubscribe-auth/ADFXTSMCYXPOHXIGFN3PMQDRRX35HANCNFSM4NCWY3CQ . 您好,是的,使用了mini_data文件夹下的数据,另外train_speech中还另加了一些语音数据 — You are receiving this because you commented. Reply to this email directly, view it on GitHub <#57 (comment)>, or unsubscribe https://github.com/notifications/unsubscribe-auth/ADFXTSO5K22PTO3HI6A355LRR474NANCNFSM4NCWY3CQ .

好的,明白了,谢谢您!

from sednn.

QianYing1996 avatar QianYing1996 commented on June 3, 2024

您好,看到你跑出了增强后的结果,想请问您是用的python3吗?
有遇到spectrogram_to_wav.py这个文件里real_to_complex这个函数报错吗?
报错是显示相乘的两项维度不一致,具体可以看我提出的issue#58,谢谢您!

from sednn.

qiuqiangkong avatar qiuqiangkong commented on June 3, 2024

from sednn.

QianYing1996 avatar QianYing1996 commented on June 3, 2024

您好,谢谢您的回复,请问可以具体说下怎么看spectrogram的维度是否正确吗?是对训练语料有什么要求吗?
我是用mini_data跑的,没有修改源码,就遇到了这个问题。
我是刚入门语音的小白,麻烦您指点一下,谢谢!

from sednn.

qiuqiangkong avatar qiuqiangkong commented on June 3, 2024

from sednn.

smylab avatar smylab commented on June 3, 2024

您好,我想问下为什么增强后的语音发出来的只有吱吱吱的声音呢?
下面是它对应的语谱图,请问下是出了什么问题呢?期待您的解答,谢谢!
image

from sednn.

smylab avatar smylab commented on June 3, 2024

您好,我是刚接触语音增强的小白,本来是打算用这份代码学习的,结果运行了好几次minidata里的数据,代码是原封不动运行的,环境也是按照要求去配置的,,,但增强后的结果除了吱吱吱吱吱吱吱吱吱吱吱,基本的混合语音都听不到了,只有吱吱吱的声音,我快疯了,,这个怎么回事,您最终怎么解决这个问题的,,万分感谢,我快被搞疯了。

from sednn.

yongxuUSTC avatar yongxuUSTC commented on June 3, 2024

from sednn.

smylab avatar smylab commented on June 3, 2024

你把增强的wav和mix wav发给我听听,用mini data 训练数据是不够的,需要更大的训练数据来训练,才有比较好的结果。另外你可以试下SNR高的例子 增强看看效果,验证代码是否正确

On Fri, 23 Apr 2021 at 22:09, smylab @.***> wrote: 您好,我是刚接触语音增强的小白,本来是打算用这份代码学习的,结果运行了好几次minidata里的数据,代码是原封不动运行的,环境也是按照要求去配置的,,,但增强后的结果除了吱吱吱吱吱吱吱吱吱吱吱,基本的混合语音都听不到了,只有吱吱吱的声音,我快疯了,,这个怎么回事,您最终怎么解决这个问题的,,万分感谢,我快被搞疯了。 — You are receiving this because you commented. Reply to this email directly, view it on GitHub <#57 (comment)>, or unsubscribe https://github.com/notifications/unsubscribe-auth/ABJGHUSIN6A2F6W3K3SABN3TKJHAVANCNFSM4NCWY3CQ .

感谢您的回复,谢谢。这是0db和5db的minidata测试结果,按道理说如果因为数据集小,没充分的训练模型,增强后的结果最起码不会比原始的混合语音差才对啊。感谢您给出的建议,这个问题困惑了我好几天,我马上用大数据集再训练测试一下。

sednn_minidata_test.zip

from sednn.

smylab avatar smylab commented on June 3, 2024

您好,我想问下为什么增强后的语音发出来的只有吱吱吱的声音呢?
下面是它对应的语谱图,请问下是出了什么问题呢?期待您的解答,谢谢!
image

你把增强的wav和mix wav发给我听听,用mini data 训练数据是不够的,需要更大的训练数据来训练,才有比较好的结果。另外你可以试下SNR高的例子 增强看看效果,验证代码是否正确

On Fri, 23 Apr 2021 at 22:09, smylab @.***> wrote: 您好,我是刚接触语音增强的小白,本来是打算用这份代码学习的,结果运行了好几次minidata里的数据,代码是原封不动运行的,环境也是按照要求去配置的,,,但增强后的结果除了吱吱吱吱吱吱吱吱吱吱吱,基本的混合语音都听不到了,只有吱吱吱的声音,我快疯了,,这个怎么回事,您最终怎么解决这个问题的,,万分感谢,我快被搞疯了。 — You are receiving this because you commented. Reply to this email directly, view it on GitHub <#57 (comment)>, or unsubscribe https://github.com/notifications/unsubscribe-auth/ABJGHUSIN6A2F6W3K3SABN3TKJHAVANCNFSM4NCWY3CQ .

感动哭,感谢作者对我们初学者的一一回复,大数据集训练就没有吱吱吱吱吱的声音了,增强效果很好,之前一直找不到问题所在,感谢。

from sednn.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.