wangtianrui / dccrn Goto Github PK

View Code? Open in Web Editor NEW

48.0 48.0 16.0 1.22 MB

implementation of "DCCRN-Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement" by pytorch

Python 100.00%

dccrn's People

Contributors

Stargazers

Watchers

Forkers

cst781 hh1992 shishichang zhaoliang1983x rafle0 ishine jasonzhang892 iron-y kangyuanxun cenwurong j-muyan martinmml fragrantrookie jeffery-work wendonggan

dccrn's Issues

Hello, I have a question about dataset

Can you tell where I can find the dataset, I find the dataset from "https://github.com/microsoft/DNS-Challenge/tree/master/datasets", but the file of dataset is too big. Are you simplifying the dataset? Thank you very much if you canm answer

关于数据集

请问数据集在哪里下载呀，需要怎么设置？

JSONDecodeError

Traceback (most recent call last):
File "train.py", line 19, in
my_system = MyBaseSystem("./conf.yml")
File "/root/DCCRN-master/BaseModel.py", line 38, in init
data_home=conf["preset"]["data_home"]
File "/root/DCCRN-master/BaseModel.py", line 106, in get_dns_data_loader
val_set = DNSDataset(val_json_file, data_home=data_home)
File "/root/DCCRN-master/dns_loader.py", line 21, in init
self.mix_infos = json.load(f)
File "/root/anaconda3/envs/DCCRN/lib/python3.7/json/init.py", line 296, in load
parse_constant=parse_constant, object_pairs_hook=object_pairs_hook, **kw)
File "/root/anaconda3/envs/DCCRN/lib/python3.7/json/init.py", line 348, in loads
return _default_decoder.decode(s)
File "/root/anaconda3/envs/DCCRN/lib/python3.7/json/decoder.py", line 337, in decode
obj, end = self.raw_decode(s, idx=_w(s, 0).end())
File "/root/anaconda3/envs/DCCRN/lib/python3.7/json/decoder.py", line 355, in raw_decode
raise JSONDecodeError("Expecting value", s, err.value) from None
json.decoder.JSONDecodeError: Expecting value: line 1 column 1 (char 0)

关于训练数据

"{"mix": "F:\Traindata\DNS-Challenge\make_data\noisy\book_09395_chp_0023_reader_07848_7_door_Freesound_validated_112347_5-door_Freesound_validated_458454_2-P2-3vmp1KWA_snr20_fileid_3154.wav", "clean": "F:\Traindata\DNS-Challenge\make_data\clean\clean_fileid_3154.wav", "noise": "F:\Traindata\DNS-Challenge\make_data\noise\noise_fileid_3154.wav", "snr": 20, "file_len": 480000"
你好请问下大概需要多少的数据可以训练一个通用的去混响模型，我准备训练24k的模型

模型训练过程中ComplexBatchNorm函数导致内存爆炸

您好：

当我在模型中使用ComplexBatchNorm时，训练一段时间内存就会爆炸。但是我把ComplexBatchNorm换成torch自带的BatchNorm就不会出现该问题。请问您在训练DCCRN的时候没有出现这个情况吗？

谢谢；
祝好！

Inference code

Hello, Thank you for sharing your code. Can you please the inference script as well.

training loss

can you give me the loss log? thx

below is my loss

Epoch 0: 100%|██████████| 21209/21209 [15:24:41<00:00, 2.62s/it, loss=13.395, v_num=0]ts/DCCRN# tail -f nohup.out
Epoch 00000: val_loss reached 13.25967 (best 13.25967), saving model to /data/juicefs_speech_tts_v3/11139034/work/ptts/DCCRN/logs/_ckpt_epoch_0.ckpt as top 5
Epoch 1: 29%|██▉ | 6162/21209 [4:45:09<11:36:18, 2.78s/it, loss=14.446, v_num=0, val_loss=13.3]

is it right?

发错了

AttributeError: can't set attribute

你好，我安装了torch==1.6.0和asteroid==0.3.4，我的cuda版本为10.2，但是在运行train.py文件的时候报错：AttributeError: can't set attribute，具体报错如下：
Traceback (most recent call last):
File "/media/li/TOSHIBA EXT/WDCCRN /DCCRN-master/train.py", line 21, in
my_system.init_system_and_trainer(SystemClass=MySystem, model=model_, gpus=[0])
File "/media/li/TOSHIBA EXT/WDCCRN /DCCRN-master/BaseModel.py", line 85, in init_system_and_trainer
config=self.conf
File "/home/li/anaconda3/envs/W-DCCRN/lib/python3.6/site-packages/asteroid/engine/system.py", line 58, in init
self.hparams = Namespace(**self.config_to_hparams(config))
File "/home/li/anaconda3/envs/W-DCCRN/lib/python3.6/site-packages/torch/nn/modules/module.py", line 1178, in setattr
object.setattr(self, name, value)
AttributeError: can't set attribute
请问我该如何解决这个问题呢？

关于模型训练

hello，我有个疑问，训练模型的时候是直接输入原始30s的音频吗，还是要提前切片5s或者更短的音频作为模型输入？谢谢解答

关于json文件

哈喽，请问有生成json文件的代码吗？

how to reduce the cuda memory?

maybe the batch size 2 is too small, but I cannot use two RTX 2080Ti to train this model with batch size 4

download DNS dataset

Hello, I tried to download the data set using the official method, and I also tried to download the data set through "wget", but the data was not available, code show as below:
for root,dirs,files in os.walk(args.datasetpath):
wav_files = []
path = "https://github.com/microsoft/DNS-Challenge/tree/master/datasets"
for line in files:
if line.endswith(".wav"):
wav_files.append(os.path.join(root,line))
else:
pass
for file in wav_files:
URL = path+file[37:]
cmd = "wget -p " +file+" "+URL
os.system(cmd)