lochenchou / mosnet Goto Github PK
View Code? Open in Web Editor NEWImplementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"
License: Other
Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"
License: Other
wavs.tar.gz
Hello,
I tried your model on these audios and the scores were all around 3. These scores are not terrible, but for real audio I would expect scores close to 5 with some consistency. Am I wrong for expecting this? Please take a look if you have the time. Thanks!
Hello,
Would you be willing to add a license to this repository? See: https://opensource.stackexchange.com/questions/1720/what-can-i-assume-if-a-publicly-published-project-has-no-license
Thank you.
Such as cnn_blstm.h5
I tried to run the test.py after following the "Usage" instructions in the README for this rep. I got this error:
FileNotFoundError: [Errno 2] No such file or directory: './data/mos_list.txt'
I assume that this file should exist under "./data", and it currently doesn't. Thanks.
When bash download.sh command error.File "mosnet/lib/python3.5/site-packages/google/protobuf/internal/containers.py", line 349
f"{self.class.name} object does not support item assignment")
^
SyntaxError: invalid syntax
How can I solve this problem?
When I run
python train.py --model CNN-BLSTM
It reported an error and said
FileNotFoundError: [Errno 2] Unable to open file (unable to open file: name = './data/bin/N19_VCC2TF2_VCC2SM1_30017_HUB.h5', errno = 2, error message = 'No such file or directory', flags = 0, o_flags = 0)
What can I do to get it solved?
I am using [(https://github.com/aliutkus/speechmetrics)], which is a kind of wrapper for your repository, to evaluate the results and the output is as follows, I am not able to understand why there is a 5 value array as output to a single input.
{'mosnet': array([4.98537636, 4.95263338, 4.69211102, 5.06538916, 5.01724768])}
I was wondering the exact range of mosnet and srmr ,cuz I have seen few utterances got a result which is larger than 5 ,even up to 8.xx. Really appreciate your answer!🌼
Hi
Thank you very much for providing this project. Can this project be used to evaluate the speech quality after front-end signal processing (AEC--Acoustic Echo Canceller, NS --noise suppression , AGC - automatic gain control)?
Thanks!
does the mosnet can only run in 2080? when running in 3090,many errors happen
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.