Comments (11)
From @rajagopal28 on May 18, 2016 8:35
It seems like an API problem, I've tried hitting the actual endpoint with a POST request along with the documentation specified parameters and headers. I get the response { 'status' : 'Bad request', message: 'Not a valid WAVE file - No RIFF header'. I've tried with multipart/form-data and using file input from postman REST client. I've also tried to hit the API endpoint in the actual console provided by Microsoft (which doesn't have any way to pass the wave file as file input) by encoding the audio file into string(which starts with data:audio/wav;base64..) Can anyone from Microsoft answer this. I know its in preview stage, but it should have some understandable instructions and parameter details.
from cognitive-speakerrecognition-python.
From @momohs on May 18, 2016 9:31
Hi @rajagopal28,
Thanks for your comments. Can you please attach the *.wav file used for enrollment?
from cognitive-speakerrecognition-python.
From @rajagopal28 on May 18, 2016 10:42
I've used 3 files, I'm attaching all the three
Archive.zip
@momohs I see that you are from Microsoft. In the API console link for enrolling and verifying there are text fields to send the audio file, In what format it should be sent? I used base64 encoded text (as mentioned above), I get the same error. Can you please clarify this? Thanks for your comment.
from cognitive-speakerrecognition-python.
From @cthrash on May 19, 2016 2:23
It looks like the enrollment audio is too short. The audio file should be at least 20 seconds long and no longer than 5 minutes. The minimum number of total speech needed for enrollment, after removing silence, is 60 seconds.
@momohs - one improvement to consider is to include the response body in the exception. In this case it would have made the error much more obvious: { "error": { "code": "BadRequest", "message": "Audio too short" }
from cognitive-speakerrecognition-python.
From @rajagopal28 on May 19, 2016 7:15
@cthrash Thank you so much. It worked, I enrolled a voice phrase to the created profile.
It would be better if there is a way to know this message('Not a valid WAVE file - No RIFF header' or 'Audio too short') in the python wrapper log. It only shows the code ('BadRequest'), which is not so helping in identifying the issue.
from cognitive-speakerrecognition-python.
From @momohs on May 19, 2016 16:41
@rajagopal28 I have tried out the files you sent and I did some successful enrollments with them. However, the file "password.wav" has an incorrect sampling rate. and thus gave me an "incorrect sampling rate error". I have used a REST client for this.
Regarding the python wrapper, the enrollments were successful but I have received a "Bad request" for the file "password.wav". Indeed the exception needs to be better handled in the python wrapper.
Using the console, I am not sure how to attach the file to the request. I am in contact with the team responsible for that. I'll get back to you once it is sorted out.
@cthrash The "Audio Too Short" exception message is currently thrown out by the server if the audio is too short. At this moment, the audio should be from 1 to 15 seconds (as mentioned in the API Documentation)
from cognitive-speakerrecognition-python.
From @cthrash on May 19, 2016 18:8
1-15 seconds, IIUC, is for Speaker Verification. In the Stack Overflow Post, @rajagopal28 is asking (despite the title) about Speaker Identification, as you can see from the call stack.
from cognitive-speakerrecognition-python.
From @jjsuarez on June 25, 2016 22:21
Hello, I am also having problems enrolling an audio file in the API testing console. Please can you answer the question that @rajagopal28 asked, what format should be used in the Request body field? I am getting the same error: {
"error": {
"code": "BadRequest",
"message": "Invalid Audio Format: Not a WAVE file - no RIFF header"
}
}
My file is recorded according to the required parameter values of format and length. Any help would be greatly appreciated. Thanks a lot.
from cognitive-speakerrecognition-python.
From @momohs on June 26, 2016 13:51
Thanks for your feedback @jjsuarez!
We are aware of the issue with uploading audio files using the API Testing Console and we are still sorting it out! Meanwhile, I urge you to use the Python sample code or the C# sample code or the Online demos to test the Speaker Recognition service.
from cognitive-speakerrecognition-python.
I will close this issue for now, please feel free to re-open if you still have a problem.
from cognitive-speakerrecognition-python.
I am getting
{
"error": {
"code": "BadRequest",
"message": "InvalidPhrase"
}
}
from cognitive-speakerrecognition-python.
Related Issues (13)
- Please create a simple Speaker identification example. The documentation as well as the github are unorganized and confusing. HOT 2
- Opearation Error: SpeakerInvalid
- mic/streaming
- How to change the predefined phrases ?
- DEPENDENCIES LIST ? HOT 1
- AN EASIER WAY TO GET STARTED HOT 6
- This repo is missing important files HOT 1
- Ruby API returns empty headers, no way to track operations HOT 5
- Error enrolling HOT 5
- No module named http.client HOT 3
- Guid should contain 32 digits with 4 dashes HOT 1
- Error: message Guid should contain 32 digits with 4 dashes (xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx) HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from cognitive-speakerrecognition-python.