mmig / speech-to-flac
Example for client-side encoding of microphone audio into FLAC data
License: Other
Hello,
Thanks for this amazing code! I am trying to get it to work with the Google Cloud API. I added the API key, but I am getting this error:
Response to preflight request doesn't pass access control check: No 'Access-Control-Allow-Origin' header is present on the requested resource.
Any ideas?
Thanks!
Daniel
Does the lib allow you to get the .wav or FLAC-encoded data for an arbitrary timeslice? I'm looking to abstract the Google speech-to-text service away from the client, so the plan is to create the .wav or FLAC data at, say, 4-second intervals and pipe it to a server that handles the rest of the workflow. I couldn't see anything in the README; any pointers would be greatly appreciated.
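One way to approach the timeslice idea, independent of any particular encoder, is to regroup the incoming PCM blocks into fixed-length slices before encoding each slice on its own. The following is a minimal sketch under stated assumptions: audio arrives as `Float32Array` blocks (for example from a Web Audio processing callback), and `createTimeslicer`, `push`, and `flush` are hypothetical names for illustration, not part of speech-to-flac.

```javascript
// Hypothetical helper (not part of speech-to-flac): regroups incoming
// Float32Array PCM blocks into slices of a fixed duration.
function createTimeslicer(sampleRate, sliceSeconds, onSlice) {
  const sliceLength = Math.round(sampleRate * sliceSeconds);
  let pending = new Float32Array(sliceLength);
  let filled = 0; // number of samples currently stored in `pending`

  return {
    // Add one block of samples; calls onSlice for every completed slice.
    push(block) {
      let offset = 0;
      while (offset < block.length) {
        const count = Math.min(sliceLength - filled, block.length - offset);
        pending.set(block.subarray(offset, offset + count), filled);
        filled += count;
        offset += count;
        if (filled === sliceLength) {
          onSlice(pending);
          pending = new Float32Array(sliceLength);
          filled = 0;
        }
      }
    },
    // Emit whatever is left as a final, shorter slice (e.g. on stop).
    flush() {
      if (filled > 0) {
        onSlice(pending.slice(0, filled));
        pending = new Float32Array(sliceLength);
        filled = 0;
      }
    },
  };
}
```

With a 44100 Hz input, `createTimeslicer(44100, 4, encodeAndUpload)` would hand a 4-second slice to `encodeAndUpload` (also a hypothetical callback) each time one fills up. Each slice would then be encoded and sent to the server as a separate file.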
I can't download the recording as a FLAC file, and the demo site has the same problem.
It works fine on localhost, but when I tried to run it on a server with HTTPS (because getUserMedia() no longer works on insecure origins), it throws an error:
Error: encode_buffer_pcm_as_flac returned false.
Uncaught abort(0) at Error
at jsStackTrace (https://mmig.github.io/speech-to-flac/libflac3-1.3.2.min.js:1:21182)
at stackTrace (https://mmig.github.io/speech-to-flac/libflac3-1.3.2.min.js:1:21365)
at abort (https://mmig.github.io/speech-to-flac/libflac3-1.3.2.min.js:14:8583)
at Array.sg (https://mmig.github.io/speech-to-flac/libflac3-1.3.2.min.js:7:41450)
at Hb (https://mmig.github.io/speech-to-flac/libflac3-1.3.2.min.js:5:12925)
at wb (https://mmig.github.io/speech-to-flac/libflac3-1.3.2.min.js:5:1880)
at $c (https://mmig.github.io/speech-to-flac/libflac3-1.3.2.min.js:5:100657)
at Xc (https://mmig.github.io/speech-to-flac/libflac3-1.3.2.min.js:5:98850)
at zd (https://mmig.github.io/speech-to-flac/libflac3-1.3.2.min.js:6:57725)
at sd (https://mmig.github.io/speech-to-flac/libflac3-1.3.2.min.js:6:44703)
If this abort() is unexpected, build with -s ASSERTIONS=1 which can give more information.
Even when I run the demo at https://mmig.github.io/speech-to-flac/, the same error is thrown. Can anyone help me?
With the update to Google's Cloud Speech, I am not able to get the API credentials to work properly. It keeps returning a 400 error. Any help?
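For reference when debugging a 400 response, here is a minimal sketch of the JSON body that the Cloud Speech-to-Text v1 `speech:recognize` REST method expects for FLAC input. It is only an illustration under assumptions: the helper names `bytesToBase64` and `buildRecognizeRequest` are hypothetical, the sample rate must match the one used when encoding, and whether a malformed body is the actual cause of the error reported above is not known.

```javascript
// Hypothetical helper: base64-encodes a Uint8Array in the browser or in Node.js.
function bytesToBase64(bytes) {
  if (typeof btoa === "function") {
    let binary = "";
    const step = 0x8000; // convert in chunks to avoid oversized argument lists
    for (let i = 0; i < bytes.length; i += step) {
      binary += String.fromCharCode.apply(null, bytes.subarray(i, i + step));
    }
    return btoa(binary);
  }
  return Buffer.from(bytes).toString("base64"); // fallback for Node.js without btoa
}

// Hypothetical helper: builds the request body for speech:recognize (v1).
function buildRecognizeRequest(flacBytes, sampleRateHertz, languageCode) {
  return {
    config: {
      encoding: "FLAC",
      sampleRateHertz: sampleRateHertz, // must match the encoded audio
      languageCode: languageCode,       // e.g. "en-US" or "fr-FR"
    },
    audio: { content: bytesToBase64(flacBytes) },
  };
}
```

The resulting object would be sent as `JSON.stringify(body)` in a POST to `https://speech.googleapis.com/v1/speech:recognize?key=YOUR_API_KEY` with a `Content-Type: application/json` header, where `YOUR_API_KEY` is a placeholder.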
Hi,
Heads up: this question is about which approach will give better results.
We are facing some issues with French speech recognition (we are using Google AI services). The current process is as follows:
Record using the MediaRecorder JS API --> Save as webm --> Extract FLAC using FFMPEG --> Invoke the Google Speech API
After looking at your library, it seems that you are converting the audio data received from MediaRecorder to FLAC in the browser itself (which implies server-side processing is not required).
Now to the main point: do you think the choice between these two approaches will have an impact on the transcription quality?