Coder Social home page Coder Social logo

Plan for the next release about vosk-api HOT 12 OPEN

alphacep avatar alphacep commented on May 11, 2024 9
Plan for the next release

from vosk-api.

Comments (12)

dtreskunov avatar dtreskunov commented on May 11, 2024 2

I can probably help with building for more platforms, including Windows and different flavors of ARM.

One approach is to use cross-compiler images from dockcross.

Let me know if you would consider merging a PR using GitHub actions to automate building/uploading Python wheels.

from vosk-api.

bedapudi6788 avatar bedapudi6788 commented on May 11, 2024 1

@nshmyrev I can do Implement more pythonic API, Automatic download of the model and Basic unit tests if you are still looking for help.

from vosk-api.

nshmyrev avatar nshmyrev commented on May 11, 2024

Let me know if you would consider merging a PR using GitHub actions to automate building/uploading Python wheels.

I checked github actions and didn't find critical advantages over travis. You can try if you want of course.

from vosk-api.

dtreskunov avatar dtreskunov commented on May 11, 2024

I made some progress on getting a Windows build out. I'm currently blocked on some errors when compiling OpenFST (opened this issue: kkm000/openfst#25).

This is much more complicated than expected :)

from vosk-api.

nshmyrev avatar nshmyrev commented on May 11, 2024

@dtreskunov Thank you. For Win build I really consider mingw with anaconda, that will be much easier than stock python with VS. I think anaconda build should be enough. We don't have to spend much time on it.

from vosk-api.

nshmyrev avatar nshmyrev commented on May 11, 2024

Also, as far as I know @kkm000 uses very recent openfst, you might have better chances with openfst-1.6.7 instead of 1.7 something he is using.

from vosk-api.

dtreskunov avatar dtreskunov commented on May 11, 2024

from vosk-api.

nshmyrev avatar nshmyrev commented on May 11, 2024

@dtreskunov ah, I see. Given openfst uses C++11, no chance we can make it working. VS14 simply doesn't support c++11 well enough.

https://docs.microsoft.com/en-us/cpp/overview/visual-cpp-language-conformance?view=vs-2019

Consider mingw and anaconda.

from vosk-api.

dtreskunov avatar dtreskunov commented on May 11, 2024

I was able to fix it thanks to this article. It turns out that the ImportError was getting thrown because the generated _vosk.pyd depended on libopenblas.dll. This despite having provided libopenblas.lib to the linker. By putting libopenblas.dll and its dependencies into the .whl, I was able to get it to work.

The resulting .whl installs and works fine in my plain Python environment: no anaconda, just regular Windows x86-64 executable installer.

I would appreciate it if you could test it: vosk-0.3.1_dtreskunov_183_ga8b2c22-cp38-cp38-win_amd64.whl.zip

I'm not sure why it works. It's built using Visual Studio 15 2017, despite the requirement to use Visual Studio 14 2015.

I'm going to clean up the scripts a bit and will send a PR soon.

from vosk-api.

nshmyrev avatar nshmyrev commented on May 11, 2024

@bedapudi6788 that would be great!

from vosk-api.

bedapudi6788 avatar bedapudi6788 commented on May 11, 2024

@nshmyrev this is my idea of "pythonic" api.

import vosk

# print available models
print(vosk.list_models())

# auto downloads the model if not found in local
asr = vosk.load("en-us")

# word_list is optional
# if stream=True, will return iterator for partial results
# if stream=False will return final result
# default stream = False
result = asr.recognize('wav_file_path', word_list, stream=True)

let me know if this is ok or any changes required.

from vosk-api.

nshmyrev avatar nshmyrev commented on May 11, 2024

@bedapudi6788 I have created a separate issue about this: #31, please check

from vosk-api.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.