Coder Social home page Coder Social logo

ssciwr / vink Goto Github PK

View Code? Open in Web Editor NEW
13.0 7.0 0.0 327 KB

A stand-alone application with GUI for OpenAI's Whisper

License: Other

Python 69.16% NSIS 30.84%
whisper gui openai pyinstaller speech-to-text transcription whisper-ai hacktoberfest iwr-hacktoberfest

vink's Introduction

Vink - an OpenAI Whisper standalone distribution

DOI

This is a stand-alone application that packages OpenAI's Whisper into a distribution that does not require users to have Python installed. Additionally, it provides a minimalistic graphical user interface for transcription.

Installation on Windows 10+

Please download the zipped distribution from Zenodo and unpack it to a target location of your choice:

Usage on Windows

Please install by double clicking on the downloaded installer. Running vink.exe (e.g. from your start-up menu) will fire up the graphical user interface, allowing you to specify input and output files, a model and device to run on.

If you want to work with whisper's command line interface instead, you can do so by providing arguments on the command_line:

vink --help

Linux

Currently, there is no (working) stand-alone packaging for Linux due to issues in pyinstaller's handling of PySide6. If you are still interested in using the GUI from Ubuntu/Debian, the following sequence of commands will install dependencies and run locally:

sudo apt install ffmpeg python3 python3-pip git
git clone https://github.com/ssciwr/vink.git
cd vink
python -m pip install -r requirements.txt
python vink.py

Licensing

The code provided in this project itself is covered by the MIT license. The overall distribution is also licensed under the MIT license. For details about bundled components and their license, please check LICENSE.md.

vink's People

Contributors

dokempf avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar

vink's Issues

updating to the letest version of whisper

there is any way that you can share exact command to use pyinstaller how to compile this package that the command line will still working.
if you interest I have personal project to complete and I'm planning to use whisper as a asr model. i can pay for it
please contact [email protected] if this is something that you interest.

Encoding issues for text output

Transcribing e.g. russian results in encoding issues on Windows. A fix should be to switch to explicitly writing as utf-8.

Pin whisper version

Whisper recently added a release tag. We should pin that tag for robustness.

Linux script can't find run_whisper.py

Following install instructions for linux:

sudo apt install ffmpeg python3 python3-pip git
git clone https://github.com/ssciwr/whisper-standalone.git
cd whisper-standalone
python -m pip install -r requirements.txt
python run_whisper.py

[Errno 2] No such file or directory

All works well until the last line, can't find the file. Which is not surprising since the file is absent from the repo.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.