gustavostz / whisper-clip

WhisperClip simplifies your life by automatically transcribing audio recordings and saving the text directly to your clipboard. With just a click of a button, you can effortlessly convert spoken words into written text, ready to be pasted wherever you need it. This application harnesses the power of OpenAI’s Whisper for free.

Home Page: https://openai.com/research/whisper

License: MIT License

Python 100.00%

audio-processing audio-transcription clipboard openai productivity productivity-tools python speech-recognition speech-to-text whisper whisper-ai

whisper-clip's Issues

Create a Video Demonstration of WhisperClip Usage

Description

We are looking for a contributor to create a video demonstration showing how to use WhisperClip. The video should cover the installation process, basic usage, and highlight the key features of the application.

Requirements

The video should be clear and easy to understand.
It should cover the following:
- Cloning the repository and setting up the environment.
- Running the application.
- Starting and stopping audio recording.
- Enabling the "Save to Clipboard" option and demonstrating its use.
- Changing the default shortcut and Whisper model in the config.json file (optional).
The video can be of any length, as long as it effectively demonstrates how to use WhisperClip.

Reward

We value your contribution! If your video is selected, we will feature it on the home page (README section) of the WhisperClip GitHub repository. This could be a good opportunity to promote your channel or video, as it will be visible to all visitors of the project.

Submission

Please submit your video by adding a comment to this issue with a link to the video (e.g., YouTube, Vimeo). We look forward to seeing your submissions!

winsound module issue

I'm having an issue running this on my MacBook Pro. It fails with ModuleNotFoundError: No module named 'winsound', raised from audio_recorder.py.
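
For context, winsound is part of the Python standard library on Windows only, so the import fails on macOS and Linux. One possible workaround is to guard the import and fall back to something portable. The sketch below assumes the module is only used to play a short notification sound; the function name play_beep and its parameters are hypothetical and not taken from audio_recorder.py.

```python
import sys

try:
    import winsound  # standard library, but only available on Windows
except ImportError:
    winsound = None  # macOS / Linux: use the fallback below


def play_beep(frequency_hz=1000, duration_ms=200):
    """Play a short notification sound and return the backend that was used.

    Hypothetical helper: the name and defaults are illustrative only.
    """
    if winsound is not None:
        # winsound.Beep takes a frequency in hertz and a duration in milliseconds.
        winsound.Beep(frequency_hz, duration_ms)
        return "winsound"
    # Portable fallback: emit the ASCII bell character. Whether it is
    # audible depends on the terminal settings.
    sys.stdout.write("\a")
    sys.stdout.flush()
    return "bell"
```

With a guard like this the application can at least start on non-Windows systems, even if the fallback sound is silent in some terminals.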

Unable to run normally after clicking the button

I followed the README steps before running the application.
After I run python main.py, a UI window with a microphone button pops up.
When I click the microphone, the following errors appear:

python main.py
Exception in thread Thread-6 (record_audio):
Traceback (most recent call last):
File "C:\Users\qaz21\AppData\Local\Programs\Python\Python312\Lib\threading.py", line 1073, in _bootstrap_inner
self.run()
File "C:\Users\qaz21\AppData\Local\Programs\Python\Python312\Lib\threading.py", line 1010, in run
self._target(*self._args, **self._kwargs)
File "C:\Codes\whisper-clip\audio_recorder.py", line 96, in record_audio
with sd.InputStream(callback=self.audio_callback):
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "C:\Users\qaz21\AppData\Local\Programs\Python\Python312\Lib\site-packages\sounddevice.py", line 1421, in __init__
_StreamBase.__init__(self, kind='input', wrap_callback='array',
File "C:\Users\qaz21\AppData\Local\Programs\Python\Python312\Lib\site-packages\sounddevice.py", line 817, in __init__
_get_stream_parameters(kind, device, channels, dtype, latency,
File "C:\Users\qaz21\AppData\Local\Programs\Python\Python312\Lib\site-packages\sounddevice.py", line 2660, in _get_stream_parameters
info = query_devices(device)
^^^^^^^^^^^^^^^^^^^^^
File "C:\Users\qaz21\AppData\Local\Programs\Python\Python312\Lib\site-packages\sounddevice.py", line 569, in query_devices
raise PortAudioError(f'Error querying device {device}')
sounddevice.PortAudioError: Error querying device -1
Exception in Tkinter callback
Traceback (most recent call last):
File "C:\Users\qaz21\AppData\Local\Programs\Python\Python312\Lib\tkinter\__init__.py", line 1967, in __call__
return self.func(*args)
^^^^^^^^^^^^^^^^
File "C:\Codes\whisper-clip\audio_recorder.py", line 54, in toggle_recording
self.stop_recording()
File "C:\Codes\whisper-clip\audio_recorder.py", line 69, in stop_recording
audio_data = np.concatenate(self.recordings)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
ValueError: need at least one array to concatenate
Exception in thread Thread-7 (record_audio):
Traceback (most recent call last):
File "C:\Users\qaz21\AppData\Local\Programs\Python\Python312\Lib\threading.py", line 1073, in _bootstrap_inner
self.run()
File "C:\Users\qaz21\AppData\Local\Programs\Python\Python312\Lib\threading.py", line 1010, in run
self._target(*self._args, **self._kwargs)
File "C:\Codes\whisper-clip\audio_recorder.py", line 96, in record_audio
with sd.InputStream(callback=self.audio_callback):
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "C:\Users\qaz21\AppData\Local\Programs\Python\Python312\Lib\site-packages\sounddevice.py", line 1421, in __init__
_StreamBase.__init__(self, kind='input', wrap_callback='array',
File "C:\Users\qaz21\AppData\Local\Programs\Python\Python312\Lib\site-packages\sounddevice.py", line 817, in __init__
_get_stream_parameters(kind, device, channels, dtype, latency,
File "C:\Users\qaz21\AppData\Local\Programs\Python\Python312\Lib\site-packages\sounddevice.py", line 2660, in _get_stream_parameters
info = query_devices(device)
^^^^^^^^^^^^^^^^^^^^^
File "C:\Users\qaz21\AppData\Local\Programs\Python\Python312\Lib\site-packages\sounddevice.py", line 569, in query_devices
raise PortAudioError(f'Error querying device {device}')
sounddevice.PortAudioError: Error querying device -1
Exception in Tkinter callback
Traceback (most recent call last):
File "C:\Users\qaz21\AppData\Local\Programs\Python\Python312\Lib\tkinter\__init__.py", line 1967, in __call__
return self.func(*args)
^^^^^^^^^^^^^^^^
File "C:\Codes\whisper-clip\audio_recorder.py", line 54, in toggle_recording
self.stop_recording()
File "C:\Codes\whisper-clip\audio_recorder.py", line 69, in stop_recording
audio_data = np.concatenate(self.recordings)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
ValueError: need at least one array to concatenate
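
A note on reading this trace: "Error querying device -1" usually means PortAudio could not find a default input device, for example because no microphone is connected or enabled. The ValueError then looks like a consequence rather than a separate bug: if the input stream never opens, no audio chunks are collected, so np.concatenate receives an empty list. A quick way to check what the system exposes is sketched below. The helper names are hypothetical; the only assumption about sounddevice is that query_devices() returns dict-like entries with "name" and "max_input_channels" keys.

```python
def find_input_devices(devices):
    """Return (index, name) pairs for every device that can record audio.

    `devices` is expected to look like the result of sounddevice.query_devices():
    a sequence of dict-like entries with "name" and "max_input_channels".
    """
    return [
        (index, dev["name"])
        for index, dev in enumerate(devices)
        if dev.get("max_input_channels", 0) > 0
    ]


def describe_inputs(devices):
    """Build a short, human-readable report of the available input devices."""
    inputs = find_input_devices(devices)
    if not inputs:
        return "No input devices found. Check that a microphone is connected and enabled."
    lines = ["Input devices:"]
    lines.extend(f"  [{index}] {name}" for index, name in inputs)
    return "\n".join(lines)


# Usage with the real library (needs sounddevice and a working PortAudio):
#   import sounddevice as sd
#   print(describe_inputs(sd.query_devices()))
#   print(sd.default.device)  # an index of -1 means no default device
```

If the report lists no input devices, the fix is most likely in the operating system's sound settings rather than in WhisperClip. Independently of that, stop_recording could check that self.recordings is non-empty before calling np.concatenate, so a failed stream produces a clear message instead of a second traceback.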
