Comments (7)
Hello, and thanks for the input. Please open a PR with any changes you think are useful and we can discuss them together.
from whisper-diarization.
@MahmoudAshraf97, I would appreciate your take on this! Thanks for sharing your work!
@MahmoudAshraf97, thanks for your understanding! This is what I want to do:
- Leave existing functionality as-is.
- Make console output optional. Currently, many messages, warnings, and logs are printed to the command line (see the attached whisper_diarization_stdout.txt). Users should be able to choose whether they see them.
- Allow the whole pipeline to run locally, meaning users can download all the models into a directory beforehand. Faster-whisper and the whisperX load_align_model already support this. I can check whether the other models can be used the same way. Do you know if this is feasible? What other models are used in this pipeline? I still have to go through the code and don't have the answer yet.
- Format the code for readability and usability.
Let me know what you think. It will take some time to make all these changes. Before I spend any time, I wanted to align with you. Thanks!
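To make the two optional behaviours in the list above concrete, here is a minimal sketch using only the Python standard library. The flag names `--verbose` and `--model-dir` and the helper names are hypothetical and are not taken from the repository. The keyword arguments returned by `whisper_load_kwargs` assume that faster-whisper's `WhisperModel` accepts `download_root` and `local_files_only`, which should be checked against the installed version.

```python
import argparse
import logging
import warnings


def build_parser() -> argparse.ArgumentParser:
    """Build a parser with the two proposed, purely optional flags (hypothetical names)."""
    parser = argparse.ArgumentParser(description="Sketch of the proposed optional flags")
    parser.add_argument("--verbose", action="store_true",
                        help="show library logs and warnings (default: quiet)")
    parser.add_argument("--model-dir", default=None,
                        help="directory that already holds the downloaded models")
    return parser


def configure_output(verbose: bool) -> int:
    """Set the root logging level and, when quiet, silence Python warnings.

    Loggers that set their own level explicitly, and libraries that print
    directly to stdout, are not affected by this.
    """
    level = logging.INFO if verbose else logging.ERROR
    logging.getLogger().setLevel(level)
    if not verbose:
        warnings.filterwarnings("ignore")
    return level


def whisper_load_kwargs(model_dir):
    """Return keyword arguments for a local-only model load.

    Assumption: these names match faster-whisper's WhisperModel constructor.
    With no directory given, nothing changes and the default download applies.
    """
    if model_dir is None:
        return {}
    return {"download_root": model_dir, "local_files_only": True}


if __name__ == "__main__":
    args = build_parser().parse_args()
    configure_output(args.verbose)
    print(whisper_load_kwargs(args.model_dir))
```

Because both flags default to the current behaviour of downloading on demand, existing invocations would keep working. Whether the default should be quiet or verbose is a design choice left open here.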
@MahmoudAshraf97, do you have any feedback?
@MahmoudAshraf97, any thoughts?