deepspeakingavatar's People
deepspeakingavatar's Issues
ALL - venv should be in all the submodules .gitignore files
At moment integration requires for /venv to be created in all the submodules. For this reason submodules should have /venv ignored.
STT - dependencies not up to date
Speech to text module's dependencies are not up to date. More specifically the following version should be defined in the dependencies.txt:
torch=1.7.1
torchaudio=0.7.2
torchvision=0.8.2
DOCKER - Deep Speaking Avatar should be dockerized
STT - Better audio input method
The current way to input audio via listen_audio.py is non intuitive and requires an additional monitor to operate when used with the face since face runs in fullscreen thus hiding the audio input window.
STT listen_audio.py hardcoded output path
listen_audio.py writes audio to hardcoded path. It would be better to allow defining the location also as argument.
Installation fails on Ubuntu 20.04 with several "no such file or directory" errors
./install.sh
Submodule 'chatbot' (https://github.com/s0hv/chatbot) registered for path 'chatbot'
Submodule 'deep-speaking-avatar-text-to-audio' (https://github.com/vilukissa68/deep-speaking-avatar-text-to-audio) registered for path 'deep-speaking-avatar-text-to-audio'
Submodule 'face_detection' (https://github.com/viljamirom/face_detection) registered for path 'face_detection'
Submodule 'speech-to-text-translation' (https://github.com/Norskiii/speech-to-text-translation) registered for path 'speech-to-text-translation'
Setting up text2text
ERROR: Could not open requirements file: [Errno 2] No such file or directory: 'requirements.txt'
WARNING: You are using pip version 22.0.4; however, version 22.1 is available.
You should consider upgrading via the '/opt/DeepSpeakingAvatar/chatbot/venv/bin/python3 -m pip install --upgrade pip' command.
./install.sh: line 14: ./install-services.sh: No such file or directory
Text2text set up successfully
Setting up text2audio
./install.sh: line 23: ./src/setup_multivoice.sh: No such file or directory
Text2audio set up successfully
Setting up audio2text
./install.sh: line 31: ./setup.sh: No such file or directory
Audio2text set up successfully
Setting up face detection2
Collecting opencv-python==4.5.3.56
Downloading opencv_python-4.5.3.56-cp39-cp39-manylinux2014_x86_64.whl (49.9 MB)
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 49.9/49.9 MB 4.4 MB/s eta 0:00:00
Collecting numpy>=1.19.3
Downloading numpy-1.22.3-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (16.8 MB)
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 16.8/16.8 MB 4.6 MB/s eta 0:00:00
Installing collected packages: numpy, opencv-python
Successfully installed numpy-1.22.3 opencv-python-4.5.3.56
WARNING: You are using pip version 22.0.4; however, version 22.1 is available.
You should consider upgrading via the '/opt/DeepSpeakingAvatar/face_detection/venv/bin/python3 -m pip install --upgrade pip' command.
FACE DETECTION - Poll rate of the detection should be able to be set as cmd argument
DETECT FACE - Output file is hardcoded
Output file of the face detection module is hardcoded, but it should be able to be set via command line argument.
FACE DETECTION - Name of the output file is hardcoded
Currently only directory path of the output file is possible to set via command line argument, but not the name of the file. Filename should also be able to be defined this way.
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.