koljab / aivoicechat Goto Github PK
View Code? Open in Web Editor NEWLow latency ai companion voice talk in 60 lines of code using faster_whisper and elevenlabs input streaming
Low latency ai companion voice talk in 60 lines of code using faster_whisper and elevenlabs input streaming
I would like to add elevenlabs custom voice based on id and name.
custom voice = cloned own voice ;-)
Hope you can help me on this
Hi @KoljaB,
The bot is indeed very fast! But when adding Twilio (phone call) the response is very slow since the audio needs to be downloaded from Twilio servers. Have you played around with Twilio and know how to make it work that it will give low latency responses? Maybe there are alternatives which can handle a phone call super fast?
Highly appreciate your response.
Best
Hi @KoljaB !
First of all congrats on your project.
I'm really interested in using it but need to know if it is possible to use if with my ChatBot created with my custom Data in Langchain. I already use the OpenAI API to make it work but I need to use it with my Chatbot.
Is that possible?
Appreciate
Hello, this is incredible! I got it set up fairly quickly. It works great, except that the voice only says a few words at a time, then pauses for a few seconds, then continues, then pauses etc.
Would you have any advice for how to eliminate this playback problem? My internet speed is fairly good, so I doubt it's that.
Thanks for all your work on this, it's the best execution I've seen of this idea to date.
Couldn't quite find how to do this, the transcription always ends up in English. Any tips? Thanks!
Hi, I'd love to use this library, but can't because I have Cuda 12 and its a massive pain to get both 11 and 12 going at the same time. Do you plan to switch to cuda 12, or are there dependencies preventing a migration?
I wrote this in error, please delete. I had to update openai using
pip install openai==0.28.1
Hi!
One of my goals is to be able to trigger an animation based on a keyword or smiley included in the LLM answer.
Therefore, I could animate expressive faces on OBS for instance via websockets, by the script being triggered by the smiley (It's ignored be elevenlabs).
Not sure if it's the right project - it might be better for linguflex as a module, but is that possible?
elevenlabs.set_api_key("")
^^^^^^^^^^^^^^^^^^^^^^
AttributeError: module 'elevenlabs' has no attribute 'set_api_key'
Edit: you posted your key, I removed it but pls change it as soon as possible in case someone saw it
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.