Comments (19)
Thanks! I really appreciate it! And thanks for trying it out, as far as I know, you're the first to use it.
from alltalk_tts.
@RandomInternetPreson fyi.. I did just post 1x minor update to deal with a gradio issue in the dropdowns. If you want you can just download server.py and drop it over the top of your current one.
from alltalk_tts.
@RandomInternetPreson and I also updated how the narrator splits text between voice/narrator (hopefully to improve it) as well as populate the settings voices with the voices in the folder. I also added a couple more voices.
So you might want to git pull it again or just drop the server.py, tty_server.py and \templates\generate_form.html over the top of the others!
Sorry, still early days and I'm just improving a few little bits while I remember.
from alltalk_tts.
You are good! I almost got all of the deepspeed stuff working for windows, I actually managed to get through all the steps but I think I needed to take more care when doing the compiling. Next on the list is to try deepspeed with a WSL install, but I'm pretty happy with the extension even without deepspeed working for now.
Thanks again, going to do a new text-gen-install to take advantage of all the new updates oobabooga's code, and your extension in on my extension installation list now <3
from alltalk_tts.
One final update done just now. So if you had updated before, its worth an update again. I've done a few more bits that should keep the generated audio a bit more on track/keeping it closer to the original.
And I really do think this is everything done for a full v1. So unless someone says there's bugs, I cant see me changing the code for a good long time (famous last words and clearly tempting fate).
from alltalk_tts.
Oh and one more thing if you're interested. I couldn't include all these voices within github
Some are better than others....I think there's about 45 there.
from alltalk_tts.
@RandomInternetPreson changed the link for the voice files... you have 6x days to grab these https://filebin.net/t97nd69ac7qm2rsf if you want them
from alltalk_tts.
Just fyi. Ive done a major rebuild of the narration engine and filtering engine. The result is a much cleaner output for voice and more control over the narration/character function.
from alltalk_tts.
FYI, someone found a way to get DeepSpeed v11.2 in the default text-generation-webui environment https://github.com/erew123/alltalk_tts/blob/main/README.md#deepspeed-112-for-windows--python-311
from alltalk_tts.
FRICK Dude!!! This extension is amazing now, deepspeed on windows working as per your instructions. TY! I've been wrapped up in things and didn't notice your messages :C Are your voices hosted anywhere else? :3 Thank you again for all the hard work you have put into this!
from alltalk_tts.
I found where you linked to your voices, thanks again for this extension. Amazing!
from alltalk_tts.
Thanks! :)
Finetuning next along with the new API suite..... just tidying up the documentation for this one!
from alltalk_tts.
and, its done https://github.com/erew123/alltalk_tts?tab=readme-ov-file#-finetuning-a-model
from alltalk_tts.
This is the best tts extension out there! The level of detail in your instructions is 🔥 next on my list is to fine-tune! I know others have been asking about this since coqi dropped. I hope oobabooga makes this one of the default extensions. Simply the best!
from alltalk_tts.
Sadly he wont... He said the code base was too complicated and he was worried he would have to do the upkeep of it. I didn't know that was the way it worked aka, he would feel that he would have to take over the code base.
Thanks for the feedback though :) I appreciate it!
from alltalk_tts.
P.S. yeah... documentation... I hate doing it haha! Its just taken me 7 hours to do all the documentation correctly for the Finetuning...double check it.. reinstall everything from scratch to make sure it works correctly etc etc.....
So thanks for appreciating that aspect of it :)
from alltalk_tts.
Interesting, I guess that makes sense. I didn't think about oobabooga needing to maintain the code too, welp I've got your code working well with my current version of booga at least.
Really great work, yeah especially on the documentation it looks like it took a lot of work to put together. Yeah, the reinstall the double checking everything, the documentation, seems to take longer than writing the code ... :c thanks for doing all of this!!
It's now possible to essentially have real time conversations with the AI in a convenient way, this helps me create training data, I edit the conversations as part of my workflow.
I don't know if you use the whisper stt extension, but I have an edit to the ui_chat.py file that puts the record button up next to the generate button so you don't need to scroll down to use it:
The combination of your extension and the button makes communicating with the AI faster than typing a lot of the time.
from alltalk_tts.
Ive not tried the whisper extension, but as the finetuning uses it.. I probably should give it a go. Thanks for the button! :)
from alltalk_tts.
Frick this is what I needed, going to try this out today!!
from alltalk_tts.
Related Issues (20)
- Integration with RVC HOT 28
- Third option for `Text Not Inside * or " is` HOT 3
- switching between trained models from the webui in standalone mode HOT 1
- [!] Warning: The text length exceeds the character limit of 239 for language 'es', this might cause truncated audio. HOT 1
- AMD GPUs support HOT 4
- Containerized Docker Build Hits a Stopping Point HOT 1
- Any way to debug why AllTalk is not using GPU HOT 16
- Please help i can't make this tool work. HOT 6
- MaryTTS Support HOT 1
- Error in Step 2 Fine tuning. HOT 3
- Intel arc gpu support HOT 2
- Allow batch size 1 by default HOT 1
- FFMPEG RuntimeError: Failed to open the input in finetune.py HOT 3
- Colab HOT 1
- Can i use MMS models? HOT 1
- Standalone Install Error HOT 2
- Free Memory HOT 4
- Expected String and got Path HOT 6
- Possible to run the models entirely on CPU+RAM or the 2nd GPU? HOT 1
- SillyTavern AllTalk extension: Pitch must be between -24 and 24. HOT 4
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from alltalk_tts.