Comments (6)
Hi @uberubert
Please see https://github.com/erew123/alltalk_tts?tab=readme-ov-file#-a-note-on-character-cards--greeting-messages for an explanation of why asterisks are passed over to AllTalk. Enabling or disabling the Narrator is what switches "Pass Asterisks" on or off. The asterisks are not pronounced as the word "asterisk"; they are used to delineate Character and Narrator spoken text.
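To illustrate the idea of asterisks and quotes acting as delimiters rather than spoken text, here is a minimal sketch in Python. It is not AllTalk's actual parser; the function names and the role labels (`narrator`, `character`, `text_not_inside`) are hypothetical and only show how such a split could work in principle.

```python
import re

# Illustrative only: this is NOT AllTalk's real implementation.
# *asterisk-wrapped* text is treated as narrator text and
# "double-quoted" text as character speech (assumed convention).
SEGMENT = re.compile(r'\*(?P<narrator>[^*]+)\*|"(?P<character>[^"]+)"')


def split_segments(text):
    """Return a list of (role, text) pairs in the order they appear."""
    segments = []
    pos = 0
    for match in SEGMENT.finditer(text):
        # Anything between two delimited spans is "text not inside".
        gap = text[pos:match.start()].strip()
        if gap:
            segments.append(("text_not_inside", gap))
        if match.group("narrator") is not None:
            segments.append(("narrator", match.group("narrator").strip()))
        else:
            segments.append(("character", match.group("character").strip()))
        pos = match.end()
    tail = text[pos:].strip()
    if tail:
        segments.append(("text_not_inside", tail))
    return segments


def character_only(text):
    """Keep only the quoted character speech, dropping everything else."""
    return [seg for role, seg in split_segments(text) if role == "character"]
```

In this sketch the delimiters themselves never reach the output, so nothing would be read aloud as "asterisk"; `character_only` shows what voicing only the quoted text would select.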
Hope that helps clarify.
Thanks
from alltalk_tts.
Just FYI, you can test this behaviour with the curl commands provided in the API section (replace the settings with ones that match your chosen voices and the current TTS engine).
https://github.com/erew123/alltalk_tts?tab=readme-ov-file#-example-command-lines-standard-generation
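For anyone who would rather script the test than use curl, here is a rough Python sketch of the same kind of request. The endpoint path, the default port 7851, and the form field names are assumptions based on my recollection of the example command lines linked above, so please check them against that section; the voice file names are placeholders.

```python
from urllib.parse import urlencode
from urllib.request import Request, urlopen

# Assumed default address and endpoint; verify against the API docs.
BASE_URL = "http://127.0.0.1:7851"


def build_tts_request(text, narrator_enabled, base_url=BASE_URL):
    """Build (but do not send) a form-encoded POST for a standard generation."""
    fields = {
        # All field names below are assumptions taken from the README examples.
        "text_input": text,
        "text_filtering": "standard",
        "character_voice_gen": "female_01.wav",  # placeholder voice
        "narrator_enabled": "true" if narrator_enabled else "false",
        "narrator_voice_gen": "male_01.wav",  # placeholder voice
        "text_not_inside": "character",
        "language": "en",
        "output_file_name": "asterisk_test",
        "output_file_timestamp": "true",
        "autoplay": "false",
    }
    data = urlencode(fields).encode("utf-8")
    # Supplying data makes urllib issue a POST request.
    return Request(f"{base_url}/api/tts-generate", data=data)


def send(request):
    """Send the request to a running AllTalk server and return the raw body."""
    with urlopen(request) as response:
        return response.read().decode("utf-8")
```

Calling `send(build_tts_request('*She smiles.* "Hello there!"', narrator_enabled=False))` against a running server, and then again with `narrator_enabled=True`, should let you compare how the asterisk text is handled in each case.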
Finally, on v2, if there is a specific type of text you do or don't want passed through to the underlying TTS engine (after the initial text has been processed for character/narrator etc.), you can set that in the AllTalk interface > Global Settings > AllTalk API Defaults; scroll down to API Allowed Text Filtering/Passthrough Settings.
Thanks
Filtering is applied at the point where the chart turns red: as the TTS is generated, and before it is processed by RVC etc.
I think this issue is more related to UX than how the plugin works technically.
My narrator setting was disabled, so I expected only the quoted "character speaking" text to be voiced. To my surprise, the entire thing was voiced, including the text in asterisks. It was voiced because the option was re-enabled automatically when I reloaded the webapp.
So it seems I have to keep the narrator enabled to keep asterisk text from being spoken. But then there's no way to reduce the spoken voice to just the quoted "character speaking" text, unless I manually go over the settings every time I load the app.
To me this makes little sense, as it only serves to override my ability to manipulate options given to me. If this option absolutely must be set to a specific value, then would it not make more sense to remove this option? As a user, I would expect this option to stay the way I put it, or not be there at all.
Also, should I really switch to V2? I always prefer stable over beta, which is why I stick with V1 for now. (Oh, and I don't mean to come across as critical here, I'm loving the generated voice lines!)
Hi @uberubert
In v2 there are "Enabled Silent" options for the narrator and for "text not inside". It's detailed in the Gradio interface documentation; however, here is a partial snippet:
With SillyTavern, it does require that the extension be updated to the v2 extension (instructions here): https://github.com/erew123/alltalk_tts/tree/alltalkbeta/system/SillyTavern%20Extension/For%20AllTalk%20V2
As for v2, the core codebase is pretty solid and I've had little issue with it. What remains is mainly 40-60 hours of documentation improvement/writing, finishing the Google Colab, and creating a Docker build for people who use Docker, and I may yet add other features listed here #74
It's purely your choice.
Thanks