
Comments (6)

erew123 commented on August 28, 2024

Hi @uberubert

Please see https://github.com/erew123/alltalk_tts?tab=readme-ov-file#-a-note-on-character-cards--greeting-messages for an explanation of why asterisks are passed over to AllTalk. Enabling the Narrator is what switches "Pass Asterisks" on or off. The asterisks are not pronounced as the word "asterisk"; they are used to delineate Character and Narrator spoken text.
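To illustrate the idea of asterisks and quotes acting as delimiters, here is a minimal sketch in Python. It is not AllTalk's actual parser: the `split_segments` helper, the regular expression, and the segment labels are all hypothetical, and it assumes narration is wrapped in asterisks and character speech in double quotes.

```python
import re

# Hypothetical illustration only; AllTalk's real text processing may differ.
# Assumption: *narration* is wrapped in asterisks, "speech" in double quotes,
# and anything else counts as text that is inside neither.
SEGMENT_RE = re.compile(
    r'\*(?P<narrator>[^*]+)\*|"(?P<character>[^"]+)"|(?P<other>[^*"]+)'
)


def split_segments(text):
    """Return (kind, text) pairs in the order they appear."""
    segments = []
    for match in SEGMENT_RE.finditer(text):
        kind = match.lastgroup  # name of the alternative that matched
        content = match.group(kind).strip()
        if content:  # skip whitespace-only gaps between segments
            segments.append((kind, content))
    return segments


message = '*She smiles.* "Hello there!" and waves'
segments = split_segments(message)

# Keep only the quoted character speech, dropping narration and other text.
character_only = [content for kind, content in segments if kind == "character"]
```

With a split like this, the delimiters only decide which voice (if any) reads each segment; they are never part of the text that gets spoken.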

Hope that helps clarify.

Thanks

from alltalk_tts.

erew123 commented on August 28, 2024

Just FYI, you can test this behaviour out with the provided CURL commands in the API section (replacing the settings with those that match your chosen voices and the current TTS engine).

https://github.com/erew123/alltalk_tts?tab=readme-ov-file#-example-command-lines-standard-generation
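For anyone who would rather script the test than use curl, the following is a rough Python sketch of building such a request. The port, the `/api/tts-generate` path, the form field names, and the voice file names are assumptions recalled from the AllTalk README rather than confirmed here, so check them against the linked API section before relying on them. `build_tts_request` is a hypothetical helper; it only constructs the request and sends nothing.

```python
from urllib.parse import urlencode
from urllib.request import Request

# Assumption: default local address of an AllTalk server; adjust to your install.
BASE_URL = "http://127.0.0.1:7851"


def build_tts_request(text, narrator_enabled,
                      character_voice="female_01.wav",
                      narrator_voice="male_01.wav"):
    """Build (but do not send) a form-encoded POST for a TTS generation test.

    All field names below are assumptions; verify them in the README.
    """
    fields = {
        "text_input": text,
        "text_filtering": "standard",
        "character_voice_gen": character_voice,
        "narrator_enabled": "true" if narrator_enabled else "false",
        "narrator_voice_gen": narrator_voice,
        "text_not_inside": "character",
        "language": "en",
        "output_file_name": "narrator_test",
        "output_file_timestamp": "true",
        "autoplay": "false",
    }
    body = urlencode(fields).encode("utf-8")
    return Request(f"{BASE_URL}/api/tts-generate", data=body, method="POST")


# The same text with the narrator switched on and off, for comparison.
sample = '*She smiles.* "Hello there!"'
with_narrator = build_tts_request(sample, narrator_enabled=True)
without_narrator = build_tts_request(sample, narrator_enabled=False)
```

Passing either request to `urllib.request.urlopen` would send it to a running server, which makes it easy to compare the output with the narrator enabled and disabled.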


erew123 commented on August 28, 2024

Finally, on v2, if there is a specific type of text you do or don't want passed through to the underlying TTS engine (after the initial text has been processed for character/narrator etc.), you can set that in the AllTalk interface > Global Settings > AllTalk API Defaults. Scroll down to API Allowed Text Filtering/Passthrough Settings.

[image]

Thanks


erew123 commented on August 28, 2024

Filtering is applied at the point where the chart turns red, as the TTS is generated and before it is processed by RVC etc.

[image: AllTalk API Process]


uberubert commented on August 28, 2024

I think this issue is more related to UX than how the plugin works technically.

My narrator setting was disabled, so I expected only the quoted "character speaking" text to be voiced. To my surprise, the entire thing was voiced, even the text in asterisks. It was voiced because the option was re-enabled automatically when I reloaded the webapp.

So it seems I have to keep the narrator enabled to keep asterisk text from being spoken. But then there's no way to limit the spoken voice to just the quoted "character speaking" text, unless I manually go over the settings every time I load the app.

To me this makes little sense, as it only serves to override my ability to manipulate options given to me. If this option absolutely must be set to a specific value, then would it not make more sense to remove this option? As a user, I would expect this option to stay the way I put it, or not be there at all.

Also, should I really switch to V2? I always prefer stable over beta, which is why I stick with V1 for now. (Oh, and I don't mean to come across as critical here, I'm loving the generated voice lines!)


erew123 commented on August 28, 2024

Hi @uberubert

In v2 there are "Enabled Silent" options for narrator and text not inside. It's detailed in the Gradio interface documentation; however, here is a partial snippet:

[image]

With SillyTavern, the extension does need to be updated to the v2 extension (instructions here): https://github.com/erew123/alltalk_tts/tree/alltalkbeta/system/SillyTavern%20Extension/For%20AllTalk%20V2

As for v2, the core codebase is pretty solid and I've had little issue with it. What remains is mainly 40-60 hours of documentation improvement/writing, finishing Google Colab, and creating a Docker build for people who use Docker. I may yet add other features listed here: #74

It's purely your choice.

Thanks

