Question 1: Yes and no. You are correct that the emotion you write is "fixed". My idea with {seg} was to make the prompt dynamic, in the sense that you can feed the actual TTS content to the text interpreter model. The dynamic part is the TTS text, not the emotion. Whatever you write before or after {seg} is fixed for all the segments generated. And no, because not ALL the text in the "TTS text" node constitutes a single "seg". A segment is one full generation by the model in a single pass. You can see this in the console: if generation stopped and started again, that is another segment.
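To illustrate the idea, here is a minimal sketch of how a fixed instruction with a {seg} placeholder could be filled once per segment. This is not the node's actual code: `build_prompts`, the template wording, and the hand-written segment list are all hypothetical, and in practice the segments come from however the node splits the text.

```python
def build_prompts(template: str, segments: list[str]) -> list[str]:
    """Fill the {seg} placeholder once per segment.

    Everything in the template outside {seg} is repeated unchanged
    for every segment; only the segment text varies.
    """
    return [template.replace("{seg}", seg) for seg in segments]


# Hypothetical instruction: the emotion is fixed, the text is dynamic.
template = "Read this in a calm, warm tone: {seg}"

# Hypothetical segments, written by hand for the example.
segments = ["Hello there.", "How was your day?"]

prompts = build_prompts(template, segments)
for prompt in prompts:
    print(prompt)
```

Each prompt carries the same emotion instruction, so changing the emotion per segment would need a different template per segment rather than a single {seg} template.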

The text can be segmented in several ways:
1- by configuring the chunks (if the text reaches t…

Answer selected by Vatharian