Issue: Speaker Token Not Locking Voice Consistently in Veena TTS

#17

by Tamileditz - opened Jul 26

Jul 26

Hi team,
I'm using the Veena TTS model and specifying the speaker token (e.g., ) at the start of each prompt. However, the generated voice is inconsistent: it often switches to a different speaker mid-generation or produces varying tones. The model doesn't seem to lock reliably onto the intended speaker voice. Is there a way to enforce consistent speaker conditioning throughout the output?

Thanks in advance!

Abhi-P

Aug 23

Yes, I'm facing this issue a lot. I tried other voices, but the output automatically changes to the default voice. The problem is that no matter how many times you retry, sentences that were generated in the default voice will not switch to the selected voice.

shadabsayd

28 days ago

I am facing the same problem.

shadabsayd

28 days ago

One observation, though: the voice maitri is consistent and the rest are not.
