-
Notifications
You must be signed in to change notification settings - Fork 26
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Enhancement] New fully open source TTS with steerable voice characteristics #21
Comments
Ooh thanx so much for sharing this with me! I will look into it and consider integrating it if it's a good fit! |
FYI: mkiol/dsnote/issues/122 might be relevant and unfortunately limit the usefulness of this model until huggingface/parler-tts/issues/11 is fixed/implemented. Feel free to close the issue if you prefer. |
Thanks again! I'll leave it open to remember to check out this project from time to time. <3 |
@FlorianEagox there are also other really cool TTS models you could implement if you ever get the chance to
|
And there is another new steerable open source model which looks promising (and even seems to support translation internally, but only EN/CN for now): |
Honestly, i really loved the new GPTSoVits V2, it also has really fast generations |
Just wanted to bring this new TTS library+model to your attention which allows voice characteristics to be steered via a separate prompt:
https://github.com/huggingface/parler-tts (impressive demos on the HF space linked there).
Afterthought: Wondering whether an LLM might be able to derive such prompts from a pure text transcript...
The text was updated successfully, but these errors were encountered: