Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Q] How to use 7lang model? #148

Open
oleid opened this issue Aug 5, 2024 · 1 comment
Open

[Q] How to use 7lang model? #148

oleid opened this issue Aug 5, 2024 · 1 comment

Comments

@oleid
Copy link

oleid commented Aug 5, 2024

Dear WhisperSpeech maintainers,

I found multi-language models like s2a-v1.95-medium-7lang.model on huggingface.
When trying to use them with example/text_to_audio_playback.py by setting model_ref = "collabora/whisperspeech:s2a-v1.95-medium-7lang.model" I only get strange sounding voice output.
The default models sound fine.

It is supposed to work or what am I missing?

Thanks a lot,

My system setup:

  • Gentoo Linux
  • ROCm 6.1 with Radeon 7900 XTX
  • Pytorch 2.4 with ROCm 6.1 support
  • python 3.12
@PeterMesihaDev
Copy link

There is s2a and t2s models. 7lang indicates that it is working with 7 languages. When using the generate_to method you can pass a property called lang which defines which language the output should be, if no lang property is passed, it will use en (englisch) as default.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Development

No branches or pull requests

2 participants