[Q] How to use 7lang model? #148

oleid · 2024-08-05T18:23:26Z

Dear WhisperSpeech maintainers,

I found multi-language models like s2a-v1.95-medium-7lang.model on huggingface.
When trying to use them with example/text_to_audio_playback.py by setting model_ref = "collabora/whisperspeech:s2a-v1.95-medium-7lang.model" I only get strange sounding voice output.
The default models sound fine.

It is supposed to work or what am I missing?

Thanks a lot,

My system setup:

Gentoo Linux
ROCm 6.1 with Radeon 7900 XTX
Pytorch 2.4 with ROCm 6.1 support
python 3.12

The text was updated successfully, but these errors were encountered:

PeterMesihaDev · 2024-11-04T15:53:00Z

There is s2a and t2s models. 7lang indicates that it is working with 7 languages. When using the generate_to method you can pass a property called lang which defines which language the output should be, if no lang property is passed, it will use en (englisch) as default.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Q] How to use 7lang model? #148

[Q] How to use 7lang model? #148

oleid commented Aug 5, 2024

PeterMesihaDev commented Nov 4, 2024

[Q] How to use 7lang model? #148

[Q] How to use 7lang model? #148

Comments

oleid commented Aug 5, 2024

My system setup:

PeterMesihaDev commented Nov 4, 2024