-
Notifications
You must be signed in to change notification settings - Fork 218
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
non-latin text is not aligned #1046
Comments
Can you provide more context? |
word alignments are always empty for texts of non-latin scripts, as if no text was detected. |
You’d need an ASR model that supports your target language. You can check if there’s one available in torchaudio: https://pytorch.org/audio/stable/pipelines.html Otherwise you’d need to train or fine tune your own. You can also try word alignments from faster whisper (unmerged yet in #1017). |
Awesome! it works, thanks! |
i.e. arabic, russian, etc.
text field in response has valid non-latin transcript, yet there are no alignments.
The text was updated successfully, but these errors were encountered: