-
Notifications
You must be signed in to change notification settings - Fork 174
Whisper word level timestemp #47
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
They don't appear to work with the distil models. the old xenova ones seem fine though less accurate for me than the sample for the distilled version here https://huggingface.co/spaces/Xenova/distil-whisper-web but it only gives sentence timestamps |
@danieloi I managed to do it here https://github.com/apssouza22/video-text-edit |
@apssouza22 it works for word-level timestamps when you use the distil variants of the model? |
Oh sorry, distilled didn't work, but I thought it was because the models are too big. |
Same here @apssouza22, I get this error when I want word-level timestamps and use the distil variants in transformers.js: Error: Layer index 6 is out of bounds for cross attentions (length 4). |
@danieloi yeah. Got the same. |
I am wondering if it is possible to have word level timestamp when using the whisper example? I tried to include the param
word_timestamps
to the model options but it didn't work.Any help in how to achieve this will be very welcome.
Thanks
The text was updated successfully, but these errors were encountered: