[ACM MM 2017 & IEEE TMM 2020] This is the Theano code for the paper "Video Description with Spatial Temporal Attention"
Updated Oct 20, 2020 - Python
Captions videos that contain Chinese Sign Language.
S2VT with Attention
Implementation of an Encoder-Decoder Model for Video Captioning in TensorFlow
Deep Learning for Computer Vision 2018 Spring
A Python-based web application that extracts video subtitles and translates them to English using the OpenAI Whisper library. The app provides both the translated and original subtitles for download.
A multi-modal video caption dataset with richer annotations
📺 Software concept for summarizing YouTube video captions.