StyleNet: Generating Attractive Visual Captions with Styles

* under development

Powered by DLHacks

StyleNet is a novel framework to address the task of generating attractive captions for images and videos with different styles. A novel model component, named factored LSTM is used in StyleNet, which automatically distills the style factors in the monolingual text corpus.

framework

examples of generated captions

Description

A pytorch implemention of StyleNet
Author: Chuang Gan, Zhe Gan, Xiaodong He, Jianfeng Gao, Li Deng
Published in: Computer Vision and Pattern Recognition (CVPR), 2017
URL: https://www.microsoft.com/en-us/research/wp-content/uploads/2017/06/Generating-Attractive-Visual-Captions-with-Styles.pdf
Dataset: https://zhegan27.github.io/Paper.html
Slideshare: https://www.slideshare.net/DeepLearningJP2016/dl-hacks-stylenet-generating-attractive-visual-captions-with-styles
written by Kota Kakiuchi

Requirement

python 3.5.3
pytorch 0.2.0
torchvision 0.1.9
numpy 1.13.3
scikit-image 0.13.1
nltk 3.2.5

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
sample_images		sample_images
.gitignore		.gitignore
README.md		README.md
build_vocab.py		build_vocab.py
constant.py		constant.py
data_loader.py		data_loader.py
loss.py		loss.py
models.py		models.py
preprocess.py		preprocess.py
sample.py		sample.py
test.py		test.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

StyleNet: Generating Attractive Visual Captions with Styles

* under development

Description

Requirement

About

Releases

Packages

Languages

kacky24/stylenet

Folders and files

Latest commit

History

Repository files navigation

StyleNet: Generating Attractive Visual Captions with Styles

* under development

Description

Requirement

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages