emorecom

ICDAR2021 Competition Multimodal Emotion Recognition on Comics scenes

Repo strucutre

train.py - training module
preprocess.py - module for concatenating image, transcripts, and label for efficient loading
dataset - data folder
download_warmup_dataset.sh - bash script for downloading warmup data
EDA.ipynb - notebook for EDA
emorecom - core folder consisting of model, data, and utilities

Setup and install datasts

This repo assumed that Tensorflow is installed successfully and run smoothly on your system (support Tensorflow >= 2.0.0).
Initialize settings

pip3 install gdown
pip3 install -r requirements.txt

Install datasets (warm-up, full)

bash download_warmup_dataset.sh
bash download_full_datast.sh

Run preprocessing to concat image-paths, labels, and transcripts into a single TFRecord file for efficient loading

# for training dataset
python3 preprocess.py --test-size 0.2 --training --image warm-up-train/train \
--transcript warm-up-train/train_transcriptions.json \
--lable warm-up-train/train_emotion_labels.csv \
--output train.tfrecords --val-output val.tfrecords

# for testing dataset
python3 preprocess.py --image warm-up-test/test \
--transcript warm-up-test/test_transcriptions.json \
--output test.tfrecords

Install Glove Word-Embeddings

bash download_twitter_glove_we.sh

Training

# remember to preprocess training and validation data as above

# check train.sh for additional arguments
bash train.sh

Inference

# remember to preprocess inference data as above

# make predictions
bash predict.py
# or (assume that all trained models ared saved in /saved_models folder
python3 train.py --experiment-name model_1_resnet_lstm_early_fusion

Dataset details

Warm-up dataset:

Warm-up data is provided with 800 training images (with transcriptions and labels) and 100 test images (with transcriptions)

Full dataset: Full dataset is provied with 8000 training images (with transcriptsion and labels) and 2000 examples (with transcriptions).

Data format

Labels: 8 emotion classes including: 0=Angry, 1=Disgust, 2=Fear, 3=Happy, 4=Sad, 5=Surprise, 6=Neutral, 7=Others.
Each instance includes 10 fields as follows:
- id: id of the image in the corresponding set (train or test)
- image_id: image_id associated with the image name
- emotion0_score: a manually annotated score for emotion0.
- emotion1_score: a manually annotated score for emotion1.
- emotion2_score: a manually annotated score for emotion2.
- emotion3_score: a manually annotated score for emotion3.
- emotion4_score: a manually annotated score for emotion4.
- emotion5_score: a manually annotated score for emotion5. - emotion6_score: a manually annotated score for emotion6.
- emotion7_score: a manually annotated score for emotion7.

References

@InProceedings{Iyyer:Manjunatha-Comics2017, Title = {The Amazing Mysteries of the Gutter: Drawing Inferences Between Panels in Comic Book Narratives}, Booktitle = {IEEE Conference on Computer Vision and Pattern Recognition}, Author = {Mohit Iyyer and Varun Manjunatha and Anupam Guha and Yogarshi Vyas and Jordan Boyd-Graber and Hal {Daum'{e} III} and Larry Davis}, Year = {2017}}

Name		Name	Last commit message	Last commit date
Latest commit History 160 Commits
.github/ISSUE_TEMPLATE		.github/ISSUE_TEMPLATE
checkpoints		checkpoints
emorecom		emorecom
logs		logs
saved_models		saved_models
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
EDA.ipynb		EDA.ipynb
LICENSE		LICENSE
README.md		README.md
download_full_dataset.sh		download_full_dataset.sh
download_twitter_glove_we.sh		download_twitter_glove_we.sh
download_warmup_dataset.sh		download_warmup_dataset.sh
predict.py		predict.py
predict.sh		predict.sh
preprocess.py		preprocess.py
requirements.txt		requirements.txt
train.py		train.py
train.sh		train.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

emorecom

Repo strucutre

Setup and install datasts

Dataset details

Data format

References

Links:

About

Releases

Packages

Languages

License

aisutd/emorecom

Folders and files

Latest commit

History

Repository files navigation

emorecom

Repo strucutre

Setup and install datasts

Dataset details

Data format

References

Links:

About

Topics

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages