This code base is the pytorch implementation of the paper:
Counterfactual Debiasing Inference for Compositional Action Recognition, ACM Multimedia, 2021
*Pengzhan Sun, *Bo Wu, Xunsong Li, Wen Li, Lixin Duan, Chuang Gan
- Introduction
- Task Setting
- Method
- Requirements
- Dataset
- Getting Started
- Checkpoints
- Acknowledgments
- Citation
There are two disjoint action sets {1, 2} and two disjoint object sets {A, B}. For the compositional action recognition task, the training set of the model is {1A + 2B}, and the verification set is {1B + 2A}. Under this challenging setting, the model needs to be able to recognize new combinations of actions and objects. In this problem setting, there are 174 action categories with 54,919 training and 57,876 validation instances. More details can be found in Something-Else.
We empower models the ability of counterfactual analysis so that a more accurate classification result can be gained by comparing factual inference outcome and counterfactual inference outcome.
- We observe that prior knowledge learned from appearance information is mixed with the spurious correlation between action and instance appearance, which badly inhibits the model’s ability of action learning.
-
We remove the pure appearance effect from total effect by counterfactual debiasing inference on our novel framework CDN proposed for compositional action recognition.
-
We achieve state-of-the-art performance for compositional action recognition on the Something-Else dataset.
pip install -r requirements.txt
Download Something-Something Dataset and Something-Else Annotation from Something-Else repo (Joaana et al., 2020). Note that we also provide the annotation per video for users with limited computing resources by spliting Something-Else Annotation mentioned above.
To train, test or conduct counterfactual debiasing inference, please run these scripts.
Download our models reported on the paper.
If you use this code repository in your research, please cite this project.
@inproceedings{counterfactual2021,
title={Counterfactual Debiasing Inference for Compositional Action Recognition},
author={Sun, Pengzhan and Wu, Bo and Li, Xunsong and Li, Wen and Duan, Lixin and Gan, Chuang},
booktitle={Proceedings of the 29th ACM International Conference on Multimedia},
year={2021}
}
This repository is implemented as a fork of Something-Else. We used parts of code from following repositories:
https://github.com/joaanna/something_else
https://github.com/ruiyan1995/Interactive_Fusion_for_CAR
Contact: pengzhansun6@gmail.com