Khoi Nguyen (VinAI Research) Sinisa Todorovic (Oregon State University)
Abstract: This paper addresses incremental few-shot instance segmentation, where a few examples of new object classes arrive when access to training examples of old classes is not available anymore, and the goal is to perform well on both old and new classes. We make two contributions by extending the common Mask-RCNN framework in its second stage -- namely, we specify a new object class classifier based on the probit function and a new uncertainty-guided bounding-box predictor. The former leverages Bayesian learning to address a paucity of training examples of new classes. The latter learns not only to predict object bounding boxes but also to estimate the uncertainty of the prediction as guidance for bounding box refinement. We also specify two new loss functions in terms of the estimated object-class distribution and bounding-box uncertainty. Our contributions produce significant performance gains on the COCO dataset over the state of the art -- specifically, the gain of +6 on the new classes and +16 on the old classes in the AP instance segmentation metric. Furthermore, we are the first to evaluate the incremental few-shot setting on the more challenging LVIS dataset.
Details of the model architecture and experimental results can be found in our following paper:
@inproceedings{nguyen2022ifs,
title={ifs-rcnn: An incremental few-shot instance segmenter},
author={Nguyen, Khoi and Todorovic, Sinisa},
booktitle={Proceedings of the IEEE/CVF conference on computer vision and pattern recognition},
pages={7010--7019},
year={2022}
}
Please CITE our paper whenever this repository is used to help produce published results or incorporated into other software.
This codebase is primarily based on this codebase of https://github.com/ucbdrive/few-shot-object-detection
Requirements
- Linux with Python >= 3.6
- PyTorch >= 1.4
- torchvision that matches the PyTorch installation
- CUDA 10.0, 10.1, 10.2
- GCC >= 4.9
Build fsdet
- Create a virtual environment.
python3 -m venv fsdet
source fsdet/bin/activate
You can also use conda
to create a new environment.
conda create --name fsdet
conda activate fsdet
- Install Pytorch 1.6 with CUDA 10.2
pip install torch torchvision
You can choose the Pytorch and CUDA version according to your machine. Just to make sure your Pytorch version matches the prebuilt detectron2
- Install Detectron2 v0.2.1
python3 -m pip install detectron2 -f \
https://dl.fbaipublicfiles.com/detectron2/wheels/cu102/torch1.6/index.html
- Install other requirements.
python3 -m pip install -r requirements.txt
- configs: Configuration files
- datasets: Dataset files (see Data Preparation for more details)
- iFS-RCNN
- checkpoint: Checkpoint code.
- config: Configuration code and default configurations.
- engine: Contains training and evaluation loops and hooks.
- layers: Implementations of different layers used in models.
- modeling: Code for models, including backbones, proposal networks, and prediction heads.
- tools
- train_net.py: Training script.
- test_net.py: Testing script.
- ckpt_surgery.py: Surgery on checkpoints.
- run_experiments.py: Running experiments across many seeds.
- aggregate_seeds.py: Aggregating results from many seeds.
We evaluate our models on two datasets:
- COCO: We use COCO 2014 and extract 5k images from the val set for evaluation and use the rest for training. We use the 20 object classes that are the same with PASCAL VOC as novel classes and use the rest as base classes.
- LVIS: We treat the frequent and common classes as the base classes and the rare categories as the novel classes.
See datasets/README.md for more details.
-
To train a model on base classes, refer to the train.sh script
-
To fine-tune a model on novel classes, refer to the finetuning.sh script
-
To evaluate the trained models, refer to the test.sh script
-
For multiple running experiments, refer to run_experiments.sh script
-
To aggregate results of multiple runs, refer to aggregate_seeds.sh script
This repo is built upon fsdet
If you have any questions or suggestions about this repo, please feel free to contact me (ducminhkhoi@gmail.com).