Skip to content

Latest commit

 

History

History
46 lines (35 loc) · 1.8 KB

README.md

File metadata and controls

46 lines (35 loc) · 1.8 KB

Torch Implementation of FGVC Datasets and Baselines

This repository provides an implementation of the most renowned FGVC (Fine-Grained Visual Categorization) datasets, complete with a training script using ResNet50. Some datasets, originally unavailable or with altered structures, have been reorganized and are now accessible via Google Drive for automated downloading.

The purposes of this project are

  • providing unified interfaces to FGVC datasets
  • easy access to dataset using auto downloading before training
  • expendable to new datasets

The scripts have been tested on CUDA 11.8 with torch 2.0.

Datasets listed:

Prerequisites

This project is compatible with both Linux and Windows operating systems.

Linux & Windows

# Clone the repository:
git clone https://github.com/qiyuliaogh/FGVC_baselines_pytorch.git
cd FGVC_baselines_pytorch

# Install dependencies: 
pip install torch numpy time tqdm albumentations torchvision json pandas cv2 random scipy matplotlib

Then modify the including in training script, for example, if you want to use Stanford dogs datasset, use:

from datasets.Dogs import Dogs as FGVC_Dataset

Uncomment this line and comment out the rest.

Running the Training Script

Linux & Windows

python model_trainer.py --device cuda:0