Data synthesis has become increasingly important for long-tail instance segmentation, where it mitigates class imbalance and high annotation costs. We propose a collaborative approach in which feedback from the instance segmentation model guides the augmentation process: the diffusion model uses this feedback to generate objects on which the segmentation model is most uncertain. The number and size of synthesized objects for each class are adjusted dynamically based on the model's state to improve learning on underrepresented classes. Running multiple rounds further strengthens the augmentation, allowing the feedback to be refined throughout training. In summary, multi-round collaborative augmentation (MRCA) improves sample efficiency by providing the right synthetic data at the right moment.
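As a rough illustration of the feedback loop (a sketch with hypothetical names such as synthesis_quota, not the actual implementation), per-class synthesis quotas can be derived from the segmentation model's uncertainty and the class frequencies:

# illustrative sketch of the feedback idea (hypothetical names and weighting,
# not the exact MRCA implementation): rarer, more uncertain classes get more synthesis
import numpy as np

def synthesis_quota(class_uncertainty, class_counts, budget=10000):
    # class_uncertainty: per-class uncertainty reported by the segmentation model
    # class_counts: number of annotated instances per class in the dataset
    rarity = 1.0 / np.sqrt(np.asarray(class_counts, dtype=float) + 1.0)
    weights = np.asarray(class_uncertainty, dtype=float) * rarity
    weights /= weights.sum()
    return np.round(weights * budget).astype(int)  # images to synthesize per class

Weighting uncertainty by inverse square-root frequency steers the generation budget toward rare classes the model is still uncertain about, which matches the goal of improving underrepresented classes.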
We recommend creating three separate Python environments for generation, segmentation, and training, because each component requires different dependencies: Detectron2 no longer supports recent CUDA and PyTorch versions, while generation models such as Stable Diffusion 3 require up-to-date versions of both.
Follow X-Paste for the basic training requirements and BiRefNet for cutting out foreground objects.
pip install -r requirements.txt
Download the LVIS, OpenImages, and VOC2012 datasets.
Set your access_token for Stable Diffusion 3 (the checkpoint is gated on Hugging Face) to use the model.
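For example, the token can be registered once via huggingface_hub and the pipeline then loaded as usual (a minimal sketch; the medium checkpoint name below is an assumption, use whichever checkpoint you were granted access to):

# minimal sketch: authenticate, then load the gated SD3 pipeline
# (the checkpoint name is an assumption; substitute the one you have access to)
import torch
from huggingface_hub import login
from diffusers import StableDiffusion3Pipeline

login(token="hf_...")  # your personal access token
pipe = StableDiffusion3Pipeline.from_pretrained(
    "stabilityai/stable-diffusion-3-medium-diffusers", torch_dtype=torch.float16
).to("cuda")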
Replace the pipeline_stable_diffusion_3.py file in your installed diffusers library with this file.
When generating with Stable Diffusion 1.5, replace the pipeline_stable_diffusion.py file in the diffusers library with this file instead.
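To find the installed file you need to replace, you can print its path from the diffusers package (the subdirectory layout below matches recent diffusers releases; adjust if your version differs):

# locate the pipeline file inside the installed diffusers package
import os
import diffusers

root = os.path.dirname(diffusers.__file__)
print(os.path.join(root, "pipelines", "stable_diffusion_3",
                   "pipeline_stable_diffusion_3.py"))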
- Generate images with Stable Diffusion 3:
cd generator
# for generating with a single GPU
python generate.py
# for generating with multiple GPUs: one process per GPU, with --div selecting
# that process's share of the work (see the launcher sketch below)
python generate.py --gpu 0 --div 0
python generate.py --gpu 1 --div 1
python generate.py --gpu 2 --div 2
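If convenient, the per-GPU processes can also be spawned from a single script (a hypothetical launcher, assuming three GPUs and that --div indexes each process's data shard):

# hypothetical launcher: one generate.py process per GPU
import subprocess

NUM_GPUS = 3  # assumption: adjust to your machine
procs = [
    subprocess.Popen(["python", "generate.py", "--gpu", str(i), "--div", str(i)])
    for i in range(NUM_GPUS)
]
for p in procs:
    p.wait()  # block until all generation shards finish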
- Segment foreground objects and filter out low-quality ones (an illustrative filter is sketched below):
# activate the BiRefNet environment before running
cd diSegmenter
python segmentAndFilter.py
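In spirit, the filtering step is a quality gate on the predicted foreground mask. A minimal sketch (hypothetical function and thresholds, not the actual code in segmentAndFilter.py):

# illustrative quality gate on a BiRefNet-style soft mask in [0, 1]
# (hypothetical thresholds; the real filter lives in segmentAndFilter.py)
import numpy as np

def keep_object(mask: np.ndarray,
                min_area_ratio: float = 0.05,
                min_confidence: float = 0.8) -> bool:
    fg = mask > 0.5                      # binarize the soft mask
    if not fg.any():
        return False                     # no foreground at all
    area_ratio = fg.mean()               # fraction of the crop that is foreground
    confidence = float(mask[fg].mean())  # mask confidence on the foreground region
    return area_ratio >= min_area_ratio and confidence >= min_confidence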
- Train the instance segmentation model:
# edit launch.sh to set the GPUs used for training
# for training a single round
bash launch.sh --config configs/MRCA/MRCA_R50.yaml
# for training multiple rounds, chain one invocation per round (see the driver sketch below)
bash launch.sh --config configs/MRCA/MRCA_R50.yaml &&
bash launch.sh --config configs/MRCA/MRCA_R50.yaml && ...
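Equivalently, the rounds can be driven from a small script (a hypothetical driver; it assumes launch.sh resumes from the previous round's checkpoints and feedback, and the round count is up to you):

# hypothetical multi-round driver
import subprocess

NUM_ROUNDS = 3  # assumption: choose your own round count
for r in range(NUM_ROUNDS):
    subprocess.run(
        ["bash", "launch.sh", "--config", "configs/MRCA/MRCA_R50.yaml"],
        check=True,  # abort if a round fails
    )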
- Test with a given checkpoint:
bash launch.sh --config configs/MRCA/MRCA_R50.yaml --eval-only
We use code from Detectron2, Stable Diffusion 3, BiRefNet, CenterNet2, X-Paste, and BSGAL.