This repository is the official implementation of Rethinking RoI Strategy in Interactive 3D Segmentation for Medical Images.
Our solution achieved remarkable results in the CVPR 2025 Foundation Models for Interactive 3D Biomedical Image Segmentation Challenge:
- 🥈 Second place in AllData Track
- 🥉 Third place in Coreset Track
This repository implements the DCM (DualClickMed) approach for interactive 3D medical image segmentation. Our dual-expert architecture features both global and local Region-of-Interest (RoI) strategies:
- Global-RoI expert: Processes the entire organ based on user prompts to provide comprehensive anatomical context
- Local-RoI expert: Focuses on high-resolution patches centered on specific user clicks for precise segmentation of fine structures
Download the docker images:
- Download pretrained weights:
- Install dependencies:
pip install -r requirements.txt
cd train_global_roi
torchrun --nnodes=1 --nproc_per_node=2 train.pycd train_local_roi
torchrun --nnodes=1 --nproc_per_node=3 train_cvpr_ddp_interactive.py# Load the docker image
docker load -i yiooo_coreset.tar.gz
# Run inference
docker container run --gpus "device=0" -m 32G --name yiooo_coreset --rm \
-v $PWD/PathToTestSet/:/workspace/inputs/ \
-v $PWD/yiooo_coreset_outputs/:/workspace/outputs/ \
yiooo_coreset:latest /bin/bash -c "sh predict.sh"Our method achieves the following performance on the challenge coreset.
| Modality | DSC AUC | NSD AUC | DSC Final | NSD Final |
|---|---|---|---|---|
| CT | 3.3461 | 3.4719 | 0.8462 | 0.8797 |
| MRI | 2.7133 | 3.0852 | 0.6809 | 0.7714 |
| Microscopy | 2.2917 | 3.0618 | 0.5871 | 0.7743 |
| PET | 3.0188 | 2.8778 | 0.7691 | 0.7440 |
| Ultrasound | 3.6741 | 3.7096 | 0.9299 | 0.9440 |
We sincerely thank the competition organizers for providing this valuable research platform. We also acknowledge the excellent work and open-source contributions from:
If you find our work useful, please consider citing our paper:
@inproceedings{zhang2025rethinking,
title={Rethinking RoI Strategy in Interactive 3D Segmentation for Medical Images},
author={Zhang, Ziyu and Yu, Yi and Xue, Yuan},
booktitle={CVPR Workshop on Foundation Models for Medical Vision},
year={2025}
}
