🚀 PathDiff: Histopathology Image Synthesis with Unpaired Text and Mask Conditions

Mahesh Bhosale¹, Abdul Wasi¹, Yuanhao Zhai¹, Yunjie Tian¹, Samuel Border², Nan Xi¹, Pinaki Sarder², Junsong Yuan¹, David Doermann¹, Xuan Gong³

¹University at Buffalo | ²University of Florida | ³Harvard Medical School

Paper

📖 Overview

PathDiff is a novel diffusion-based framework for generating high-quality histopathology images by jointly leveraging unpaired text reports and cell-type masks. Unlike prior methods that require paired annotations, PathDiff learns from two separate datasets, one with image–text pairs and one with image–mask pairs. At inference time it can synthesize images conditioned on text only, on a mask only, or on both text and mask.
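The three inference modes can be pictured with a small sketch. The `Condition` class and `conditioning_mode` helper below are hypothetical names introduced only for illustration; they are not part of the PathDiff code base, and the way the repository actually represents a missing modality may differ.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass
class Condition:
    """Hypothetical container for the inputs to one generation request."""
    text: Optional[str] = None                      # diagnostic text prompt, if any
    mask: Optional[Sequence[Sequence[int]]] = None  # cell-type mask, if any


def conditioning_mode(cond: Condition) -> str:
    """Name the inference mode implied by which inputs are present."""
    has_text = cond.text is not None
    has_mask = cond.mask is not None
    if has_text and has_mask:
        return "text+mask"
    if has_text:
        return "text"
    if has_mask:
        return "mask"
    raise ValueError("Provide a text prompt, a mask, or both.")
```

A request carrying only a prompt maps to the text mode, one carrying only a mask maps to the mask mode, and one carrying both maps to the combined mode.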

📄 Abstract

Diffusion-based generative models have shown promise in synthesizing histopathology images to address data scarcity caused by privacy constraints. Diagnostic text reports provide high-level semantic descriptions, and masks offer fine-grained spatial structures essential for representing distinct morphological regions. However, public datasets lack paired text and mask data for the same histopathological images, limiting their joint use in image generation. This constraint restricts the ability to fully exploit the benefits of combining both modalities for enhanced control over semantics and spatial details. To overcome this, we propose PathDiff, a diffusion framework that effectively learns from unpaired mask-text data by integrating both modalities into a unified conditioning space. PathDiff allows precise control over structural and contextual features, generating high-quality, semantically accurate images. PathDiff also improves image fidelity, text-image alignment, and faithfulness, enhancing data augmentation for downstream tasks like nuclei segmentation and classification. Extensive experiments demonstrate its superiority over existing methods.

Figure 1. PathDiff Overview

🚀 Quick Start

⚙️ Environment setup

git clone https://github.com/bhosalems/PathDiff.git
cd PathDiff
# create and activate the conda environment
conda env create -f environment.yaml
conda activate pathdiff

🌊 Inference

Download the jointly trained pretrained model and the corresponding config here. Set the paths of the downloaded checkpoint and config in sampling.py. We also provide a small subset of PathCap with silver-standard masks in the pathcap_label_pred_small directory, organized by pathology.

# Set the checkpoint and config in sampling.py. Here the method is PathDiff and
# the dataset is PATHCAP (trained jointly on PathCap and CONIC).
if method == 'PathDiff':
    mask_channels = 6
    model_check_points = {
        "PanNuke": "",
        "TCGA": "",
        "CONIC": "",
        "PATHCAP": "last.ckpt",  # <-- set this to the downloaded checkpoint
    }

    model_configs = {
        "PanNuke": "",
        "TCGA": "",
        "CONIC": "",
        "PATHCAP": "/configs/11-02T02-36-project.yaml",  # <-- set this to the downloaded config
    }

# Run Inference
python sampling.py
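
Before launching sampling, it can help to confirm that the checkpoint and config entries point at real files. The sketch below is a minimal, standalone check; `missing_paths` is a hypothetical helper that does not exist in sampling.py, and it assumes only that the two dictionaries have the shape shown above.

```python
from pathlib import Path


def missing_paths(checkpoints, configs, dataset):
    """Return human-readable problems for one dataset key (hypothetical helper)."""
    problems = []
    for label, table in (("checkpoint", checkpoints), ("config", configs)):
        value = table.get(dataset, "")
        if not value:
            # Empty string or missing key: the entry was never filled in.
            problems.append(f"{label} for {dataset} is not set")
        elif not Path(value).is_file():
            # Entry is filled in but does not point at an existing file.
            problems.append(f"{label} for {dataset} not found: {value}")
    return problems
```

An empty list means both files were found; otherwise each string names the entry that still needs to be set or corrected.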

🖼 Qualitative Results

PathDiff integrates text and mask control, as shown below.

Figure 2. Unified Inference

Note that in this example the conditions are randomly paired and might not be pathologically meaningful or plausible. For paired results, please see our supplementary material.

🔧 Scripts

- main.py # Train on your own dataset. You need to set up the config and the Dataset class; see the example config in the command below.
python main.py -t --base configs/latent-diffusion/mixed_cond/plip_imagenet_finetune_Mixed_control.yaml

- sampling.py # Run generation from text, mask, or both
python sampling.py

- evaluation.py # Sample using the infer.yaml config file (functionally similar to sampling.py). You need to set up the config; see the example config in the command below.
python evaluation.py --config configs/latent-diffusion/mixed_cond/plip_imagenet_finetune_mixed_control_inference.yaml --save --inference --plot

- compute_metrics.py # Run to calculate the FID, KID, PLIP score etc.
python compute_metrics.py
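
For readers unfamiliar with KID, the following is a minimal pure-Python sketch of the standard unbiased estimator: a squared MMD with the cubic polynomial kernel k(x, y) = (x·y / d + 1)^3. It is not taken from compute_metrics.py, whose implementation may differ, and it assumes feature vectors have already been extracted from real and generated images; the function names are illustrative.

```python
from itertools import combinations, product


def poly_kernel(x, y):
    """Cubic polynomial kernel used by KID: (x.y / d + 1) ** 3."""
    d = len(x)
    return (sum(a * b for a, b in zip(x, y)) / d + 1.0) ** 3


def kid_unbiased(real_feats, fake_feats):
    """Unbiased squared-MMD estimate between two sets of feature vectors."""
    m, n = len(real_feats), len(fake_feats)
    if m < 2 or n < 2:
        raise ValueError("Each set needs at least two feature vectors.")
    # Within-set terms average over ordered pairs i != j; the kernel is
    # symmetric, so summing unordered pairs and doubling is equivalent.
    k_rr = 2.0 * sum(poly_kernel(a, b) for a, b in combinations(real_feats, 2)) / (m * (m - 1))
    k_ff = 2.0 * sum(poly_kernel(a, b) for a, b in combinations(fake_feats, 2)) / (n * (n - 1))
    # Cross term averages over every real/fake pair.
    k_rf = sum(poly_kernel(a, b) for a, b in product(real_feats, fake_feats)) / (m * n)
    return k_rr + k_ff - 2.0 * k_rf
```

Because the estimator is unbiased, it can return small negative values when the two feature sets are very similar; lower values indicate closer distributions.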

⚠️ Ethics Statement

This model is provided for research and educational purposes only. It is not designed, tested, or validated for clinical or diagnostic use. Under no circumstances should it be used to make medical decisions or inform patient care.

🤝 Acknowledgements

We thank the authors of PathLDM, ControlNet, CellViT, and DeepCMorph for releasing their code. We also thank the authors of the datasets used in this work.

📑 Citation

@misc{bhosale2025pathdiffhistopathologyimagesynthesis,
      title={PathDiff: Histopathology Image Synthesis with Unpaired Text and Mask Conditions}, 
      author={Mahesh Bhosale and Abdul Wasi and Yuanhao Zhai and Yunjie Tian and Samuel Border and Nan Xi and Pinaki Sarder and Junsong Yuan and David Doermann and Xuan Gong},
      year={2025},
      eprint={2506.23440},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2506.23440}, 
}

