DiracDiffusion: Denoising and Incremental Reconstruction with Assured Data-Consistency (ICML 2024)

This is the official repository for the paper DiracDiffusion: Denoising and Incremental Reconstruction with Assured Data-Consistency.

DiracDiffusion: Denoising and Incremental Reconstruction with Assured Data-Consistency,
Zalan Fabian, Berk Tınaz, Mahdi Soltanolkotabi
ICML 2024

We introduce a novel framework for solving inverse problems using a generalized notion of diffusion that mimics the corruption process of a particular image degradation. Our method maintains data consistency throughout the reverse diffusion process allowing us to early-stop during reconstruction and flexibly trade off perceptual quality for lower distortion.

Requirements

CUDA-enabled GPU is necessary to run the code. We tested this code using:

A6000 GPUs
Ubuntu 20.04
CUDA 12.2
Python 3.10

Setup

In a clean virtual environment run

git clone https://github.com/z-fabian/dirac-diffusion
cd dirac-diffusion
pip install -r requirements.txt

Configure the root folder for each dataset in the dataset config file.
(Optional) Download pre-trained checkpoints into the checkpoints directory within the repository.

Data

We evaluate Dirac on CelebA-HQ (256x256), FFHQ (256x256) and ImageNet. Our code expects the datasets in the following library structure:

CelebA-HQ: the root folder contains all files directly, numbered 00000.jpg - 30000.jpg.
FFHQ: the root folder contains subfolders 00000 - 00069 each with 1000 images.
ImageNet: the root folder contains train and val folders.

Pre-trained models

Train dataset	Operator	Training loss	Checkpoint size	Link
CelebA-HQ	Gaussian blur	$\mathcal{L}_{IR}(\Delta t=0.0; \theta)$	776M	Download
CelebA-HQ	Gaussian blur	$\mathcal{L}_{IR}(\Delta t=1.0; \theta)$	776M	Download
CelebA-HQ	Inpainting	$\mathcal{L}_{IR}(\Delta t=0.0; \theta)$	776M	Download
ImageNet	Gaussian blur	$\mathcal{L}_{IR}(\Delta t=1.0; \theta)$	776M	Download
ImageNet	Inpainting	$\mathcal{L}_{IR}(\Delta t=0.0; \theta)$	5.8G	Download

Model training

To train an incremental reconstruction model from scratch, run

python scripts/train_dirac.py fit --config PATH_TO_CONFIG

and replace PATH_TO_CONFIG with the trainer config file. See trainer configs here for deblurring and inpainting experiments. In case your GPU doesn't support mixed-precision training, change precision to fp32 in the config file. You can train models with different architectural hyperparameters by adding a new key to the model config, and specifying the same key in the trainer config under model_arch.

Reconstruction

To reconstruct images, run

python scripts/recon_dirac.py --config_path PATH_TO_CONFIG --dataset DATASET_NAME

and replace PATH_TO_CONFIG with the reconstruction config file and DATASET_NAME with the name of the dataset to be reconstructed (either 'celeba256', 'ffhq' or 'imagenet'). You can find config files for both perception-optimized and distortion-optimized reconstructions here. Take a look at the annotated reference config to see all the options.

Citation

If you find our paper useful, please cite

@inproceedings{fabian2023diracdiffusion,
  title={Diracdiffusion: Denoising and incremental reconstruction with assured data-consistency},
  author={Fabian, Zalan and Tinaz, Berk and Soltanolkotabi, Mahdi},
  booktitle={Forty-first International Conference on Machine Learning},
  year={2023}
}

Acknowledgments

This repository builds upon code from

Elucidating the Design Space of Diffusion-Based Generative Models (EDM) for models
Diffusion Posterior Sampling for General Noisy Inverse Problems for some operators
NVIDIA NeMo Framework for EMA implementation

z-fabian / dirac-diffusion Goto Github PK

dirac-diffusion's Introduction

DiracDiffusion: Denoising and Incremental Reconstruction with Assured Data-Consistency (ICML 2024)

Requirements

Setup

Data

Pre-trained models

Model training

Reconstruction

Citation

Acknowledgments

dirac-diffusion's People

Contributors

Stargazers

Watchers

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent