ColonCrafter: A Depth Estimation Model for Colonoscopy Videos Using Diffusion Priors

This repository contains the official implementation for the paper "ColonCrafter: A Depth Estimation Model for Colonoscopy Videos Using Diffusion Priors".

Installation

Requires Python 3.10 and CUDA 11.8.

# Create an environment
conda create -n coloncrafter python=3.10 -y
conda activate coloncrafter

# Install PyTorch
pip install torch==2.1.0 torchvision==0.16.0 --index-url https://download.pytorch.org/whl/cu118
pip install xformers==0.0.22.post7 --index-url https://download.pytorch.org/whl/cu118

# Install remaining dependencies
pip install -r requirements.txt

Usage

Depth Estimation

import torch
from src.coloncrafter import ColonCrafterInference

# Load model
device = torch.device("cuda")
model = ColonCrafterInference.from_pretrained("romainhardy/coloncrafter", device=device)

# Predict depth from video frames (N, C, H, W) in [0, 1] range
depth, disparity = model.predict_depth(
    video,
    num_inference_steps=1,
    window_size=16,
    overlap=8,
)

Style Transfer

import torch
from src.style import StyleTransferPipeline2D

# Load pipeline
pipeline = StyleTransferPipeline2D(
    model_id="CompVis/stable-diffusion-v1-4",
    device="cuda",
    dtype=torch.float16,
)

# Transfer style from reference to content images
output = pipeline.run(
    images_content,  # (B, C, H, W) in [-1, 1]
    images_style,    # (B, C, H, W) in [-1, 1]
    num_inference_steps=25,
)

See notebooks/ for complete examples.

Citation

If you use part of our work in your research, please cite it:

@article{hardy2025coloncrafter,
  title={ColonCrafter: A Depth Estimation Model for Colonoscopy Videos Using Diffusion Priors},
  author={Hardy, Romain and Berzin, Tyler and Rajpurkar, Pranav},
  journal={arXiv preprint arXiv:2509.13525},
  year={2025}
}

License

This code is released for academic and research purposes only. Commercial use is prohibited due to dependencies on DepthCrafter which has a non-commercial license.

The code is licensed under Apache 2.0. See LICENSE for details.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
data		data
notebooks		notebooks
src		src
.gitattributes		.gitattributes
.gitignore		.gitignore
.gitmodules		.gitmodules
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ColonCrafter: A Depth Estimation Model for Colonoscopy Videos Using Diffusion Priors

Installation

Usage

Depth Estimation

Style Transfer

Citation

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ColonCrafter: A Depth Estimation Model for Colonoscopy Videos Using Diffusion Priors

Installation

Usage

Depth Estimation

Style Transfer

Citation

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages