This repository provides a visual comparison between two generative modeling approaches: Diffusion Models and Flow Matching. The implementation focuses on a simple 2D toy dataset to help understand and visualize the fundamental differences between these methods.
Diffusion models work by gradually adding noise to data and then learning to reverse this process. They:
- Start with pure noise
- Gradually denoise the data through multiple steps
- Use a neural network to predict and remove noise at each step
- Follow a predefined noise schedule
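The noising process above can be sketched with a standard DDPM-style closed-form forward step. The repository's actual noise schedule is not shown in this section, so the linear beta schedule below is an illustrative assumption:

```python
import torch

def forward_diffusion(x0, t, betas):
    """Noise clean data x0 to timestep t in one shot using the closed form
    q(x_t | x_0) = sqrt(alpha_bar_t) * x_0 + sqrt(1 - alpha_bar_t) * eps."""
    alphas = 1.0 - betas
    alpha_bar = torch.cumprod(alphas, dim=0)[t]  # cumulative product up to t
    eps = torch.randn_like(x0)
    x_t = torch.sqrt(alpha_bar) * x0 + torch.sqrt(1.0 - alpha_bar) * eps
    return x_t, eps  # the network is trained to predict eps from (x_t, t)

# Illustrative linear beta schedule over 100 steps (an assumption, not the
# repository's exact schedule)
betas = torch.linspace(1e-4, 0.02, 100)
x0 = torch.randn(8, 2)  # a batch of 2D toy points
x_t, eps = forward_diffusion(x0, t=50, betas=betas)
```

At training time the model sees `x_t` and the timestep and regresses the sampled `eps`, which is what "predict and remove noise at each step" means in practice.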
Flow matching directly learns a continuous transformation between noise and data distributions. Flow matching models:
- Learn velocity fields that transform noise to data
- Use a single neural network to predict velocities
- Transform data in a single continuous flow
- Don't require a noise schedule
In this implementation, we use a linear interpolation path with added noise:
```python
x_t = (1 - t) * z + t * x_0 + noise * torch.sqrt(t * (1 - t))
```

where:

- `z` is the noise distribution
- `x_0` is the target data
- `t` is the time parameter (0 to 1)
- `noise` is small Gaussian noise scaled by `sigma=0.1`
- The noise term `torch.sqrt(t * (1 - t))` ensures smooth transitions
This path choice provides a simple yet effective way to transform between distributions, with the noise term helping to stabilize training and prevent mode collapse.
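The path above can be sampled directly during training. This sketch follows the formula from this section; the velocity target `x_0 - z` (the time derivative of the mean path) is a common flow-matching regression target and is an assumption here, not quoted from the repository:

```python
import torch

def sample_path(x0, sigma=0.1):
    """Sample a point on the noisy linear interpolation path:
    x_t = (1 - t) * z + t * x_0 + sigma * sqrt(t * (1 - t)) * eps."""
    z = torch.randn_like(x0)            # noise endpoint (t = 0)
    t = torch.rand(x0.shape[0], 1)      # uniform time in [0, 1]
    eps = torch.randn_like(x0)          # small stabilizing perturbation
    x_t = (1 - t) * z + t * x0 + sigma * torch.sqrt(t * (1 - t)) * eps
    target_v = x0 - z                   # d/dt of the mean path (assumed target)
    return x_t, t, target_v

x0 = torch.randn(16, 2)                 # toy 2D data batch
x_t, t, v = sample_path(x0)
```

Training then reduces to a mean-squared-error regression of the network's predicted velocity against `target_v` at the sampled `(x_t, t)` pairs.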
Create a conda environment and install dependencies:
```bash
conda create -n difffm python=3.10
conda activate difffm
pip install torch numpy matplotlib tqdm ipython
```

Run the main script to see the comparison:

```bash
python diff_vs_flowmatch.py run-diffusion-demo --n-epochs 200 --batch-size 128 --n-samples 1000
python diff_vs_flowmatch.py run-flow-matching-demo --n-epochs 200 --batch-size 128 --n-samples 1000
python diff_vs_flowmatch.py run-comparison-demo --n-epochs 200 --batch-size 128 --n-samples 1000
```

This will:
- Train both models on a toy dataset (3 Gaussian clusters)
- Generate samples from both models
- Create various visualizations to compare the approaches
The code generates several types of visualizations:
- **Sample Comparison**
  - Real vs Generated samples for both methods
  - Shows the final quality of generated samples
- **Trajectory Visualization**
  - Shows how individual points move during generation
  - Helps understand the path from noise to data
- **Intermediate Steps**
  - Displays snapshots of the generation process
  - Shows how the distribution evolves over time
- **Vector Fields**
  - Visualizes the learned vector fields at different timesteps
  - Shows how each model guides points toward the data distribution
- **Animations**
  - Dynamic visualization of the generation process
  - Creates GIFs showing the full transformation
The repository contains several key components:
- `SimpleMLP`: Neural network architecture shared by both models
- `DiffusionModel`: Implementation of the diffusion process
- `FlowMatching`: Implementation of the flow matching approach
- Various visualization functions for analysis and comparison
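The repository's `SimpleMLP` internals are not reproduced in this section; since both models need a time-conditioned network mapping a 2D point to a 2D output (noise or velocity), a minimal version might look like the following sketch, with illustrative layer sizes:

```python
import torch
import torch.nn as nn

class SimpleMLP(nn.Module):
    """Minimal time-conditioned MLP for 2D toy data. The input concatenates
    the point (x, y) with the scalar time t; the 2D output is interpreted as
    predicted noise (diffusion) or velocity (flow matching)."""
    def __init__(self, hidden_dim=128):  # hidden_dim is an assumption
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(2 + 1, hidden_dim),
            nn.ReLU(),
            nn.Linear(hidden_dim, hidden_dim),
            nn.ReLU(),
            nn.Linear(hidden_dim, 2),
        )

    def forward(self, x, t):
        return self.net(torch.cat([x, t], dim=-1))

model = SimpleMLP()
out = model(torch.randn(4, 2), torch.rand(4, 1))
```

Sharing one architecture across both methods keeps the comparison fair: any difference in sample quality then comes from the training objective and sampler, not from model capacity.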
The two approaches differ in several ways:

- **Training Objective**
  - Diffusion: predicts noise at each step
  - Flow Matching: predicts velocity vectors
- **Generation Process**
  - Diffusion: discrete steps from noise to data
  - Flow Matching: continuous flow from noise to data
- **Sampling**
  - Diffusion: uses multiple denoising steps
  - Flow Matching: uses ODE solvers (Euler or Heun's method)
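The flow-matching samplers named above are standard ODE integrators. A minimal sketch of both, assuming a model with the signature `model(x, t)` returning velocities (the repository's exact interfaces may differ):

```python
import torch

def euler_sample(model, n_samples=1000, n_steps=100):
    """Integrate dx/dt = v(x, t) from t = 0 (noise) to t = 1 (data)
    with the explicit Euler method."""
    x = torch.randn(n_samples, 2)   # start from pure noise
    dt = 1.0 / n_steps
    for i in range(n_steps):
        t = torch.full((n_samples, 1), i * dt)
        x = x + model(x, t) * dt    # single Euler step
    return x

def heun_sample(model, n_samples=1000, n_steps=100):
    """Heun's method: average the slopes at the start and (predicted) end
    of each step for second-order accuracy."""
    x = torch.randn(n_samples, 2)
    dt = 1.0 / n_steps
    for i in range(n_steps):
        t0 = torch.full((n_samples, 1), i * dt)
        t1 = torch.full((n_samples, 1), (i + 1) * dt)
        v0 = model(x, t0)
        x_pred = x + v0 * dt                 # Euler predictor
        v1 = model(x_pred, t1)
        x = x + 0.5 * (v0 + v1) * dt         # trapezoidal corrector
    return x

# Toy velocity field pulling points toward the origin, for illustration only
velocity = lambda x, t: -x
samples = euler_sample(velocity, n_samples=8, n_steps=50)
```

Heun's method costs two model evaluations per step but typically reaches the same sample quality with far fewer steps than Euler.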
The visualizations help understand how each method approaches the generation task:
- Diffusion models gradually denoise the data through many small steps
- Flow matching creates a smooth transformation from noise to data
- Both methods can successfully learn the target distribution
- Each method has its own characteristic generation trajectory
