To Data & Beyond

Unlocking the Mysteries of Diffusion Models: An In-Depth Exploration

Understanding the Basics Behind the Most Powerful Image Generation Models

Youssef Hosni
Sep 08, 2023

Midjourney, Stable Diffusion, DALL-E, and similar tools can generate an image, sometimes a strikingly beautiful one, from nothing but a text prompt. You may have heard these algorithms vaguely described as learning to subtract noise until an image emerges. This article gives a concrete explanation of the diffusion model on which these recent systems are built.

By the end of this article, you will understand the technical details of how a diffusion model works. We start with the intuition behind it and then examine the sampling process, which begins with pure noise and progressively refines it into a clean final image.
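To make the idea of "start from pure noise and refine it step by step" concrete, here is a minimal sketch of a DDPM-style sampling loop in plain Python. It is an illustration under stated assumptions, not the implementation developed later in the article: the linear noise schedule, the step count, and every name (`make_schedule`, `oracle_noise_predictor`, `sample`) are hypothetical. In particular, the noise predictor here is a stand-in "oracle" that already knows the target image, whereas a real diffusion model would use a trained neural network in its place.

```python
import math
import random


def make_schedule(num_steps, beta_start=1e-4, beta_end=0.02):
    """Linear noise schedule (an assumed choice); requires num_steps >= 2."""
    betas = [beta_start + (beta_end - beta_start) * i / (num_steps - 1)
             for i in range(num_steps)]
    alphas = [1.0 - b for b in betas]
    alpha_bars = []
    running = 1.0
    for a in alphas:
        running *= a  # cumulative product of the alphas
        alpha_bars.append(running)
    return betas, alphas, alpha_bars


def oracle_noise_predictor(x_t, alpha_bar_t, target):
    """Hypothetical stand-in for a trained network.

    It "cheats" by using the known target to compute exactly which noise
    would turn the target into x_t at this noise level.
    """
    scale = math.sqrt(alpha_bar_t)
    noise_scale = math.sqrt(1.0 - alpha_bar_t)
    return [(x - scale * x0) / noise_scale for x, x0 in zip(x_t, target)]


def sample(target, num_steps=500, seed=0):
    """Start from Gaussian noise and denoise step by step."""
    rng = random.Random(seed)
    betas, alphas, alpha_bars = make_schedule(num_steps)

    # Step 0 of sampling: a "image" made of pure noise.
    x = [rng.gauss(0.0, 1.0) for _ in target]

    for t in reversed(range(num_steps)):
        eps = oracle_noise_predictor(x, alpha_bars[t], target)
        coef = betas[t] / math.sqrt(1.0 - alpha_bars[t])
        # Subtract the predicted noise and rescale.
        mean = [(xi - coef * ei) / math.sqrt(alphas[t])
                for xi, ei in zip(x, eps)]
        if t > 0:
            # Add a little fresh noise back on every step except the last.
            sigma = math.sqrt(betas[t])
            x = [m + sigma * rng.gauss(0.0, 1.0) for m in mean]
        else:
            x = mean
    return x


if __name__ == "__main__":
    tiny_image = [0.5, -1.0, 0.25, 0.0]  # four "pixels" for illustration
    print(sample(tiny_image, num_steps=50))
```

Because the oracle predicts the noise perfectly, the loop should recover the four target "pixels" up to floating-point error. With a learned predictor the result would instead be a new sample resembling the training data, and a practical implementation would operate on image tensors rather than Python lists.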

You will learn how to build a neural network that predicts the noise in an image. You will then add context to the model so that you can control what it generates. Finally, by implementing more advanced sampling algorithms, you will learn how to speed up the sampling process by a factor of 10.

Table of Contents:

  1. The Intuition Behind Diffusion Models 

  2. Sampling Technique 

  3. Neural Network 

  4. Diffusion Model Training 

  5. Controlling the Diffusion Model Output

  6. Speeding Up the Sampling Process
