In just 15 points, we talk about everything you need to know about Generative AI Diffusion models - from the basics to Latent Diffusion Models (LDMs) and Text-to-Image conditional Latent diffusion models. I also train a diffusion model with Pytorch on my laptop to demonstrate how it all works.
To access the full code repo && 15 minute code walkthrough video && 4000+ word script && 15+ animations && powerpoint slides used in this video (as well as others on my channel), please consider supporting us on Patreon! It helps the channel massively, so thanks for considering.
Patreon link: https://www.patreon.com/NeuralBreakdownwithAVB
#diffusion #ai #machinelearning #generativeai
Related videos:
So you think you know Text to Video Diffusion models?
https://youtu.be/KRTEOkYftUY
Attention Series: https://www.youtube.com/watch?v=frosrL1CEhw&list=PLGXWtN1HUjPfK_n9j5tPZ_a6Rx3yceZ_B
Latent Space: https://youtu.be/FslFZx08beM
CNNs: https://youtu.be/kebSR2Ph7zg
U-Net: https://youtu.be/jSvLCk4nvYk
NLP History: https://youtu.be/uocYQH0cWTs
Multimodal Models: https://youtu.be/-llkMpNH160
Papers:
DDPM: https://arxiv.org/pdf/2006.11239
CLIP: https://arxiv.org/pdf/2103.00020
LDMs: https://arxiv.org/pdf/2112.10752
Dataset:
You can search for CelebA dataset on Kaggle.
https://www.kaggle.com/datasets/jessicali9530/celeba-dataset/data
Timestamps:
0:00 - Intro
1:40 - 1
2:43 - 2
3:24 - 3
5:59 - 4
8:09 - 5
9:49 - 6
11:07 - 7
11:55 - 8
14:11 - 9
16:15 - 10
18:49- 11
19:48 - 12
21:03 - 13
22:07 - 14
23:27 - 15