Text to Image Diffusion AI Model from scratch - Explained one line of code at a time!

Neural Breakdown with AVB 17,567 lượt xem 11 months ago

Video Not Working? Fix It Now

In just 15 points, we talk about everything you need to know about Generative AI Diffusion models - from the basics to Latent Diffusion Models (LDMs) and Text-to-Image conditional Latent diffusion models. I also train a diffusion model with Pytorch on my laptop to demonstrate how it all works.

To access the full code repo && 15 minute code walkthrough video && 4000+ word script && 15+ animations && powerpoint slides used in this video (as well as others on my channel), please consider supporting us on Patreon! It helps the channel massively, so thanks for considering.

Patreon link: https://www.patreon.com/NeuralBreakdownwithAVB

#diffusion #ai #machinelearning #generativeai

Related videos:

So you think you know Text to Video Diffusion models?
https://youtu.be/KRTEOkYftUY

Attention Series: https://www.youtube.com/watch?v=frosrL1CEhw&list=PLGXWtN1HUjPfK_n9j5tPZ_a6Rx3yceZ_B

Latent Space: https://youtu.be/FslFZx08beM

CNNs: https://youtu.be/kebSR2Ph7zg

U-Net: https://youtu.be/jSvLCk4nvYk

NLP History: https://youtu.be/uocYQH0cWTs

Multimodal Models: https://youtu.be/-llkMpNH160

Papers:
DDPM: https://arxiv.org/pdf/2006.11239
CLIP: https://arxiv.org/pdf/2103.00020
LDMs: https://arxiv.org/pdf/2112.10752

Dataset:
You can search for CelebA dataset on Kaggle.

https://www.kaggle.com/datasets/jessicali9530/celeba-dataset/data

Timestamps:
0:00 - Intro
1:40 - 1
2:43 - 2
3:24 - 3
5:59 - 4
8:09 - 5
9:49 - 6
11:07 - 7
11:55 - 8
14:11 - 9
16:15 - 10
18:49- 11
19:48 - 12
21:03 - 13
22:07 - 14
23:27 - 15

machine learning

ai

deep learning

Comment