MENU

Fun & Interesting

E03 Mixed Precision Training | Blockwise Quantization | Tensor and CUDA Cores (with Google Engineer)

Martin Is A Dad 193 2 weeks ago
Video Not Working? Fix It Now

DeepSeek is rattling the whole tech world. As a regular normal SWE, I want to share my insights on why it's cheap and good. I'm optimistic about it's impact on tech industry and the world. It could bring the product market fit of AI closer to everyone. Related Deepseek Series: Episode #1: Mixture of Experts https://youtu.be/Id0_4-nJQN4 Episode #2: Multihead Latent Attention https://youtu.be/oYDkqSPXyMg Related Transformer Series: https://www.youtube.com/watch?v=3RB8WVu9t4Q&list=PLYzDKe6_687I8ZibqBYia1lrMITRjasPd #llm #deepseek #openai #gemini #google #nvda #nlp #transformer #cuda #fp8 #quantization

Comment