DeepSeek is rattling the whole tech world. As a regular SWE, I want to share my insights on why it's cheap and good.
I'm optimistic about its impact on the tech industry and the world. It could bring the product-market fit of AI closer to everyone.
Related DeepSeek Series:
Episode #1: Mixture of Experts https://youtu.be/Id0_4-nJQN4
Episode #2: Multihead Latent Attention https://youtu.be/oYDkqSPXyMg
Related Transformer Series: https://www.youtube.com/watch?v=3RB8WVu9t4Q&list=PLYzDKe6_687I8ZibqBYia1lrMITRjasPd
#llm
#deepseek
#openai
#gemini
#google
#nvda
#nlp
#transformer
#cuda
#fp8
#quantization