Deepseek R1 & DeepSeek R1-Distill-Qwen-32B: Reasoning LM explained

Discover AI 17,862 3 months ago

Video Not Working? Fix It Now

With the new open-source DeepSeek R1 (Reasoning 1) model we have now access to a complete new family of open-source reasoning models from Qwen 1.5B to R1-Distill-Qwen32B. The new DeepSeek R1-Distill LM family explained - with benchmark data, compared to Sonnet 3.5, OpenAI o1 and other LLMs. deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B deepseek-ai/DeepSeek-R1-Distill-Qwen-7B deepseek-ai/DeepSeek-R1-Distill-Qwen-14B deepseek-ai/DeepSeek-R1-Distill-Qwen-32B https://huggingface.co/collections/deepseek-ai/deepseek-r1-678e1e131c0169c0bc89728d all rights with authors: https://github.com/deepseek-ai/DeepSeek-R1/tree/main #airesearch #aimodel #deepseek

Comment