Paper: DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
Paper reading in the Discord group. All the lecture was improvised.
Join the group: https://discord.gg/JRKsaNbhCg
Link to paper: https://github.com/deepseek-ai/DeepSeek-R1/blob/main/DeepSeek_R1.pdf