MENU

Fun & Interesting

Anthropic found a "terrifying" consequence of adding reasoning to AI

bycloud 47,051 3 weeks ago
Video Not Working? Fix It Now

Master AI Agents in 2025 now with HubSpot's FREE resource! https://clickhubspot.com/n7u6 I have reworked this video many times, so it is definitely quite a long delay since the paper's first release. But I think there has been a lot of misunderstanding from the narratives people trying to build surrounding this research, so here is a deeper look into it without being too technical! My Newletter https://mail.bycloud.ai/ My Patreon https://www.patreon.com/c/bycloud Alignment faking in large language models [Paper] https://arxiv.org/abs/2412.14093 [Blog] https://www.anthropic.com/research/alignment-faking This video is supported by the kind Patrons & YouTube Members: 🙏Andrew Lescelius, Ben Shaener, Kainan, Chris LeDoux, Miguilim, Deagan, FiFaŁ, Robert Zawiasa, Marcelo Ferreira, Owen Ingraham, Daddy Wen, Tony Jimenez, Panther Modern, Jake Disco, Demilson Quintao, Penumbraa, Shuhong Chen, Hongbo Men, happi nyuu nyaa, Carol Lo, Mose Sakashita, Miguel, Bandera, Gennaro Schiano, gunwoo, Ravid Freedman, Mert Seftali, Mrityunjay, Richárd Nagyfi, Timo Steiner, Henrik G Sundt, projectAnthony, Brigham Hall, Kyle Hudson, Kalila, Jef Come, Jvari Williams, Tien Tien, BIll Mangrum, owned, Janne Kytölä, SO, Richárd Nagyfi, Hector, Drexon, Claxvii 177th, Inferencer, Michael Brenner, Akkusativ, Oleg Wock, FantomBloth, Thipok Tham, Clayton Ford, Theo, Handenon, Diego Silva, mayssam, Kadhai Pesalam, Tim Schulz, jiye, Anushka, Henrik Sundt, Julian Aßmann, Thomas Lin, Sid_Cypher, Mark Buckler, Kevin Tai, NO U, Gonzalo Fidalgo, Igor Alvarez, Alon Pluda, Clément Veyssière, Sander Zwaenepoel, etrotta, Binnie Yiu, Matej Macak, c zhou, Berhane-Meskel, sai sandeep mandava, Leo, Asad Dhamani, Charlie C, tantan assawade, Ângelo Fonseca, Stefan Lorenz, Paperboy, mika, Leo, Utsav Soi [Discord] https://discord.gg/NhJZGtH [Twitter] https://twitter.com/bycloudai [Patreon] https://www.patreon.com/bycloud [Business Inquiries] [email protected] [Profile & Banner Art] https://twitter.com/pygm7 [Video Editor] @Booga04 [Bitcoin (BTC)] 3JFMJQVGXNA2HJE5V9qCwLiqy6wHY9Vhdx [Ethereum (ETH)] 0x3d784F55E0bE5f35c1566B2E014598C0f354f190 [Litecoin (LTC)] MGHnqALjyU2W6NuJSSW9fTWV4dcHfwHZd7 [Bitcoin Cash (BCH)] 1LkyGfzHxnSfqMF8tN7ZGDwUTyBB6vcii9 [Solana (SOL)] 6XyMCEdVhtxJQRjMKgUJaySL8cGoBPzzA2NPDMPfVkKN [Ko-fi] https://ko-fi.com/bycloudai

Comment