Turn ANY Website into LLM Knowledge in SECONDS

Cole Medin 120,798 1 week ago

One of the biggest challenges we face with LLMs is that their knowledge is too general and too limited for anything new. That’s why RAG is such a huge topic in AI right now - it’s a method for providing an LLM with external knowledge you curate, so it can become an expert at something it wasn’t before - a specific AI framework, your ecommerce store, you name it. The problem is that the “curate” step can be very difficult and slow.

That is where Crawl4AI comes in! Crawl4AI is an open source web crawling framework specifically designed for scraping websites and formatting the output in the BEST possible way for an LLM to understand. The best part is that it solves a LOT of the problems we typically have with systems that crawl websites - usually they are slow, resource intensive, and complicated. But Crawl4AI is VERY fast, intuitive, easy to set up, and extremely memory efficient.

In this video, I show you how to use Crawl4AI to crawl websites for LLMs in just seconds, and at the end I even show you a RAG AI agent I’ve built to be a “Pydantic AI” framework expert, using Crawl4AI to build its knowledge base. You could take this and use it for any website you want. Next video I'll do a deep dive into this agent!

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Register now for the oTTomator AI Agent Hackathon with a $6,000 prize pool!
https://studio.ottomator.ai/hackathon/register

All code for this Crawl4AI RAG Agent can be found here:
https://github.com/coleam00/ottomator-agents/tree/main/crawl4AI-agent

Crawl4AI GitHub:
https://github.com/unclecode/crawl4ai

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

00:00 - The Beauty of Crawl4AI
02:16 - Why Crawl4AI?
05:25 - Basic Crawl4AI Example - Single Page Crawl
06:56 - Crawling Multiple Pages
08:58 - Ethics of Web Scraping
10:01 - Crawling Multiple Pages Continued
12:24 - FAST Parallel Page Crawling
15:19 - Crawl4AI RAG AI Agent
17:48 - Outro

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Join me as I push the limits of what is possible with AI. I'll be uploading videos at least two times a week - Sundays and Wednesdays at 7:00 PM CDT! Sundays and Wednesdays are for everything AI, with a focus on practical educational value. I will also sometimes post on Fridays at 7:00 PM CDT - specifically for platform showcases - sometimes sponsored, always creative in approach!
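
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

For readers who want a feel for the parallel page crawling idea named in the chapter list, here is a minimal sketch of the general pattern: fetch many URLs concurrently while capping how many run at once. This is not the code from the video or the linked repo. `crawl_parallel`, `fake_fetch`, and `max_concurrent` are hypothetical names made up for illustration, and `fake_fetch` is a stand-in that touches no network. In a real setup you would presumably swap it for a coroutine that calls Crawl4AI and returns the page's markdown.

```python
import asyncio
from typing import Awaitable, Callable

# Any async function that takes a URL and returns page content as text.
FetchFn = Callable[[str], Awaitable[str]]


async def crawl_parallel(
    urls: list[str], fetch: FetchFn, max_concurrent: int = 5
) -> dict[str, str]:
    """Fetch every URL concurrently, with at most max_concurrent in flight."""
    semaphore = asyncio.Semaphore(max_concurrent)

    async def fetch_one(url: str) -> tuple[str, str]:
        # The semaphore blocks here once max_concurrent fetches are running.
        async with semaphore:
            return url, await fetch(url)

    pairs = await asyncio.gather(*(fetch_one(u) for u in urls))
    return dict(pairs)


async def fake_fetch(url: str) -> str:
    # Hypothetical stand-in for a real crawler call (assumption: a real
    # version would return LLM-friendly markdown for the page).
    await asyncio.sleep(0.01)
    return f"# Markdown for {url}"


if __name__ == "__main__":
    pages = [f"https://example.com/page{i}" for i in range(3)]
    results = asyncio.run(crawl_parallel(pages, fake_fetch, max_concurrent=2))
    for url, markdown in results.items():
        print(url, "->", markdown)
```

Capping concurrency is also a simple way to stay polite to the site being crawled, which ties in with the ethics chapter above.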