MENU

Fun & Interesting

How to scrape the web for LLM in 2024: Jina AI (Reader API), Mendable (firecrawl) and Scrapegraph-ai

LLMs for Devs 201,349 12 months ago
Video Not Working? Fix It Now

Watch this video on an AI-powered, interactive learning platform: https://app.catswithbats.com/lesson/d83e3d6e More content (paid): https://app.catswithbats.com/org/90d4bd29 👨‍💻 Code: https://github.com/trancethehuman/ai-workshop-code/blob/main/notebooks/Web_scraping_for_LLM_in_2024.ipynb (if there are issues with viewing the code, just fork and clone the repository. It's just a current problem with GitHub's way of displaying Jupyter notebooks - nbconvert) Tools mentioned: Jina AI: https://jina.ai/reader Mendable's Firecrawl: https://www.firecrawl.dev/ Scrapegraph-ai: https://github.com/VinciGit00/Scrapegraph-ai This workshop was made possible by Invest Ottawa. IO supports tech founders across the National Capital Region of Canada through their Venture Path programs. Each step in the IO Venture Path is designed to shorten the path to growth. They help today’s entrepreneurs leverage the experience, expertise and insights of businesses leaders who have been there, done that — successfully launched and grown world-class technology enterprises. To date, they’ve supported over 1100 startups and scaleups. If you’re a tech founder in the National Capital Region (or thinking of becoming one) – check out IO’s Venture Path: https://www.investottawa.ca/venture-path/

Comment