Tags: #web-scraping
firecrawl/firecrawl
A Web Data API that provides clean, LLM-ready web data for AI agents, supporting scalable search, scraping, and interaction with the web.
D4Vinci/Scrapling
An adaptive Python web scraping framework designed to handle everything from single requests to full-scale crawls, featuring anti-bot bypass and self-healing parsers.
browserbase/stagehand
An SDK that combines AI and code for building reliable, flexible, and self-healing web automations.
browser-use/browser-use
A Python framework and cloud service that lets AI agents interact with websites and automate tasks on them.
TeamWiseFlow/wiseflow
Gives AI agents undetectable browser automation, smart search, and content creation capabilities for complex web interactions.
mishushakov/llm-scraper
A TypeScript library to extract structured data from any webpage using Large Language Models (LLMs).
ScrapeGraphAI/Scrapegraph-ai
A Python library that leverages LLMs and graph logic to simplify web scraping and data extraction from various sources.
rom1504/img2dataset
An efficient command-line tool that downloads images from large lists of URLs, resizes them, and packages them into ready-to-use datasets for machine learning.