mishushakov/llm-scraper
A TypeScript library that leverages Large Language Models to extract structured data from any webpage.
Core Features
Quick Start
npm i zod playwright llm-scraper
Detailed Introduction
LLM Scraper is a TypeScript library that uses Large Language Models (LLMs) to extract structured data from webpages. You describe the data you want with a Zod or JSON Schema definition, and the library returns the page's content in that shape. It is built on Playwright for browser automation, can stream extracted objects as they are produced, and can generate code for scraping scripts. It handles several content formats and works with a range of LLMs.
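The schema-driven idea described above can be sketched without any dependencies. This is only an illustration of the general technique (prompt a model with a schema, then validate what comes back); it does not use or reproduce llm-scraper's actual API. The names Schema, Model, validate, extract, and stubModel are all hypothetical, and the stub stands in for a real LLM call so the sketch runs offline.

```typescript
// Hypothetical, dependency-free sketch of schema-driven extraction.
// None of these names come from llm-scraper.

type FieldType = 'string' | 'number'

// A minimal schema: field name -> expected primitive type.
type Schema = Record<string, FieldType>

// Hypothetical model interface: takes a prompt, returns raw text.
type Model = (prompt: string) => Promise<string>

// Keep only the fields named in the schema and check each one's type.
function validate(
  schema: Schema,
  value: unknown
): Record<string, string | number> {
  if (typeof value !== 'object' || value === null || Array.isArray(value)) {
    throw new Error('expected a JSON object')
  }
  const record = value as Record<string, unknown>
  const out: Record<string, string | number> = {}
  for (const [key, type] of Object.entries(schema)) {
    const field = record[key]
    if (typeof field !== type) {
      throw new Error(`field "${key}" should be ${type}, got ${typeof field}`)
    }
    out[key] = field as string | number
  }
  return out
}

// Ask the model for JSON in the schema's shape, then validate the reply.
async function extract(model: Model, schema: Schema, pageText: string) {
  const prompt =
    `Return JSON matching ${JSON.stringify(schema)} for this page:\n${pageText}`
  const raw = await model(prompt)
  return validate(schema, JSON.parse(raw))
}

// Stub model with made-up output; a real LLM call would go here.
const stubModel: Model = async () => '{"title": "Example story", "points": 42}'
```

Calling extract(stubModel, { title: 'string', points: 'number' }, pageText) resolves to an object containing only the title and points fields. A reply that omits a field or uses the wrong type is rejected instead of being passed downstream, which is the main reason to pair a model with a schema.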