ucbepic/docetl
LLM-powered Data Processing Framework
3.7k 2026-05-06

DocETL is an agentic LLM-powered framework designed for building and executing complex data processing and ETL pipelines, especially for documents.

Core Features

Interactive UI playground (DocWrangler) for iterative prompt engineering and pipeline development.
Python package for running production-grade data processing pipelines from the command line or from Python code.
Specialized for complex document processing tasks using agentic LLMs.
Integrates with LLM providers such as OpenAI and AWS Bedrock.
Enables step-by-step pipeline construction and real-time result experimentation.
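
To make the step-by-step idea in the list above concrete, here is a minimal plain-Python sketch of a two-step document pipeline: a map step that labels each document, then a reduce step that groups the results. It does not use DocETL's API. The names `fake_llm`, `map_step`, and `reduce_step` are hypothetical, and the LLM call is replaced by a deterministic keyword stub so the example runs offline.

```python
from collections import defaultdict
from typing import Callable

Doc = dict[str, str]


def fake_llm(prompt: str) -> str:
    """Hypothetical stand-in for an LLM call: labels text by keyword."""
    return "billing" if "invoice" in prompt.lower() else "other"


def map_step(docs: list[Doc], llm: Callable[[str], str]) -> list[Doc]:
    """Run the 'LLM' once per document and attach its answer as a new field."""
    return [{**d, "topic": llm(f"Classify this document: {d['text']}")} for d in docs]


def reduce_step(docs: list[Doc]) -> dict[str, list[str]]:
    """Group document ids by the topic assigned in the map step."""
    grouped: dict[str, list[str]] = defaultdict(list)
    for d in docs:
        grouped[d["topic"]].append(d["id"])
    return dict(grouped)


docs = [
    {"id": "a", "text": "Invoice #12 is overdue."},
    {"id": "b", "text": "Meeting notes from Tuesday."},
    {"id": "c", "text": "Please resend the invoice."},
]

# Each step can be run and inspected on its own before the next one is added.
mapped = map_step(docs, fake_llm)
summary = reduce_step(mapped)
print(summary)
```

Because each step returns ordinary data, intermediate results such as `mapped` can be inspected before the next step is added, which is the kind of iterative workflow the feature list describes.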

Quick Start

pip install docetl
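
After installing, a quick way to confirm the package is visible to the current interpreter is to ask the standard library for its installed version. This is a small sketch using only `importlib.metadata`; the helper name `installed_version` is hypothetical, and the check says nothing about whether an LLM provider is configured.

```python
from importlib import metadata
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


version = installed_version("docetl")
if version:
    print(f"docetl {version} is installed")
else:
    print("docetl is not installed; run: pip install docetl")
```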

Detailed Introduction

DocETL is a framework for building and running complex data processing and ETL pipelines, particularly document-centric ones. It relies on agentic large language models and comes in two parts: DocWrangler, an interactive UI playground for iterative pipeline development and prompt engineering, and a Python package for running production pipelines. Developers can design and test a document workflow in the playground, then run it at scale with the package, using LLM providers such as OpenAI and AWS Bedrock to process unstructured data.
