Tags: #etl

Data Integration Platform
SeaTunnel Zeta Engine
9.3k

apache/seatunnel

SeaTunnel is a high-performance, distributed data integration tool designed to synchronize massive amounts of multimodal data from diverse sources with efficiency and stability.

Data Processing Library
Python
14.5k

Unstructured-IO/unstructured

An open-source ETL solution for transforming complex documents into clean, structured data formats, optimized for language models.

Workflow Orchestration Platform
Python
45.0k

apache/airflow

A robust open-source platform for programmatically authoring, scheduling, and monitoring data workflows.

AI Data Warehouse
Python
2.7k

datachain-ai/datachain

DataChain is a Python-based AI data warehouse for transforming, analyzing, and versioning unstructured multimodal data such as video, audio, PDFs, and images.

Data Orchestration Library
Python
2.4k

apache/hamilton

A Python library for building modular, testable, and self-documenting data transformation DAGs with built-in lineage and metadata tracking.

Data Orchestration Platform
Python
15.3k

dagster-io/dagster

A cloud-native data pipeline orchestrator designed for the development, production, and observation of data assets, featuring integrated lineage, observability, and a declarative programming model.

Workflow Orchestration Framework
Python
22.2k

PrefectHQ/prefect

Prefect is a Python-based workflow orchestration framework designed to build resilient and dynamic data pipelines, automating complex data processes with features like scheduling, caching, and retries.

Data Orchestration Platform
OpenJDK
1.4k

apache/hop

An open-source platform designed to facilitate all aspects of data and metadata orchestration, enabling efficient data integration and pipeline management.

Airflow Extension for dbt Orchestration
Apache Airflow
1.2k

astronomer/astronomer-cosmos

An extension that runs dbt Core projects as Apache Airflow DAGs and Task Groups, so dbt transformations can be orchestrated alongside other Airflow tasks.

© 2026 OSS Alternative. hotgithub.com - All rights reserved.