Data Pipeline Framework
10.9k 2026-04-26
kedro-org/kedro
A Python framework for building reproducible, maintainable, and modular data engineering and data science pipelines using software engineering best practices.
Core Features
Standardized Project Template
Versatile Data Catalog with Versioning
Automated Pipeline Dependency Resolution & Visualization
Enforces Software Engineering Best Practices
Quick Start
uv pip install kedroDetailed Introduction
Kedro is an open-source Python framework designed to bring software engineering best practices to data science and data engineering workflows. It helps users build production-ready data pipelines that are reproducible, maintainable, and modular. By providing a structured project template, a versatile data catalog for various data sources, and automatic pipeline dependency resolution, Kedro streamlines the development and deployment of complex data and machine learning projects, ensuring consistency and scalability.