High-Performance Data Engine
5.4k 2026-04-18
Eventual-Inc/Daft
A high-performance data engine for AI and multimodal workloads, processing diverse data types at scale with Python and Rust.
Core Features
Native multimodal processing (images, audio, video, structured data)
Built-in AI operations (LLM prompts, embeddings, classification)
Python-native with Rust under the hood for blazing performance
Seamless scaling from local to distributed clusters (Ray, Kubernetes)
Universal data connectivity (S3, GCS, Iceberg, Delta Lake, Hugging Face, Unity Catalog)
Quick Start
pip install daft
Detailed Introduction
Daft is a data engine built for AI and multimodal applications. It exposes a Python API on top of a Rust core, so pipelines are written in Python while the heavy lifting runs in Rust. It handles images, audio, video, and structured data, and it ships with built-in AI operations such as LLM prompts, embeddings, and classification. Pipelines can scale from a local machine to distributed clusters on Ray or Kubernetes, which lets developers build machine learning data pipelines without stitching together separate tools for each data type or deployment target.