lance-format/lance
An open lakehouse format designed for multimodal AI, offering high-performance vector search, lightning-fast random access, and robust data versioning capabilities.
Core Features
Quick Start
pip install pylanceDetailed Introduction
Lance is an innovative open lakehouse format specifically engineered for multimodal AI workloads. It provides a unified file format, table format, and catalog specification, enabling the construction of complete lakehouses on object storage. This project addresses critical needs in AI workflows by facilitating high-performance I/O, random access, and advanced search capabilities for diverse data types, including embeddings. Its design prioritizes efficiency, scalability, and seamless integration with popular data science tools, making it ideal for building search engines, feature stores, and large-scale ML training pipelines.