Tags: #data-versioning
CLI Tool, MLOps Tool
Python
15.5k
treeverse/dvc
A command-line tool and VS Code extension for data versioning, ML experiment tracking, and reproducible machine learning pipelines.
Data Lakehouse Format
Python
6.3k
lance-format/lance
An open lakehouse format designed for multimodal AI, offering high-performance vector search, lightning-fast random access, and robust data versioning capabilities.
AI Data Warehouse
python
2.7k
datachain-ai/datachain
DataChain is a Python-based AI-data warehouse for transforming, analyzing, and versioning unstructured multimodal data like video, audio, PDFs, and images.