Ecosystem & Stack: apache-arrow
AI Data Lakehouse Format
Python
6.4k
lance-format/lance
An open lakehouse format for multimodal AI, offering high-performance random access, vector indexing, and data versioning.
Machine Learning Data Library
Python
21.5k
huggingface/datasets
A lightweight library providing one-line dataloaders and efficient pre-processing tools for a vast hub of AI datasets, supporting various ML frameworks.
Columnar Data Processing Framework
Apache Arrow
2.9k
vortex-data/vortex
Vortex is an extensible, open-source columnar file format and toolkit for high-performance data processing and storage, particularly on object storage.
Robotics Middleware Framework
Rust
3.6k
dora-rs/dora
DORA is a high-performance, 100% Rust framework for building real-time, low-latency, and distributed AI-based robotic applications using a dataflow-oriented architecture.