Ecosystem & Stack: s3
LMCache/LMCache
LMCache is an LLM serving engine extension designed to significantly reduce Time-To-First-Token (TTFT) and boost throughput, especially for long-context scenarios, by intelligently reusing KV caches.
datachain-ai/datachain
DataChain is a Python-based AI-data warehouse for transforming, analyzing, and versioning unstructured multimodal data like video, audio, PDFs, and images.
aws/amazon-sagemaker-examples
A collection of Jupyter notebooks and a new Python SDK demonstrating how to build, train, and deploy machine learning models on Amazon SageMaker.
feast-dev/feast
An open-source feature store that streamlines the management and serving of features for AI/ML models, ensuring consistency between training and inference.
activeloopai/deeplake
Deep Lake is an AI data runtime and database optimized for deep learning, offering multimodal data storage, querying, vector search, and streaming for LLM and deep learning applications.
databendlabs/databend
A unified, open-source enterprise data warehouse built in Rust, offering analytics, vector search, and full-text search, specifically designed for AI agents with secure Python UDF sandboxes.
bghira/SimpleTuner
A user-friendly, versatile fine-tuning kit for image, video, and audio diffusion models, emphasizing simplicity and cutting-edge features.
nucleuscloud/neosync
An open-source platform for developers to anonymize PII, generate synthetic data, and sync environments, enabling secure testing and compliance.
Eventual-Inc/Daft
A high-performance data engine for AI and multimodal workloads, processing diverse data types at scale with Python and Rust.
openobserve/openobserve
An open-source, cost-effective observability platform for logs, metrics, traces, and RUM, offering significant storage cost savings and high performance.