Ecosystem & Stack: jax
Distributed AI Training Platform
Kubernetes
2.1k
kubeflow/trainer
A Kubernetes-native platform for scalable distributed AI model training and LLM fine-tuning across various frameworks.
Machine Learning Data Library
Python
21.5k
huggingface/datasets
A lightweight library providing one-line dataloaders and efficient pre-processing tools for a vast hub of AI datasets, supporting various ML frameworks.
Python Library for Multimodal AI Data
Python
3.1k
docarray/docarray
A Python library for representing, transmitting, storing, and retrieving multimodal data, designed for AI applications.
LLM Training Framework
python
2.2k
AI-Hypercomputer/maxtext
A high-performance, scalable JAX-based open-source library for training large language models on Google Cloud TPUs and GPUs.