Tags: #distributed-computing
Distributed AI Compute Engine
python
42.3k
ray-project/ray
Ray is a unified framework for scaling AI and Python applications from a laptop to a cluster, simplifying complex ML workloads with a distributed runtime and specialized libraries.
Distributed AI Training Platform
Kubernetes
2.1k
kubeflow/trainer
A Kubernetes-native platform for scalable distributed AI model training and LLM fine-tuning across various frameworks.
High-Performance Data Engine
Python
5.4k
Eventual-Inc/Daft
A high-performance data engine for AI and multimodal workloads, processing diverse data types at scale with Python and Rust.
Distributed Job Scheduling Middleware
java
7.7k
PowerJob/PowerJob
An open-source distributed job scheduling and computing framework designed to simplify task orchestration and execution in enterprise applications.