AI Development & Deployment Platform
25.9k 2026-04-12
modular/modular
A unified, open platform for accelerating AI model serving and scaling GenAI deployments with industry-leading performance across various hardware.
Core Features
Unified platform for AI development and deployment (MAX & Mojo).
Accelerates model serving and scales GenAI deployments.
Abstracts hardware complexity for optimal GPU/CPU performance.
OpenAI-compatible REST API for model inference.
Kubernetes-compatible Docker containers for easy deployment.
Quick Start
docker run --gpus=1 -v ~/.cache/huggingface:/root/.cache/huggingface -p 8000:8000 modular/max-nvidia-full:latest --model-path google/gemma-3-27b-itDetailed Introduction
The Modular Platform is an integrated suite of AI libraries and tools designed to streamline the development and deployment of AI models, particularly for GenAI. It provides MAX and Mojo, enabling developers to achieve industry-leading performance on GPUs and CPUs by abstracting away hardware complexities. The platform supports popular open models, offers an OpenAI-compatible API for inference, and facilitates scalable deployments through Kubernetes-compatible containers, making it a powerful solution for production-grade AI.