Tags: #mlops
mlflow/mlflow
An open-source AI engineering platform for debugging, evaluating, monitoring, and optimizing production-quality AI applications, including agents, LLMs, and ML models.
bentoml/BentoML
A Python library for building, deploying, and scaling AI/ML model inference APIs and serving systems.
Netflix/metaflow
A human-centric Python framework for building, managing, and deploying real-life AI/ML systems from rapid prototyping to reliable production.
wandb/wandb
An AI developer platform for tracking, visualizing, and managing machine learning models from experimentation to production.
treeverse/dvc
DVC (Data Version Control) is a command-line tool and VS Code extension for managing data, models, and ML experiments, enabling reproducible machine learning projects.
Arize-ai/phoenix
An open-source platform for debugging, evaluating, and monitoring AI/ML models and pipelines.
kedro-org/kedro
A Python framework for building reproducible, maintainable, and modular data engineering and data science pipelines using software engineering best practices.
SwanHubX/SwanLab
SwanLab is an open-source, modern-design platform for tracking, visualizing, and analyzing AI/ML training experiments, supporting cloud and self-hosted deployments.
kubeflow/pipelines
An open-source platform for building, deploying, and managing end-to-end machine learning workflows on Kubernetes.
EthicalML/awesome-production-machine-learning
A comprehensive curated list of open-source libraries for deploying, monitoring, versioning, and scaling machine learning models in production.
zenml-io/zenml
An open-source AI platform that unifies ML pipelines and agentic workflows, abstracting infrastructure complexity for ML/AI engineers.
polyaxon/polyaxon
A comprehensive MLOps platform for managing, orchestrating, and scaling the machine learning lifecycle with reproducibility and automation.
apache/hamilton
Apache Hamilton is a lightweight Python library that enables data scientists and engineers to define testable, modular, and self-documenting dataflows (DAGs) with built-in lineage and metadata, portable across any Python environment.
feast-dev/feast
An open-source feature store for AI/ML that streamlines the management and serving of features for model training and online inference.
clearml/clearml
ClearML streamlines AI/ML/LLM workflows with integrated experiment tracking, data management, MLOps/LLMOps orchestration, and model serving.
SeldonIO/seldon-core
An MLOps and LLMOps framework for deploying, managing, and scaling AI systems, from singular models to complex data-centric applications, on Kubernetes.
kelvins/awesome-mlops
A comprehensive and categorized collection of awesome MLOps tools and resources, designed to help practitioners navigate the complex MLOps ecosystem.
GokuMohandas/Made-With-ML
A comprehensive educational platform teaching developers how to design, develop, deploy, and iterate on production-grade machine learning applications.
PacktPublishing/LLM-Engineers-Handbook
A comprehensive practical guide and accompanying code repository for LLM engineers, covering the full lifecycle of building, deploying, and monitoring advanced LLM and RAG applications on AWS with LLMOps best practices.
tencentmusic/cube-studio
An open-source, cloud-native, all-in-one MLOps platform designed for the full lifecycle management of machine learning, deep learning, and large language model development and deployment.
decodingai-magazine/llm-twin-course
A free, hands-on course to build a production-ready LLM & RAG system, including a personalized AI replica, applying LLMOps best practices.
oumi-ai/oumi
An end-to-end platform for fine-tuning, evaluating, and deploying open-source Large Language Models (LLMs) and Vision Language Models (VLMs).
dstackai/dstack
A vendor-agnostic unified control plane for GPU provisioning and orchestration across clouds, Kubernetes, and on-prem for AI/ML workloads.
predibase/lorax
A multi-LoRA inference server designed to serve thousands of fine-tuned LLMs on a single GPU, significantly reducing serving costs while maintaining high throughput and low latency.
argilla-io/distilabel
Distilabel is a framework for generating synthetic data and AI feedback, enabling engineers to build fast, reliable, and scalable AI pipelines based on verified research.
microsoft/promptflow
A comprehensive development suite for building, testing, evaluating, deploying, and monitoring high-quality LLM-based AI applications.
mlrun/mlrun
An open-source MLOps and AI orchestration platform for building, managing, and automating continuous machine learning and generative AI applications across their entire lifecycle.
instill-ai/instill-core
An end-to-end AI platform for data, model, and pipeline orchestration, offering ETL, LLM hosting, and RAG capabilities to streamline AI application development.
ashleve/lightning-hydra-template
A user-friendly template integrating PyTorch Lightning and Hydra to streamline deep learning experimentation and development.
DataTalksClub/mlops-zoomcamp
A free 9-week online course from DataTalks.Club, designed to teach the fundamentals of MLOps, from experimentation to deployment and monitoring.