Ecosystem & Stack: GPU

LLM Inference and Serving Engine
python
76.3k

vllm-project/vllm

A high-throughput and memory-efficient open-source engine designed for fast, easy, and cost-effective serving of large language models.

Reinforcement Learning Framework
python
9.2k

OpenPipe/ART

An open-source framework that trains multi-step LLM agents with reinforcement learning (GRPO) so they learn from experience. It also offers a serverless RL training service.

LLM Fine-tuning Framework
python
11.7k

axolotl-ai-cloud/axolotl

A free and open-source framework designed for efficient and flexible fine-tuning of large language models.

Developer Tool for AI Model Serving
docker
2.7k

containers/ramalama

RamaLama simplifies local serving and production inference of AI models from any source by using familiar container patterns, which removes the need for complex host system configuration.

LLM Inference Optimization Engine
GPU
8.0k

LMCache/LMCache

LMCache is an LLM serving engine extension that reduces Time-To-First-Token (TTFT) and increases throughput, especially in long-context scenarios, by reusing KV caches instead of recomputing them.

AI-powered Document Processing Platform
python
5.2k

katanaml/sparrow

Sparrow is a production-ready platform for structured data extraction and instruction calling from various documents and images using ML, LLM, and Vision LLM technologies.

AI Data Curation Toolkit
NVIDIA NeMo
1.5k

NVIDIA-NeMo/Curator

A GPU-accelerated, scalable toolkit for multimodal data preprocessing and curation, designed to train better AI models faster.

LLM Training Framework
python
2.2k

AI-Hypercomputer/maxtext

A high-performance, scalable JAX-based open-source library for training large language models on Google Cloud TPUs and GPUs.

LLM Fine-tuning Framework
python
2.7k

stochasticai/xTuring

xTuring simplifies the fine-tuning, evaluation, and deployment of open-source Large Language Models (LLMs) on private data, ensuring privacy and efficiency.

AI Data Curation Platform
python
3.5k

Docta-ai/docta

Docta is an advanced data-centric AI platform that detects and rectifies issues in various data types to improve model performance.

AI Model Fine-tuning Tool
python
7.7k

XavierXiao/Dreambooth-Stable-Diffusion

An implementation of Google's DreamBooth technique on Stable Diffusion, enabling personalized fine-tuning of text-to-image models from a small number of example images.

On-device Multimodal AI Application
python
1.6k

fikrikarim/parlor

Parlor is an on-device, real-time multimodal AI that enables natural voice and vision conversations, running entirely on your local machine.

AI/ML Inference SDK
android
8.0k

NexaAI/nexa-sdk

A high-performance local inference framework for running frontier multimodal AI models on various devices with minimal energy consumption.

© 2026 OSS Alternative. hotgithub.com - All rights reserved.