Tags: #gpu-acceleration

AI Inference Framework
Python
25.7k

sgl-project/sglang

A high-performance serving framework designed to accelerate inference for large language models and multimodal AI models.

LLM Inference and Serving Engine
python
76.3k

vllm-project/vllm

A high-throughput and memory-efficient open-source engine designed for fast, easy, and cost-effective serving of large language models.

Developer Tool for AI Model Serving
docker
2.7k

containers/ramalama

RamaLama simplifies the local serving and production inference of AI models from any source by leveraging familiar container patterns, eliminating complex host system configurations.

Deep Learning Framework
python
41.4k

hpcaitech/ColossalAI

Colossal-AI makes training and deploying large AI models cheaper, faster, and more accessible through advanced distributed training techniques.

AI Data Curation Toolkit
NVIDIA NeMo
1.5k

NVIDIA-NeMo/Curator

A GPU-accelerated, scalable toolkit for multimodal data preprocessing and curation, designed to train better AI models faster.

LLM Optimization Toolkit
huggingface
1.1k

ModelCloud/GPTQModel

A toolkit for quantizing (compressing) Large Language Models (LLMs) with hardware acceleration across various GPUs and CPUs, integrating with popular inference frameworks.

Reinforcement Learning Library for LLMs
Ray
3.1k

alibaba/ROLL

An efficient and user-friendly library for scaling Reinforcement Learning with Large Language Models on large-scale GPU resources.

Multimodal AI Inference and Serving Framework
python
4.4k

vllm-project/vllm-omni

vLLM-Omni is an efficient, flexible, and easy-to-use framework extending vLLM to serve omni-modality models (text, image, video, audio) with high throughput and an OpenAI-compatible API.

Node.js Library for Local AI Inference
Node.js
2.0k

withcatai/node-llama-cpp

A Node.js library providing bindings for llama.cpp, enabling local AI model inference with advanced features like JSON schema enforcement and function calling.

AI/ML Library & SDK
Python
1.4k

edwko/OuteTTS

A versatile interface for OuteTTS models, providing flexible text-to-speech generation capabilities across various AI inference backends and hardware platforms.

OSS Alternative

Explore the best open source alternatives to commercial software.

© 2026 OSS Alternative. hotgithub.com - All rights reserved.