Tags: #ai-inference

LLM Serving Platform
Python
5.1k

kvcache-ai/Mooncake

A KVCache-centric disaggregated architecture for high-performance LLM serving, powering leading AI services.

GPU Cluster Management Platform
Docker
4.8k

gpustack/gpustack

An open-source GPU cluster manager that orchestrates high-performance AI inference engines across diverse environments, optimizing model deployment and resource utilization.

AI Inference Platform
kubernetes
5.3k

kserve/kserve

KServe is a standardized, scalable, and multi-framework platform for deploying and serving both generative and predictive AI models on Kubernetes.

AI Inference Optimization Toolkit
python
10.1k

openvinotoolkit/openvino

OpenVINO is an open-source toolkit designed to optimize and deploy deep learning models for efficient AI inference across diverse hardware platforms, from edge to cloud.

Resource List
19.0k

cheahjs/free-llm-api-resources

A comprehensive, curated list of free and trial-based Large Language Model (LLM) inference APIs, detailing available models and their usage limits.

Node.js Library for Local AI Inference
Node.js
2.0k

withcatai/node-llama-cpp

A Node.js library providing bindings for llama.cpp, enabling local AI model inference with advanced features like JSON schema enforcement and function calling.

AI/ML Library & SDK
Python
1.4k

edwko/OuteTTS

A versatile interface for OuteTTS models, providing flexible text-to-speech generation capabilities across various AI inference backends and hardware platforms.

API Client Library / SDK
Python
2.4k

Stability-AI/stability-sdk

A Python SDK and CLI for programmatic access to Stability AI's generative AI APIs, enabling image generation, upscaling, and animation.

AI Inference Engine / Command-Line Tool
ggml
5.8k

leejet/stable-diffusion.cpp

A lightweight, pure C/C++ inference engine for various diffusion models, enabling efficient image and video generation across multiple platforms and hardware.

Stable Diffusion Management Tool
Python
8.0k

LykosAI/StabilityMatrix

A multi-platform package manager and inference UI designed to simplify the installation, updating, and management of various Stable Diffusion web UIs and related AI tools.

ComfyUI Plugin
ComfyUI
2.8k

nunchaku-ai/ComfyUI-nunchaku

An efficient ComfyUI plugin for accelerated 4-bit neural network inference, leveraging Nunchaku and SVDQuant for enhanced performance in AI image generation workflows.

OSS Alternative

Explore the best open source alternatives to commercial software.

© 2026 OSS Alternative. hotgithub.com - All rights reserved.