Tags: #gpu-acceleration
sgl-project/sglang
A high-performance serving framework designed to accelerate inference for large language models and multimodal AI models.
vllm-project/vllm
A high-throughput and memory-efficient open-source engine designed for fast, easy, and cost-effective serving of large language models.
containers/ramalama
RamaLama simplifies local serving and production inference of AI models from any source by using familiar container workflows, eliminating the need for complex host-system configuration.
hpcaitech/ColossalAI
Colossal-AI makes training and deploying large AI models cheaper, faster, and more accessible through advanced distributed training techniques.
NVIDIA-NeMo/Curator
A GPU-accelerated, scalable toolkit for multimodal data preprocessing and curation, designed to train better AI models faster.
ModelCloud/GPTQModel
A toolkit for quantizing (compressing) Large Language Models (LLMs) with hardware acceleration across various GPUs and CPUs, integrating with popular inference frameworks.
alibaba/ROLL
An efficient, user-friendly library for scaling reinforcement learning with large language models across large GPU clusters.
vllm-project/vllm-omni
vLLM-Omni is an efficient, flexible, and easy-to-use framework extending vLLM to serve omni-modality models (text, image, video, audio) with high throughput and an OpenAI-compatible API.
withcatai/node-llama-cpp
A Node.js library providing bindings for llama.cpp, enabling local AI model inference with advanced features like JSON schema enforcement and function calling.
edwko/OuteTTS
A versatile interface for OuteTTS models, providing flexible text-to-speech generation capabilities across various AI inference backends and hardware platforms.