Tags: #vllm
LLM Inference Engine
Python
13.1k
GeeeekExplorer/nano-vllm
A lightweight and optimized Python library for fast offline large language model inference, offering comparable or better performance than vLLM with a more readable codebase.
Hardware Plugin
vLLM
2.0k
vllm-project/vllm-ascend
A community-maintained hardware plugin that enables vLLM to run seamlessly and efficiently on Ascend NPUs for large language model inference.