Ecosystem & Stack: xpu
Deep Learning Library
python
8.2k
bitsandbytes-foundation/bitsandbytes
A PyTorch library enabling accessible large language models through k-bit quantization, significantly reducing memory consumption for both inference and training.
AI/ML Inference Serving Framework
Hugging Face
4.6k
vllm-project/vllm-omni
A framework for efficient, fast, and cheap serving of omni-modality (text, image, video, audio) AI models.