Tags: #cpu-inference
Technical Tutorial
Docker
2.4k
datawhalechina/handy-ollama
A comprehensive tutorial guiding users to deploy large language models locally on CPU using Ollama, making LLM inference accessible without dedicated GPU resources.
Text-to-Speech Model
onnx
3.0k
OpenMOSS/MOSS-TTS-Nano
MOSS-TTS-Nano is an open-source, multilingual, tiny speech generation model optimized for real-time CPU inference and lightweight integration.