pykeio/ort
Machine Learning Inference Library · Rust · 2.2k stars
Tags: #hardware-acceleration
A high-performance Rust interface for hardware-accelerated machine learning inference and training with ONNX models, leveraging ONNX Runtime and pure-Rust backends.
intel/auto-round
AI Optimization Library · Python · 1.0k stars
AutoRound is an advanced quantization toolkit for Large Language Models (LLMs) and Vision-Language Models (VLMs), enabling high-accuracy, ultra-low-bit inference across diverse hardware.
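To make "ultra-low-bit" concrete, here is a minimal sketch of symmetric int4 round-to-nearest weight quantization, the baseline that toolkits in this space improve on. This is a generic illustration, not AutoRound's actual algorithm (AutoRound tunes the rounding itself rather than using plain round-to-nearest), and the helper names are made up for this example.

```python
def quantize_int4(weights):
    """Map float weights to 4-bit integers in [-8, 7] with one shared scale.

    Plain symmetric round-to-nearest; real toolkits refine the rounding
    decisions to recover accuracy at these bit widths.
    """
    scale = max(abs(w) for w in weights) / 7  # 7 = largest positive int4 value
    q = [max(-8, min(7, round(w / scale))) for w in weights]
    return q, scale

def dequantize(q, scale):
    """Recover approximate float weights from the int4 codes."""
    return [v * scale for v in q]

weights = [0.12, -0.53, 0.98, -0.07]
q, scale = quantize_int4(weights)
restored = dequantize(q, scale)
# Each restored value lands within half a quantization step of the original.
```

At 4 bits the quantization step is coarse, which is why naive rounding loses accuracy on LLM weights and why tuned-rounding approaches exist.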
janhq/cortex.cpp
CLI Tool / Local AI Inference Platform · llama.cpp engine · 2.8k stars
A local AI API platform designed to run various AI models (vision, speech, language) on local hardware with an OpenAI-compatible API.
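"OpenAI-compatible API" means the local server accepts the same request shape as OpenAI's chat completions endpoint, so existing client code can be pointed at it by swapping the base URL. The sketch below builds such a request with only the standard library; the host, port, and model name are placeholders for this example, not cortex.cpp defaults.

```python
import json

# Placeholder endpoint for a locally running OpenAI-compatible server.
url = "http://localhost:8000/v1/chat/completions"

payload = {
    "model": "local-model",  # placeholder model identifier
    "messages": [
        {"role": "user", "content": "Summarize ONNX in one sentence."}
    ],
    "stream": False,
}
body = json.dumps(payload).encode("utf-8")

# To actually send it (requires a running server):
# import urllib.request
# req = urllib.request.Request(
#     url, data=body, headers={"Content-Type": "application/json"})
# with urllib.request.urlopen(req) as resp:
#     reply = json.loads(resp.read())["choices"][0]["message"]["content"]
```

Because the wire format matches, the same payload works against OpenAI's hosted API or any compatible local server.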