Tags: #sota
AI Inference Optimization Proxy
Python
3.5k
algorithmicsuperintelligence/optillm
OptiLLM is an OpenAI API-compatible inference proxy that uses 20+ state-of-the-art techniques to significantly boost LLM accuracy and performance on reasoning tasks without requiring any training.