Tags: #optimization

AI Model Utility

20.5k

p-e-w/heretic

Heretic is an AI model utility that automatically removes censorship and safety alignment from transformer-based language models without requiring expensive post-training.

llm censorship-removal ai-safety

Details

LLM Inference Optimization Engine

vllm

8.1k

LMCache is an LLM serving engine extension designed to significantly reduce Time-To-First-Token (TTFT) and boost throughput by intelligently reusing KV caches across various storage tiers and serving instances.

llm kv-cache inference

Details

Curated Resource List

Python

5.2k

xlite-dev/Awesome-LLM-Inference

A comprehensive, curated list of research papers and associated code implementations focused on optimizing Large Language Model (LLM) and Vision-Language Model (VLM) inference.

llm inference vlm inference optimization

Details

AI Agent Training Framework

python

17.0k

microsoft/agent-lightning

A versatile framework designed to train and optimize AI agents from any framework with minimal code changes, leveraging advanced algorithms like Reinforcement Learning.

ai-agents reinforcement-learning agent-training

Details

RAG Optimization Framework

python

4.7k

Marker-Inc-Korea/AutoRAG

An open-source framework that automates the evaluation and optimization of Retrieval-Augmented Generation (RAG) pipelines using AutoML-style automation for specific datasets.

rag automl llm

Details

Tags: #optimization

p-e-w/heretic

LMCache/LMCache

xlite-dev/Awesome-LLM-Inference

microsoft/agent-lightning

Marker-Inc-Korea/AutoRAG