Tags: #parallelism
Curated Resource List
Python
5.2k
xlite-dev/Awesome-LLM-Inference
A comprehensive, curated list of research papers and associated code implementations focused on optimizing Large Language Model (LLM) and Vision-Language Model (VLM) inference.