Tags: #cost-reduction
LLM Caching Library
Python
8.0k
zilliztech/GPTCache
A semantic cache library for Large Language Models (LLMs) designed to significantly reduce API costs and accelerate response times.
A semantic cache library for Large Language Models (LLMs) designed to significantly reduce API costs and accelerate response times.