Tags: #language-models
LLM Evaluation Framework
Python
2.0k
tatsu-lab/alpaca_eval
An automatic, fast, and cost-effective evaluation framework for instruction-following language models, highly correlated with human judgments.
An automatic, fast, and cost-effective evaluation framework for instruction-following language models, highly correlated with human judgments.