Tags: #model-evaluation
ML/LLM Observability Framework
python
7.4k
evidentlyai/evidently
An open-source Python library for evaluating, testing, and monitoring ML and LLM systems from experiments to production, supporting tabular and text data with 100+ built-in metrics.
LLM Dataset Generation and Evaluation Platform
14.1k
ConardLi/easy-dataset
Easy Dataset is a powerful application for creating high-quality datasets for LLM fine-tuning, RAG, and model evaluation, featuring intelligent document processing and a comprehensive evaluation system.