ConardLi/easy-dataset
Easy Dataset is a powerful application for creating high-quality datasets for LLM fine-tuning, RAG, and model evaluation, featuring intelligent document processing and a comprehensive evaluation system.
Core Features
Detailed Introduction
Easy Dataset is an application specifically designed for building large language model (LLM) datasets. It offers an intuitive interface, powerful document parsing, intelligent segmentation, data cleaning, and augmentation capabilities. The platform converts domain-specific documents into high-quality structured datasets for model fine-tuning, retrieval-augmented generation (RAG), and performance evaluation. Recent updates include robust evaluation capabilities, allowing users to generate evaluation datasets and conduct multi-dimensional assessments, including a human blind test system.