Tags: #data-generation

AI Data Framework
3.2k

argilla-io/distilabel

Distilabel is a framework for engineers to build fast, reliable, and scalable pipelines for synthetic data generation and AI feedback, based on verified research.

AI/ML Synthetic Data Generation Framework
Python
1.6k

NVIDIA-NeMo/DataDesigner

A flexible framework by NVIDIA NeMo for generating high-quality synthetic datasets with diverse distributions, meaningful correlations, and robust validation.

OSS Alternative

Explore the best open source alternatives to commercial software.

© 2026 OSS Alternative. hotgithub.com - All rights reserved.