Tags: #synthetic-data - OSS Alternative - Discover Top Open Source Alternatives to Popular Software

AI Development Platform
Ollama
4.8k

Kiln-AI/Kiln

A free, all-in-one platform for building, evaluating, and optimizing AI systems, offering tools for RAG, agents, fine-tuning, and synthetic data generation.

AI/ML Data Curation Library
Python
1.7k

bespokelabsai/curator

A Python library for generating and curating high-quality synthetic data for AI model training and structured data extraction.

Data Security & Orchestration Platform
Docker
4.1k

nucleuscloud/neosync

An open-source platform for developers to anonymize sensitive production data, generate synthetic data, and sync environments for secure testing and improved developer experience.

AI/ML Data Generation Framework
Python
3.2k

argilla-io/distilabel

Distilabel is a framework for generating synthetic data and AI feedback, enabling engineers to build fast, reliable, and scalable AI pipelines based on verified research.

AI/ML Synthetic Data Generation Framework
Python
1.6k

NVIDIA-NeMo/DataDesigner

A flexible framework by NVIDIA NeMo for generating high-quality synthetic datasets with diverse distributions, meaningful correlations, and robust validation.
