Tags: #data-processing - OSS Alternative - Discover Top Open Source Alternatives to Popular Software

Tags: #data-processing

Go Library
go
20.5k

qax-os/excelize

A pure Go library for reading and writing Microsoft Excel spreadsheet files, supporting various formats and streaming API for large datasets.

Workflow Orchestration Engine
kubernetes
16.6k

argoproj/argo-workflows

An open-source, container-native workflow engine for Kubernetes, designed to orchestrate parallel jobs and complex multi-step tasks.

Machine Learning Data Library
Python
21.5k

huggingface/datasets

A lightweight library providing one-line dataloaders and efficient pre-processing tools for a vast hub of AI datasets, supporting various ML frameworks.

High-Performance Data Engine
Python
5.4k

Eventual-Inc/Daft

A high-performance data engine for AI and multimodal workloads, processing diverse data types at scale with Python and Rust.

Columnar Data Processing Framework
apache arrow
2.9k

vortex-data/vortex

Vortex is a next-generation, high-performance, and extensible open-source columnar file format and toolkit designed for blazing-fast data processing and storage, especially with object storage.

Curated List
6.6k

pditommaso/awesome-pipeline

A comprehensive, curated list of various pipeline toolkits, frameworks, and libraries for workflow management and data processing.

Data Processing CLI Tool
4.4k

rom1504/img2dataset

A highly efficient command-line tool to download, resize, and package large sets of image URLs into machine learning datasets.

Geospatial Data Processing Tool
Node.js
4.1k

mbloch/mapshaper

A JavaScript-based tool for editing and transforming geospatial data formats like Shapefile, GeoJSON, and TopoJSON, offering both command-line and interactive web interfaces.

LLM-powered Data Processing Framework
Python
3.7k

ucbepic/docetl

DocETL is an agentic LLM-powered framework designed for building and executing complex data processing and ETL pipelines, especially for documents.

OSS Alternative

Explore the best open source alternatives to commercial software.

© 2026 OSS Alternative. hotgithub.com - All rights reserved.