Tags: #data-processing

Go Library
Go
20.5k

qax-os/excelize

A pure Go library for programmatically reading, writing, and manipulating Microsoft Excel spreadsheet files (XLAM, XLSM, XLSX, XLTM, XLTX).

Technical Guide
Python
6.1k

datawhalechina/all-in-rag

A comprehensive, full-stack guide to Retrieval-Augmented Generation (RAG) technology, covering theory, practice, and engineering best practices for building LLM applications.

Kubernetes Workflow Engine
Kubernetes
16.6k

argoproj/argo-workflows

A container-native workflow engine for orchestrating parallel jobs and multi-step tasks on Kubernetes.

Machine Learning Data Library
Python
21.4k

huggingface/datasets

A lightweight library providing access to a large hub of ready-to-use datasets, along with efficient tools for data manipulation in AI and machine learning workflows.

LLM Dataset & Evaluation Platform
14.0k

ConardLi/easy-dataset

An application for generating high-quality datasets for LLM fine-tuning, RAG, and evaluation, featuring intelligent document processing and a comprehensive evaluation system.

High-Performance Data Engine
Python
5.4k

Eventual-Inc/Daft

A high-performance data engine for AI and multimodal workloads, processing diverse data types at scale with Python and Rust.

Columnar Data Format & Processing Framework
Apache Arrow
2.9k

vortex-data/vortex

An extensible columnar file format and toolkit designed for high-performance data processing and storage.

Curated List / Resource Collection
6.6k

pditommaso/awesome-pipeline

A comprehensive, curated list of powerful pipeline toolkits and workflow management systems for various data processing and automation needs.

Geospatial Data Processing Tool
Node.js
4.1k

mbloch/mapshaper

A JavaScript-based tool for editing and transforming geospatial data formats like Shapefile, GeoJSON, and TopoJSON, offering both command-line and interactive web interfaces.


© 2026 OSS Alternative. hotgithub.com - All rights reserved.