AI/ML Data Processing Framework
3.4k 2026-04-30
towhee-io/towhee
A cutting-edge framework for building fast and simple neural data processing pipelines, especially for unstructured multi-modal data using LLMs.
Core Features
Multi-Modality Support (Image, Video, Text, Audio, Molecular)
LLM Pipeline Orchestration with prompt management and knowledge retrieval
Rich Operators with over 140 state-of-the-art models
Prebuilt ETL Pipelines for common AI tasks like RAG and search
High-performance backend leveraging Triton Inference Server and Docker
Quick Start
pip install towhee towhee.modelsDetailed Introduction
Towhee is a powerful framework designed to simplify and accelerate the processing of unstructured data, including text, images, audio, and video. It leverages Large Language Models (LLMs) and state-of-the-art deep learning models to orchestrate data pipelines, transforming raw data into valuable insights or embeddings for storage in systems like vector databases. With its Pythonic API, rich operator library, and high-performance backend, Towhee enables developers to quickly prototype and deploy robust AI applications, making complex neural data processing accessible and efficient.