Tags: #ocr

Document Processing and AI Data Preparation Library

58.5k

docling-project/docling

Docling simplifies document processing, parsing diverse formats including advanced PDF understanding, and provides seamless integrations with the generative AI ecosystem.

document processing pdf parsing generative ai

Details

AI/ML Framework for Document Processing

python

48.9k

LlamaIndex is an open-source framework designed to build intelligent agentic applications by connecting Large Language Models (LLMs) with private or custom data sources, focusing on document understanding and OCR.

ai llm rag

Details

Mobile Image Processing Application

Android OS

12.7k

T8RIN/ImageToolbox

A powerful Android application for advanced image manipulation, offering a wide range of tools from basic editing to filters and OCR.

image-editor image-processing android-app

Replaces:

Snapseed PicsArt...

Details

AI Agent Development Framework

python

7.8k

Upsonic/Upsonic

A Python framework for building autonomous and traditional AI agents, offering robust tools, prebuilt components, and integrated OCR capabilities.

python ai-agents llm

Details

AI-powered Document Automation Tool

Docker

2.3k

icereed/paperless-gpt

An AI-powered add-on for paperless-ngx that leverages LLMs and advanced OCR to automate document title, tag, correspondent, and custom field generation, streamlining digital document management.

llm ocr document-automation

Details

AI-powered Document Processing Platform

Python

5.2k

katanaml/sparrow

A production-ready platform for structured data extraction and instruction calling using ML, LLM, and Vision LLM technologies.

llm vision-llm data-extraction

Replaces:

Google Document AI Amazon Textract

Details

Desktop Media Player for Language Learning

.NET Desktop Runtime

3.7k

umlx5h/LLPlayer

An advanced media player designed for language learners, offering dual subtitles, AI-powered real-time translation and subtitle generation, and instant word lookup.

media player language learning subtitles

Details

AI Desktop Application Toolkit

10.7k

Baiyuetribe/paper2gui

Paper2GUI converts complex AI research papers into user-friendly, install-free desktop applications, making advanced AI accessible to everyone.

ai tools desktop app gui

Replaces:

Midjourney Topaz Video AI...

Details

Productivity Software

Tauri

17.8k

pot-app/pot-desktop

A versatile cross-platform desktop application that provides efficient text translation and optical character recognition (OCR) by integrating a wide array of AI and traditional service providers.

translation ocr desktop-app

Replaces:

DeepL Desktop App ABBYY FineReader

Details

AI Research Framework

1.8k

AlibabaResearch/AdvancedLiterateMachinery

A research initiative by Alibaba's Tongyi Lab, focusing on developing advanced AI systems capable of reading, thinking, and creating, with an initial emphasis on sophisticated OCR and document understanding technologies.

ocr document understanding large multimodal models

Details

Tags: #ocr

docling-project/docling

run-llama/llama_index

T8RIN/ImageToolbox

Upsonic/Upsonic

icereed/paperless-gpt

katanaml/sparrow

umlx5h/LLPlayer

Baiyuetribe/paper2gui

pot-app/pot-desktop

AlibabaResearch/AdvancedLiterateMachinery