Tags: #rag
comet-ml/opik
An open-source platform for debugging, evaluating, and monitoring LLM applications, RAG systems, and agentic workflows from prototype to production.
onyx-dot-app/onyx
Onyx is an open-source AI platform providing a feature-rich interface for Large Language Models, enabling advanced AI chat with RAG, web search, and agentic capabilities.
deepset-ai/haystack
An open-source AI orchestration framework for building production-ready LLM applications with modular pipelines and agent workflows.
labring/FastGPT
FastGPT is an AI Agent building platform offering out-of-the-box data processing, RAG retrieval, and visual AI workflow orchestration for complex question-answering systems.
Tencent/WeKnora
An LLM-powered framework for enterprise-grade document understanding, semantic retrieval, and context-aware Q&A using RAG and ReAct agents.
infiniflow/ragflow
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs.
dataelement/bisheng
An open LLM DevOps platform for building, deploying, and managing next-generation enterprise AI applications, with features such as GenAI workflows, RAG, and agent capabilities.
elizaOS/eliza
An open-source framework for building, deploying, and managing autonomous multi-agent AI applications with a modern, extensible platform.
open-webui/open-webui
A user-friendly, extensible, and feature-rich self-hosted AI platform supporting various LLM runners like Ollama and OpenAI-compatible APIs, with built-in RAG capabilities.
1Panel-dev/MaxKB
An open-source platform for building enterprise-grade AI agents, integrating RAG, workflows, and tool-use for intelligent Q&A and complex business scenarios.
HKUDS/LightRAG
LightRAG is a simple and fast Retrieval-Augmented Generation (RAG) system designed for efficient and scalable knowledge retrieval and generation with Large Language Models.
run-llama/llama_index
LlamaIndex is an open-source framework for building agentic applications, specializing in document processing, OCR, parsing, and indexing to empower LLMs.
neuml/txtai
An all-in-one AI framework for semantic search, LLM orchestration, and language model workflows, built around an embeddings database.
coze-dev/coze-studio
An all-in-one visual development platform that simplifies the creation, debugging, and deployment of AI agents using low-code/no-code approaches.
codexu/note-gen
A cross-platform Markdown AI note-taking application that leverages AI to transform fragmented knowledge into organized, readable notes.
VectifyAI/PageIndex
PageIndex is a vectorless, reasoning-based RAG system that builds a hierarchical tree index from long documents for agentic, context-aware retrieval, simulating human expert navigation.
arc53/DocsGPT
DocsGPT is an open-source private AI platform for building intelligent agents, assistants, and enterprise search solutions with robust document analysis and multi-model support.
llmware-ai/llmware
A unified framework for building local, private, and secure enterprise RAG pipelines using small, specialized LLMs optimized for on-device and edge deployment.
FlagOpen/FlagEmbedding
A comprehensive toolkit providing state-of-the-art embedding and reranker models for efficient information retrieval and Retrieval-Augmented Generation (RAG) applications.
microsoft/graphrag
GraphRAG is a modular, graph-based Retrieval-Augmented Generation (RAG) system that leverages LLMs to extract structured data from unstructured text, enhancing reasoning on private datasets.
ageerle/ruoyi-ai
A one-stop enterprise-grade AI application development framework supporting multi-vendor LLM integration, secure knowledge bases, visual workflow orchestration, and multi-agent collaboration to rapidly build AI agent applications.
UnicomAI/wanwu
An enterprise-grade, multi-tenant AI agent development platform from China Unicom, offering full-lifecycle model management, workflow orchestration, and RAG capabilities for secure and efficient AI application building.
yichuan-w/LEANN
LEANN is an innovative vector database enabling fast, accurate, and 100% private RAG on personal devices with 97% storage savings.
Arindam200/awesome-ai-apps
A comprehensive collection of practical examples, tutorials, and recipes for building powerful LLM-powered applications.
langchain4j/langchain4j
LangChain4j is an open-source Java library that simplifies the integration of Large Language Models (LLMs) into Java applications.
meta-llama/llama-cookbook
A comprehensive guide and collection of recipes for building with the Llama model family, covering inference, fine-tuning, RAG, and end-to-end use cases.
datawhalechina/all-in-rag
A comprehensive, full-stack guide to Retrieval-Augmented Generation (RAG) technology, covering theory, practice, and engineering best practices for building LLM applications.
adongwanai/AgentGuide
A comprehensive, job-oriented guide for AI Agent development, covering core technologies, practical projects, and interview preparation.
datawhalechina/llm-universe
A beginner-friendly tutorial for LLM application development, focusing on building a personal knowledge base assistant using cloud services.
NirDiamant/RAG_Techniques
A comprehensive repository showcasing advanced Retrieval-Augmented Generation (RAG) techniques through detailed, practical notebook tutorials.
ragapp/ragapp
Simplifies the deployment of agentic Retrieval-Augmented Generation (RAG) applications within enterprise cloud infrastructure, offering an open-source alternative to custom GPTs.
run-llama/rags
A Streamlit app to build and query custom Retrieval-Augmented Generation (RAG) pipelines over your data using natural language.
crmne/ruby_llm
A unified Ruby API for integrating with diverse Large Language Models (LLMs) from multiple providers, streamlining AI application development.
Kiln-AI/Kiln
A free, comprehensive platform for building, evaluating, and optimizing AI systems, offering tools for RAG, fine-tuning, agents, and synthetic data generation.
clusterzx/paperless-ai
An AI-powered extension for Paperless-ngx that automates document classification, smart tagging, and enables semantic search using various AI models.
OpenBMB/UltraRAG
UltraRAG is a low-code framework leveraging the Model Context Protocol (MCP) to simplify the construction and orchestration of complex, innovative RAG pipelines for AI applications.
weaviate/weaviate
An open-source, cloud-native vector database enabling semantic search at scale by combining vector similarity with structured filtering and integrated AI models.
chatchat-space/Langchain-Chatchat
An open-source, offline-deployable RAG and Agent application built with LangChain, supporting various local and online Large Language Models for knowledge-based Q&A.
liaokongVFX/LangChain-Chinese-Getting-Started-Guide
A comprehensive Chinese-language guide to getting started with LangChain, enabling developers to build powerful applications powered by large language models.
iusztinpaul/hands-on-llms
Learn to design, train, and deploy a real-time financial advisor LLM system using a hands-on, three-pipeline approach.
TaskingAI/TaskingAI
An open-source Backend as a Service (BaaS) platform for developing, deploying, and managing LLM-based AI agents and applications.
rag-web-ui/rag-web-ui
An intelligent dialogue system leveraging RAG technology to build Q&A services based on your custom knowledge base.
heshengtao/comfyui_LLM_party
A ComfyUI extension providing a comprehensive LLM agent framework for building custom AI assistants, integrating diverse AI models, and automating complex workflows.
NirDiamant/agents-towards-production
An open-source playbook offering end-to-end, code-first tutorials for building and deploying production-grade GenAI agents from prototype to enterprise scale.
truefoundry/cognita
A modular, API-driven RAG framework designed for building scalable, production-ready AI applications, addressing the complexities of deploying RAG systems beyond prototyping.
PacktPublishing/LLM-Engineers-Handbook
A comprehensive practical guide and accompanying codebase for building, deploying, and monitoring advanced LLM and RAG applications on AWS, emphasizing LLMOps best practices.
decodingai-magazine/second-brain-ai-assistant-course
An open-source course teaching how to build a production-ready Second Brain AI assistant leveraging LLMs, RAG, agents, and LLMOps for personal knowledge management.
VoltAgent/voltagent
An end-to-end AI Agent Engineering Platform providing an open-source TypeScript framework for building intelligent agents and a console for operations and observability.
lancedb/lancedb
An open-source, embedded retrieval library and multimodal AI lakehouse designed for fast, scalable vector search and data management in AI/ML applications.
infiniflow/infinity
An AI-native database optimized for LLM applications, offering fast hybrid search across dense vectors, sparse vectors, tensors, and full-text data.
HelixDB/helix-db
HelixDB is an open-source, Rust-built graph-vector database that consolidates multiple data models to simplify AI application development.
memvid/memvid
Memvid is a serverless, single-file memory layer for AI agents, offering instant retrieval and long-term memory by replacing complex RAG pipelines and vector databases.
milvus-io/bootcamp
An interactive learning platform providing tutorials and demos to master Milvus for building AI applications with unstructured data.
oramasearch/orama
A tiny, versatile JavaScript search engine offering full-text, vector, and hybrid search with RAG capabilities for browser, server, and edge.
pathwaycom/llm-app
Ready-to-run cloud templates for RAG, AI pipelines, and enterprise search with live data, always in sync with various data sources.
liyupi/yu-ai-agent
A comprehensive AI development tutorial project based on Spring Boot 3 and Spring AI, guiding developers to build AI applications and agents with core AI technologies.
zilliztech/deep-searcher
An open-source platform leveraging LLMs and vector databases to enable deep research, intelligent Q&A, and comprehensive reporting on private enterprise data.
devflowinc/trieve
An all-in-one API platform providing advanced search, recommendations, and Retrieval-Augmented Generation (RAG) capabilities for developers.
decodingai-magazine/llm-twin-course
A free, hands-on course teaching how to build and deploy production-ready LLM and RAG systems, including a personalized 'LLM Twin', using LLMOps best practices.
athina-ai/rag-cookbooks
A comprehensive repository offering practical implementations and evaluation guidance for advanced and agentic Retrieval-Augmented Generation (RAG) techniques.
ConardLi/easy-dataset
An application for generating high-quality datasets for LLM fine-tuning, RAG, and evaluation, featuring intelligent document processing and a comprehensive evaluation system.
adithya-s-k/AI-Engineering.academy
A structured, community-driven learning platform designed to make complex applied AI concepts accessible and practical for everyone.
dair-ai/Prompt-Engineering-Guide
A comprehensive guide and resource hub for prompt engineering, context engineering, RAG, and AI Agents, designed to empower developers and researchers in leveraging large language models.
morphik-org/morphik-core
Morphik Core is an AI-native platform providing accurate document search and storage for building robust AI applications. It is designed to handle complex, visually rich, and multimodal data that traditional RAG handles poorly.
GaiZhenbiao/ChuanhuChatGPT
A user-friendly web GUI for various LLMs, enhancing interaction with features like agents, file-based QA, and fine-tuning.
Marker-Inc-Korea/AutoRAG
An open-source framework that automates the evaluation and optimization of Retrieval-Augmented Generation (RAG) pipelines using AutoML-style techniques for specific datasets.
EmbeddedLLM/JamAIBase
A collaborative, spreadsheet-like platform for building, experimenting with, and evaluating AI applications, especially those leveraging Retrieval-Augmented Generation (RAG).
zhimaAi/chatwiki
ChatWiki is an AI agent and RAG knowledge base platform for WeChat official accounts, enabling workflow automation and intelligent customer service.
postgresml/postgresml
PostgresML integrates machine learning and AI capabilities, including GPU acceleration and large language models, directly into PostgreSQL, eliminating the need for separate systems and data transfers.
reorproject/reor
A private, local-first AI-powered desktop app for personal knowledge management, offering automatic note linking, semantic search, and Q&A on your notes.
PeterH0323/Streamer-Sales
An AI-powered large language model designed to generate compelling product descriptions and sales pitches, enhancing live streaming and e-commerce sales.
InternLM/HuixiangDou
HuixiangDou is an LLM-powered professional knowledge assistant designed to provide accurate technical support in group chat environments without message flooding.
SciSharp/LLamaSharp
A cross-platform C#/.NET library for efficient local inference of large language models (LLMs) like LLaMA and LLaVA.
johnbean393/Sidekick
A native macOS app for chatting with a local LLM that answers using information from files, folders, and websites on your Mac, preserving privacy and working offline.