Tags: #llm
langfuse/langfuse
An open-source LLM engineering platform providing observability, metrics, evaluations, and prompt management for developing and debugging AI applications.
langgenius/dify
A production-ready platform designed for developing and deploying agentic AI applications and complex workflows.
unslothai/unsloth
Unsloth Studio is a web UI that enables efficient local training and inference of open-source large language models and other AI models with significant VRAM and speed optimizations.
ZhuLinsen/daily_stock_analysis
An LLM-driven system for automated daily stock analysis across A/H/US markets, providing decision dashboards and multi-channel notifications at no cost.
lobehub/lobehub
A comprehensive platform for building, deploying, and collaborating with AI agents, designed to enhance productivity and foster human-agent co-evolution.
bytedance/deer-flow
DeerFlow is an open-source SuperAgent harness designed to research, code, and create by orchestrating sub-agents, memory, and sandboxes with extensible skills for long-horizon tasks.
comet-ml/opik
An open-source platform for comprehensive tracing, evaluation, and optimization of LLM applications, RAG systems, and agentic workflows.
MemoriLabs/Memori
Memori provides an LLM-agnostic layer that transforms AI agent execution and conversations into structured, persistent memory for production systems.
mastra-ai/mastra
A modern TypeScript framework for building, tuning, and scaling reliable AI-powered applications and autonomous agents.
langchain-ai/langchain
LangChain is a framework for building robust LLM-powered applications and intelligent agents by chaining together interoperable components and third-party integrations.
libukai/awesome-agent-skills
A comprehensive guide and curated collection of resources for Agent Skills, designed to help users quickly understand, implement, and manage AI agent capabilities.
onyx-dot-app/onyx
An open-source AI platform offering an advanced, feature-rich chat interface compatible with all major LLMs, enabling RAG, web search, and custom agents.
yamadashy/repomix
Repomix is a powerful CLI tool that packages entire code repositories into AI-friendly formats, optimized for Large Language Models (LLMs) and other AI tools.
langchain-ai/deepagents
Deep Agents is a batteries-included AI agent harness built with LangChain and LangGraph, offering out-of-the-box capabilities for planning, file system interaction, sub-agent delegation, and intelligent context management to tackle complex agentic tasks efficiently.
cloudwego/eino
A Golang framework for building sophisticated LLM and AI applications, inspired by LangChain and Google ADK.
BerriAI/litellm
A unified open-source AI Gateway and Python SDK to call over 100 LLM APIs in an OpenAI-compatible format, offering features like cost tracking, load balancing, and guardrails.
deepset-ai/haystack
An open-source AI orchestration framework for building production-ready LLM applications with modular pipelines and agent workflows.
modelscope/ms-swift
A scalable and lightweight infrastructure for fine-tuning, inference, and deployment of over 1000 large language models (LLMs) and multimodal large language models (MLLMs) using advanced techniques.
labring/FastGPT
FastGPT is an LLM-powered platform for building AI agents and complex question-answering systems, offering out-of-the-box data processing, RAG, and visual workflow orchestration.
promptfoo/promptfoo
A CLI and library for testing, evaluating, and red-teaming LLM applications to ensure security, reliability, and performance across various models.
Mintplex-Labs/anything-llm
An all-in-one, privacy-first AI application for chatting with documents and automating workflows using AI agents, designed for easy local deployment.
huggingface/peft
A state-of-the-art library for Parameter-Efficient Fine-Tuning (PEFT) of large pretrained models, drastically reducing computational and storage costs.
Chainlit/chainlit
A Python framework for rapidly building and deploying production-ready conversational AI applications.
upstash/context7
Context7 provides up-to-date, version-specific code documentation directly to LLMs and AI code editors, eliminating outdated examples and hallucinations for more accurate code generation.
vllm-project/vllm
vLLM is a high-throughput and memory-efficient open-source library designed for fast and easy serving of large language models.
langbot-app/LangBot
A production-grade, open-source platform for building and deploying AI-powered instant messaging bots across various chat platforms.
Tencent/WeKnora
An LLM-powered framework for deep document understanding, semantic retrieval, and context-aware Q&A, leveraging RAG and advanced reasoning agents for enterprise knowledge management.
linshenkx/prompt-optimizer
A powerful AI prompt optimization tool that enhances AI output quality through intelligent, multi-model, and multi-platform prompt refinement.
AstrBotDevs/AstrBot
AstrBot is an open-source, all-in-one AI agent chatbot platform and development framework that integrates with various instant messaging apps and LLMs, offering scalable conversational AI infrastructure.
looplj/axonhub
AxonHub is an open-source AI gateway that enables seamless integration with over 100 LLMs using any SDK, featuring built-in failover, load balancing, cost control, and end-to-end tracing.
thesysdev/openui
An open standard and full-stack framework for building token-efficient, streaming-first generative UIs using a compact language and React runtime.
awslabs/amazon-bedrock-agentcore-samples
Amazon Bedrock AgentCore provides a framework-agnostic and model-agnostic infrastructure for securely deploying and operating advanced AI agents at scale, with this repository offering practical samples and tutorials.
infiniflow/ragflow
An open-source Retrieval-Augmented Generation (RAG) engine that integrates RAG with Agent capabilities to provide a superior context layer for Large Language Models (LLMs).
MemTensor/MemOS
MemOS is an AI memory operating system designed for LLMs and AI agents, providing persistent, context-aware, and multi-modal memory for enhanced skill reuse and evolution across tasks.
datapizza-labs/datapizza-ai
A Python framework for building reliable, predictable, and observable Generative AI agents with minimal overhead.
Mai-with-u/MaiBot
MaiBot is an LLM-based intelligent agent designed to be a human-like digital companion, prioritizing warmth, authenticity, and genuine connection over perfection or efficiency.
embabel/embabel-agent
A JVM-based framework for building intelligent agents that dynamically combine LLM interactions with custom code and domain models to achieve complex goals.
camel-ai/camel
A multi-agent framework for studying emergent behaviors and scaling laws in AI systems through simulation and interaction.
mlflow/mlflow
An open-source AI engineering platform for debugging, evaluating, monitoring, and optimizing production-quality AI applications, including agents, LLMs, and ML models.
Skyvern-AI/skyvern
Automates complex browser-based workflows using LLMs and computer vision, providing a resilient and adaptive solution for web interaction.
hiyouga/LlamaFactory
A unified and efficient framework for fine-tuning over 100 large language models (LLMs) and vision-language models (VLMs) with both CLI and Web UI.
1Panel-dev/MaxKB
An open-source platform for building enterprise-grade AI agents, integrating RAG, robust workflows, and multi-modal capabilities to enhance smart Q&A and automate complex business scenarios.
Unstructured-IO/unstructured
An open-source ETL solution for transforming complex documents into clean, structured data formats, optimized for language models.
HKUDS/LightRAG
LightRAG is a simple and fast framework for Retrieval-Augmented Generation, designed to enhance LLM performance with external knowledge.
alibaba/MNN
MNN is a blazing-fast, lightweight deep learning inference engine optimized for high-performance on-device AI and Large Language Models.
run-llama/llama_index
LlamaIndex is an open-source framework designed to build intelligent agentic applications by connecting Large Language Models (LLMs) with private or custom data sources, focusing on document understanding and OCR.
ollama/ollama
Run open-source large language models locally on your machine with a simple CLI, REST API, and client libraries.
aiming-lab/MetaClaw
An AI agent framework that enables continuous learning and evolution from live conversations, requiring no GPU for deployment.
agentscope-ai/agentscope-java
An agent-oriented programming framework for Java, enabling developers to build production-ready LLM-powered applications with intelligent agents, ReAct reasoning, and robust control mechanisms.
MemMachine/MemMachine
An open-source, universal memory layer for AI agents, enabling persistent state management, learning, and recall across sessions for LLM-powered applications.
olimorris/codecompanion.nvim
Integrates Large Language Models (LLMs) and AI coding agents directly into Neovim for an enhanced, AI-powered development workflow.
khoj-ai/khoj
A self-hostable AI second brain that integrates with local/online LLMs to provide answers from your documents and the web, build custom agents, and automate research.
microsoft/autogen
A programming framework for building multi-agent AI applications that can operate autonomously or collaboratively with humans.
Canner/WrenAI
Wren AI is an open-source GenBI agent that transforms natural language questions into accurate SQL, charts, and BI insights, powered by a semantic layer for data governance and trustworthiness.
axolotl-ai-cloud/axolotl
A free and open-source framework designed for efficient fine-tuning of large language models.
jujumilk3/leaked-system-prompts
A curated repository of leaked system prompts from various large language model (LLM) services, valuable for research and understanding LLM behavior.
Narcooo/inkos
An autonomous AI agent designed for novel writing, capable of generating, auditing, and revising stories across diverse genres, with human oversight.
ThinkInAIXYZ/deepchat
DeepChat is an open-source desktop AI agent platform that unifies multiple LLMs, tools, and agents, providing a seamless experience for both cloud and local AI models.
mem0ai/mem0
Mem0 provides a universal, intelligent memory layer for AI agents, enabling personalized and context-rich interactions across various applications.
VectifyAI/PageIndex
PageIndex is a vectorless, reasoning-based RAG system that builds hierarchical tree indexes for human-like, context-aware document retrieval, outperforming traditional vector-based methods.
botpress/botpress
Botpress is an open-source platform designed for rapidly building, deploying, and managing next-generation AI chatbots and intelligent agents powered by large language models (LLMs) like OpenAI.
charmbracelet/crush
A terminal-based agentic coding assistant that integrates with various LLMs and developer tools to streamline coding workflows.
huggingface/chat-ui
An open-source SvelteKit web application providing a customizable chat interface for OpenAI-compatible Large Language Models, powering HuggingChat.
ItzCrazyKns/Vane
A privacy-focused, self-hosted AI answering engine that integrates local and cloud LLMs to deliver accurate, cited answers from diverse sources.
AlexsJones/llmfit
A terminal tool that detects your hardware and recommends optimal LLM models, providing performance benchmarks for local execution.
xorbitsai/inference
A unified, production-ready inference API for deploying and serving open-source language, speech, and multimodal AI models on various infrastructures.
InternLM/xtuner
A next-generation training engine optimized for ultra-large Mixture-of-Experts (MoE) models, offering superior efficiency and scalability.
datawhalechina/hello-agents
A comprehensive tutorial guiding users from foundational theories to practical implementation of AI-native intelligent agent systems.
wandb/wandb
An AI developer platform for tracking, visualizing, and managing machine learning models from experimentation to production.
llmware-ai/llmware
A unified Python framework for building local, private, and secure enterprise RAG pipelines using small, specialized LLMs and a comprehensive model catalog.
DayuanJiang/next-ai-draw-io
A Next.js web application that integrates AI capabilities with draw.io diagrams, enabling creation, modification, and enhancement through natural language commands.
mudler/LocalAI
An open-source AI engine enabling local execution of various AI models (LLMs, vision, voice, image, video) on diverse hardware, including CPU-only setups.
rasbt/LLMs-from-scratch
An educational project providing step-by-step code to build a ChatGPT-like Large Language Model (LLM) from scratch using PyTorch.
emcie-co/parlant
A production-ready framework for building controlled, consistent, and predictable customer-facing AI agent interactions with LLMs, optimized for context engineering.
mlc-ai/mlc-llm
A universal machine learning compiler and high-performance deployment engine for large language models, enabling native execution across diverse platforms.
FlagOpen/FlagEmbedding
FlagEmbedding (BGE) is a comprehensive toolkit offering state-of-the-art embedding models for efficient search, Retrieval-Augmented Generation (RAG), and various multimodal AI applications.
memodb-io/Acontext
Acontext is an open-source skill memory layer for AI agents that automatically captures learnings from agent runs and stores them as readable, editable, and shareable skill files.
icip-cas/PPTAgent
An agentic framework leveraging AI to autonomously generate and refine professional PowerPoint presentations with deep research integration and visual design capabilities.
campfirein/cipher
An open-source memory layer for AI coding agents, enhancing context, collaboration, and seamless integration across various IDEs and LLMs.
microsoft/graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) system designed to extract structured data from unstructured text using LLMs to enhance reasoning on private data.
ageerle/ruoyi-ai
An enterprise-grade, all-in-one AI application development framework supporting multi-vendor LLM integration, secure knowledge bases, visual workflow orchestration, and intelligent agent deployment.
alibaba/page-agent
An in-page JavaScript GUI agent that enables natural language control over web interfaces, streamlining complex workflows and enhancing web accessibility.
linyqh/NarratoAI
Leveraging AI models for one-click video commentary and editing, enabling efficient content creation.
mobile-next/mobile-mcp
A platform-agnostic Model Context Protocol (MCP) server enabling scalable mobile automation and development across iOS and Android devices, emulators, and simulators.
langchain-ai/langgraph
A low-level orchestration framework for building, managing, and deploying resilient, stateful AI agents using a graph-based approach.
p-e-w/heretic
Heretic is an AI model utility that automatically removes censorship and safety alignment from transformer-based language models without requiring expensive post-training.
shaxiu/XianyuAutoAgent
An AI-powered customer service bot for Xianyu, offering 24/7 automated support, multi-expert decision-making, smart negotiation, and context-aware conversations.
tambo-ai/tambo
A fullstack React SDK for building AI agents that dynamically render user interfaces based on natural language input.
ruc-datalab/DeepAnalyze
DeepAnalyze is an agentic LLM that autonomously performs data science tasks, generating professional analysis reports with a single click.
WEIFENG2333/VideoCaptioner
An AI-powered tool for comprehensive video subtitling, offering speech-to-text, intelligent segmentation, LLM-based optimization, translation, and video synthesis.
GiovanniPasq/agentic-rag-for-dummies
A modular framework built with LangGraph for developing advanced Agentic RAG systems, offering both learning materials and an extensible architecture.
yuruotong1/autoMate
An AI-driven local automation assistant that acts as a personal data warehouse and tool orchestrator for any LLM, enabling cross-vendor memory, file storage, and real-world actions.
Arindam200/awesome-ai-apps
A comprehensive collection of over 80 practical examples, tutorials, and recipes for building powerful LLM-powered applications.
xming521/WeClone
A comprehensive platform enabling users to create personalized AI twins by fine-tuning Large Language Models with their unique chat history, capturing individual style and bringing digital selves to life.
0xPlaygrounds/rig
A Rust library designed for building scalable, modular, and ergonomic applications powered by Large Language Models.
TurixAI/TuriX-CUA
TuriX empowers AI models to directly interact with and control your desktop, automating complex tasks across various applications with high accuracy.
TauricResearch/TradingAgents
A multi-agent LLM framework designed to simulate real-world trading firms, enabling collaborative evaluation and informed financial trading decisions.
luhengshiwo/LLMForEverybody
An accessible knowledge sharing platform for Large Language Models (LLMs), designed to help individuals understand complex concepts and excel in LLM-related job interviews.
langchain4j/langchain4j
An idiomatic, open-source Java library simplifying the integration of Large Language Models (LLMs) into JVM applications with unified APIs and a comprehensive toolbox.
Upsonic/Upsonic
A Python framework for building autonomous and traditional AI agents, offering robust tools, prebuilt components, and integrated OCR capabilities.
mishushakov/llm-scraper
A TypeScript library that leverages Large Language Models to extract structured data from any webpage.
meta-llama/llama-cookbook
An official guide and collection of recipes for building applications with the Llama model family, covering inference, fine-tuning, and RAG.
kyrolabs/awesome-langchain
A comprehensive and curated list of tools, projects, and resources built around the rapidly evolving LangChain framework.
huggingface/agents-course
A comprehensive, free online course from Hugging Face designed to teach the fundamentals and advanced techniques of building AI agents.
LazyAGI/LazyLLM
LazyLLM simplifies the creation and iterative optimization of multi-agent large language model (LLM) applications with a low-code approach.
neo4j-labs/llm-graph-builder
A powerful application that transforms diverse unstructured data sources into structured Neo4j Knowledge Graphs using Large Language Models (LLMs) and LangChain.
AgentOps-AI/agentops
A Python SDK and platform providing comprehensive observability, monitoring, and evaluation tools for AI agents, from prototype to production.
datawhalechina/all-in-rag
A comprehensive, full-stack guide to Retrieval Augmented Generation (RAG) technology for large language model application development, covering theory, practice, and engineering best practices.
microsoft/markitdown
A lightweight Python utility for converting diverse file formats and office documents into structured Markdown, optimized for Large Language Models (LLMs) and text analysis pipelines.
adongwanai/AgentGuide
A comprehensive, job-oriented guide for AI Agent development, covering core technologies, practical projects, and interview preparation for LLM-related roles.
NirDiamant/RAG_Techniques
A repository showcasing various advanced Retrieval-Augmented Generation (RAG) techniques through detailed notebook tutorials.
tmc/langchaingo
LangChain Go is a powerful framework that simplifies building large language model (LLM) applications in Go, leveraging composability.
run-llama/LlamaIndexTS
A data framework designed to integrate custom data with large language models in JavaScript/TypeScript environments.
ragapp/ragapp
An enterprise-grade platform for easily deploying agentic RAG applications in your own cloud infrastructure, offering an alternative to hosted solutions like OpenAI's custom GPTs.
run-llama/rags
Build and customize RAG pipelines and chatbots over your data using natural language, powered by Streamlit.
LearningCircuit/local-deep-research
An AI-powered research assistant for deep, agentic research, supporting local and cloud LLMs, multiple search sources, and ensuring data privacy with local, encrypted storage.
JetBrains/koog
A Kotlin-based multiplatform framework for building predictable, fault-tolerant, and enterprise-ready AI agents with idiomatic JVM/Kotlin APIs.
ArvinLovegood/go-stock
An AI-powered cross-platform desktop application for stock analysis, offering market data, news, financial insights, and stock picking features across multiple global markets.
miurla/morphic
An AI-powered search engine with a generative UI, offering quick and adaptive search modes, multiple AI and search providers, and robust deployment options.
crmne/ruby_llm
A unified Ruby API to interact with various Large Language Models (LLMs) from different providers, simplifying AI integration into Ruby and Rails applications.
icereed/paperless-gpt
An AI-powered add-on for paperless-ngx that leverages LLMs and advanced OCR to automate document title, tag, correspondent, and custom field generation, streamlining digital document management.
strands-agents/sdk-python
A Python SDK that simplifies the creation and deployment of AI agents using a model-driven approach, supporting diverse LLMs and advanced features.
RunanywhereAI/runanywhere-sdks
A production-ready toolkit enabling developers to integrate private, offline, and fast on-device AI capabilities like LLMs, speech-to-text, and text-to-speech into their applications across various platforms.
chatboxai/chatbox
A powerful desktop client for various Large Language Models (LLMs) like ChatGPT and Claude, offering a unified interface across multiple platforms.
gofireflyio/aiac
A command-line tool and library that generates Infrastructure-as-Code, configurations, and utilities using large language models.
CaviraOSS/OpenMemory
A self-hosted, local-first cognitive memory engine providing real long-term memory for LLM applications and AI agents, distinct from RAG or vector databases.
prism-php/prism
A Laravel package providing a fluent interface to integrate and manage Large Language Models (LLMs) from various AI providers.
vllm-project/semantic-router
A signal-driven intelligent router designed to optimize, secure, and adapt mixture-of-models for AI infrastructure across cloud, data center, and edge environments.
PaddlePaddle/FastDeploy
A high-performance inference and deployment toolkit for Large Language Models (LLMs) and Vision-Language Models (VLMs) based on PaddlePaddle.
LMCache/LMCache
LMCache is an LLM serving engine extension designed to significantly reduce Time-To-First-Token (TTFT) and boost throughput by intelligently reusing KV caches across various storage tiers and serving instances.
OpenRLHF/OpenRLHF
An easy-to-use, scalable, and high-performance open-source framework for Reinforcement Learning from Human Feedback (RLHF), leveraging Ray and vLLM for distributed training of LLMs and VLMs.
katanaml/sparrow
A production-ready platform for structured data extraction and instruction calling using ML, LLM, and Vision LLM technologies.
mostlygeek/llama-swap
llama-swap enables seamless hot-swapping and management of multiple local generative AI models, acting as a unified API gateway compatible with OpenAI and Anthropic standards.
OpenBMB/UltraRAG
A low-code MCP framework for building complex and innovative RAG pipelines, standardizing components and enabling precise orchestration.
apconw/Aix-DB
Aix-DB is an intelligent data analysis system leveraging large language models and RAG technology to transform natural language queries into data insights and visualizations.
zenml-io/zenml
An open-source AI platform that unifies ML pipelines and agentic workflows, abstracting infrastructure complexity for ML/AI engineers.
asgeirtj/system_prompts_leaks
A comprehensive collection of extracted system prompts, messages, and developer instructions from leading AI chatbots and coding assistants.
SWE-agent/mini-swe-agent
A radically simple yet highly performant AI agent for software engineering, capable of solving GitHub issues and assisting in command-line tasks.
chatchat-space/Langchain-Chatchat
An open-source, offline-deployable RAG and Agent application built with Langchain, supporting various LLMs for private, local knowledge base Q&A.
zilliztech/GPTCache
A semantic caching library for LLM queries, designed to drastically cut API costs and accelerate response times.
reworkd/AgentGPT
Assemble, configure, and deploy autonomous AI Agents directly in your browser.
liaokongVFX/LangChain-Chinese-Getting-Started-Guide
A comprehensive Chinese-language tutorial for LangChain, guiding developers to build powerful applications powered by large language models.
iusztinpaul/hands-on-llms
A hands-on course to learn LLMs, LLMOps, and vector databases by building, training, and deploying a real-time financial advisor LLM system.
TaskingAI/TaskingAI
An open-source BaaS platform that unifies hundreds of LLM models and provides comprehensive tools for developing, deploying, and managing AI-native applications and LLM-based agents.
shroominic/codeinterpreter-api
An open-source Python library providing a LangChain-compatible implementation of the ChatGPT Code Interpreter for sandboxed code execution.
rag-web-ui/rag-web-ui
An intelligent dialogue system leveraging RAG technology to build custom Q&A systems from diverse knowledge bases.
Mobile-Artificial-Intelligence/maid
A free and open-source Android application for local and remote interaction with various large language models, offering features like on-device inference and API key integration.
Giskard-AI/giskard-oss
An open-source Python library for comprehensive testing, evaluation, and red teaming of LLM agents and AI systems, designed for dynamic, multi-turn interactions.
stas00/ml-engineering
An open collection of methodologies, tools, and step-by-step instructions for successful training, fine-tuning, and inference of large language and multi-modal models.
truefoundry/cognita
A production-ready RAG framework that provides modular, API-driven components and a UI to streamline the deployment and management of Retrieval Augmented Generation applications.
plexe-ai/plexe
Build machine learning models from natural language prompts using an AI-powered multi-agent system.
PacktPublishing/LLM-Engineers-Handbook
A comprehensive practical guide and accompanying code repository for LLM engineers, covering the full lifecycle of building, deploying, and monitoring advanced LLM and RAG applications on AWS with LLMOps best practices.
tencentmusic/cube-studio
An open-source, cloud-native, all-in-one MLOps platform designed for the full lifecycle management of machine learning, deep learning, and large language model development and deployment.
bitsandbytes-foundation/bitsandbytes
A PyTorch library enabling accessible large language models through k-bit quantization, significantly reducing memory consumption for both inference and training.
ludwig-ai/ludwig
A low-code, declarative framework for building and deploying custom large language models (LLMs) and other deep neural networks with ease and efficiency.
camel-ai/owl
A cutting-edge framework for multi-agent collaboration to automate real-world tasks using AI.
xszyou/Fay
Fay is an AI agent framework designed to connect digital humans (2.5D, 3D, mobile, PC, web) and large language models (OpenAI compatible, DeepSeek) with various business systems.
AccumulateMore/CV
A comprehensive and curated collection of deep learning study notes, integrating content from leading educators like Andrew Ng, Li Mu, and TuDui, covering CV, NLP, and Large Language Models.
microsoft/RD-Agent
Automates data-driven AI R&D processes using AI agents, excelling in machine learning engineering tasks.
harry0703/MoneyPrinterTurbo
An AI-powered tool that generates high-definition short videos automatically from a given topic or keywords, including script, footage, subtitles, and background music.
CoplayDev/unity-mcp
Empowers AI assistants to directly interact with and automate tasks within the Unity Editor, streamlining game development workflows.
vas3k/TaxHacker
A self-hosted AI accounting application that automates expense and income tracking for freelancers and small businesses using LLMs to analyze financial documents.
simonw/llm
A command-line tool and Python library to interact with various large language models, both remote APIs and local models.
MiroMindAI/MiroThinker
An open-source deep research agent designed for complex research and prediction tasks, demonstrating state-of-the-art performance on various AI benchmarks.
aaif-goose/goose
An open source, extensible AI agent that automates complex engineering tasks from start to finish, working locally with any LLM.
coderamp-labs/gitingest
Gitingest transforms any Git repository into a structured, prompt-friendly text digest, optimized for large language models to understand codebases efficiently.
ScrapeGraphAI/Scrapegraph-ai
A Python library that leverages LLMs and graph logic to simplify web scraping and data extraction from various sources.
HKUDS/nanobot
An ultra-lightweight, open-source personal AI agent platform designed for efficiency and broad compatibility across various LLM providers and communication channels.
NirDiamant/Prompt_Engineering
A comprehensive GitHub repository offering 22 hands-on Jupyter Notebook tutorials on prompt engineering techniques, from basic to advanced, for leveraging large language models.
TheR1D/shell_gpt
A command-line tool powered by AI large language models to quickly generate shell commands, code snippets, and documentation, significantly boosting developer productivity.
google/langextract
A Python library leveraging LLMs to extract structured information from unstructured text with precise source grounding and interactive visualization.
u14app/deep-research
An AI-powered web application for generating comprehensive, privacy-focused deep research reports using various LLMs and web search.
livekit/agents
A framework for building realtime, programmable voice AI agents that can see, hear, and understand.
WangRongsheng/awesome-LLM-resources
A comprehensive, continuously updated collection of the best resources for Large Language Models (LLMs), covering various aspects from data processing to advanced applications.
mlc-ai/web-llm
A high-performance, in-browser LLM inference engine with OpenAI API compatibility, leveraging WebGPU for local, private AI.
JuliusBrussee/caveman
A plugin that dramatically reduces LLM token usage by making AI agents communicate in a concise, 'caveman-like' style, while preserving full technical accuracy.
Pythagora-io/gpt-pilot
GPT Pilot is an AI developer companion designed to build complete, production-ready applications, automating up to 95% of the coding process with human oversight.
cheahjs/free-llm-api-resources
A comprehensive list of free and trial-based LLM inference resources accessible via API.
plastic-labs/honcho
An open-source memory library and managed service for building stateful AI agents that learn and adapt over time.
genkit-ai/genkit
A Google-built open-source framework simplifying the development and deployment of full-stack AI applications across JavaScript, Go, and Python.
pingcap/autoflow
An open-source, graph RAG-based conversational knowledge base tool built with TiDB Serverless Vector Storage, offering intelligent search and instant answers.
volcengine/MineContext
MineContext is an open-source, proactive context-aware AI partner that enhances productivity by understanding your digital world and delivering timely insights.
pathwaycom/llm-app
A framework providing ready-to-deploy templates for building scalable, high-accuracy RAG and AI enterprise search applications with live data synchronization.
liyupi/yu-ai-agent
A comprehensive AI development tutorial using Spring Boot 3 and Spring AI to build AI applications and autonomous agents, enhancing developers' AI skills and career competitiveness.
zilliztech/deep-searcher
DeepSearcher is an open-source platform that leverages LLMs and vector databases to enable deep research, intelligent Q&A, and comprehensive reporting on private enterprise data.
decodingai-magazine/llm-twin-course
A free, hands-on course to build a production-ready LLM & RAG system, including a personalized AI replica, applying LLMOps best practices.
athina-ai/rag-cookbooks
A comprehensive repository offering practical implementations and evaluation guidance for advanced and agentic Retrieval-Augmented Generation (RAG) techniques.
NVIDIA-NeMo/Curator
A GPU-accelerated, scalable toolkit for multimodal data preprocessing and curation, designed to train better AI models faster.
AI-Hypercomputer/maxtext
A high-performance, scalable JAX-based open-source library for training large language models on Google Cloud TPUs and GPUs.
oumi-ai/oumi
An end-to-end platform for fine-tuning, evaluating, and deploying open-source Large Language Models (LLMs) and Vision Language Models (VLMs).
ConardLi/easy-dataset
Easy Dataset is a powerful application for creating high-quality datasets for LLM fine-tuning, RAG, and model evaluation, featuring intelligent document processing and a comprehensive evaluation system.
h2oai/h2o-llmstudio
A no-code GUI and framework for easily fine-tuning state-of-the-art large language models (LLMs).
bespokelabsai/curator
A Python library for generating and curating high-quality synthetic data for AI model training and structured data extraction.
smallcloudai/refact
An open-source AI Agent that automates end-to-end software engineering tasks by integrating with developer tools, planning, executing, and iterating for successful results.
FunAudioLLM/CosyVoice
CosyVoice is an advanced multi-lingual large language model-based text-to-speech system offering state-of-the-art voice generation, cloning, and full-stack deployment capabilities.
LLMBook-zh/LLMBook-zh.github.io
A comprehensive Chinese technical book on Large Language Models, offering a systematic framework and roadmap for beginners with a deep learning background, authored by leading experts.
ashishpatel26/LLM-Finetuning
A collection of guides and code for efficiently fine-tuning large language models using PEFT (LoRA) and Hugging Face transformers.
eosphoros-ai/DB-GPT-Hub
A specialized hub providing models, datasets, and fine-tuning techniques to enhance Large Language Models' performance in Text-to-SQL, Text-to-NLU, and Text-to-GQL tasks.
ModelCloud/GPTQModel
A toolkit for quantizing (compressing) Large Language Models (LLMs) with hardware acceleration across various GPUs and CPUs, integrating with popular inference frameworks.
yangjianxin1/Firefly
Firefly is an open-source, all-in-one tool designed for efficient pre-training, instruction fine-tuning, and DPO of a wide range of mainstream large language models, optimized for resource-constrained environments.
mymusise/ChatGLM-Tuning
A cost-effective solution for finetuning ChatGLM-6B with LoRA, enabling personalized large language models.
hiyouga/ChatGLM-Efficient-Tuning
An efficient framework for fine-tuning ChatGLM-6B and ChatGLM2-6B models using PEFT methods, including LoRA, QLoRA, and RLHF, with a Web UI.
transformerlab/transformerlab-app
An open-source platform designed for AI researchers to unify fragmented ML tooling, enabling seamless training, evaluation, and scaling of models from local hardware to GPU clusters.
datawhalechina/self-llm
A comprehensive Linux-based guide for beginners to quickly fine-tune and deploy open-source LLMs and MLLMs, tailored for Chinese learners.
ymcui/Chinese-LLaMA-Alpaca
An open-source project providing Chinese LLaMA and instruction-tuned Alpaca large language models, optimized for Chinese NLP and local deployment on CPU/GPU.
microsoft/LoRA
A Python library implementing LoRA (Low-Rank Adaptation) to efficiently fine-tune large language models by significantly reducing trainable parameters and storage requirements.
LianjiaTech/BELLE
BELLE is an open-source project dedicated to fostering the development of Chinese conversational large language models, aiming to make LLMs accessible to everyone.
JIA-Lab-research/LongLoRA
LongLoRA is an efficient fine-tuning method and associated models/datasets designed to extend the context window of Large Language Models (LLMs) for processing longer inputs.
wenge-research/YAYI
YaYi is an open-source Chinese Large Language Model, built on LLaMA 2 & BLOOM, designed for secure, reliable, and domain-specific applications through extensive instruction tuning.
natolambert/rlhf-book
A comprehensive open-source textbook and code repository dedicated to Reinforcement Learning from Human Feedback (RLHF) and post-training language models.
Gen-Verse/OpenClaw-RL
An asynchronous reinforcement learning framework enabling personalized AI agent training through natural language conversations and scalable real-world deployments.
argilla-io/distilabel
Distilabel is a framework for generating synthetic data and AI feedback, enabling engineers to build fast, reliable, and scalable AI pipelines based on verified research.
PKU-Alignment/align-anything
A modular framework for aligning any-modality large models with human intentions and values using various fine-tuning and reinforcement learning methods.
PKU-Alignment/safe-rlhf
A modular open-source framework for training constrained value-aligned Large Language Models (LLMs) using Safe Reinforcement Learning from Human Feedback (RLHF).
InternLM/InternLM
A series of high-performance, cost-effective open-source large language models (LLMs) designed for general-purpose usage and advanced reasoning.
xtreme1-io/xtreme1
An all-in-one open-source platform for multimodal data labeling and annotation, supporting 3D LiDAR, image, and LLM training data with AI-fueled tools.
RUCAIBox/LLMSurvey
A comprehensive collection of papers and resources on Large Language Models, based on an authoritative survey paper.
LAION-AI/Open-Assistant
An open-source, chat-based large language model project aimed at democratizing access to powerful AI assistants through community-driven data collection.
OpenLMLab/MOSS-RLHF
An open-source framework providing code, models, and insights for stable Reinforcement Learning from Human Feedback (RLHF) training in Large Language Models, focusing on the PPO algorithm and reward modeling.
promptslab/Awesome-Prompt-Engineering
A comprehensive, hand-curated collection of resources for Prompt Engineering and Context Engineering, specifically for Large Language Models.
ai-boost/awesome-prompts
A comprehensive curated collection of prompts, frameworks, and research papers for advanced prompt engineering and LLM interaction.
elder-plinius/CL4R1T4S
A public repository of leaked and extracted system prompts from major AI models and agents, promoting transparency and observability in AI systems.
dottxt-ai/outlines
Outlines is a Python library that guarantees structured outputs from Large Language Models (LLMs) during generation, eliminating the need for post-processing and ensuring data validity.
mufeedvh/code2prompt
A powerful CLI tool and ecosystem to convert entire codebases into structured, token-counted prompts for Large Language Models.
DSXiangLi/DecryptPrompt
A comprehensive open-source project summarizing research papers, models, datasets, and applications in Prompt Engineering, Large Language Models (LLMs), and AIGC.
AI4Finance-Foundation/FinRobot
An open-source AI agent platform leveraging large language models for automated financial analysis, investment research, and algorithmic trading.
promptslab/Promptify
A Python library for structured NLP tasks using LLMs, offering Pydantic outputs, multi-provider support, and built-in evaluation.
algorithmicsuperintelligence/optillm
OptiLLM is an OpenAI API-compatible inference proxy that uses 20+ state-of-the-art techniques to significantly boost LLM accuracy and performance on reasoning tasks without requiring any training.
dair-ai/Prompt-Engineering-Guide
A comprehensive, open-source guide and resource hub for prompt engineering, context engineering, RAG, and AI Agents, designed to help developers and researchers master LLM interactions.
SamurAIGPT/AI-Youtube-Shorts-Generator
An AI-powered Python tool that automatically generates engaging YouTube Shorts from long-form videos by identifying viral-worthy moments and vertically cropping them.
Hunyuan-PromptEnhancer/PromptEnhancer
A prompt rewriting tool that refines user prompts into clearer, structured versions to enhance the quality of text-to-image generation and image-to-image editing.
NVIDIA-NeMo/NeMo
A scalable generative AI framework for researchers and developers focused on Large Language Models, Multimodal, and Speech AI (ASR, TTS).
krillinai/KrillinAI
An AI-powered tool for one-click video translation and dubbing across 100 languages, optimized for major social media platforms.
vllm-project/vllm-omni
A framework for efficient, fast, and cheap serving of omni-modality (text, image, video, audio) AI models.
NVIDIA-NeMo/DataDesigner
A flexible framework by NVIDIA NeMo for generating high-quality synthetic datasets with diverse distributions, meaningful correlations, and robust validation.
kyegomez/BitNet
A PyTorch implementation of BitNet, enabling highly efficient 1-bit transformers for large language models.
2U1/Qwen-VL-Series-Finetune
An open-source implementation for efficiently fine-tuning Alibaba Cloud's Qwen-VL series of multimodal large language models using HuggingFace and Liger-Kernel.
intel/auto-round
AutoRound is an advanced quantization toolkit for Large Language Models (LLMs) and Vision-Language Models (VLMs), enabling high-accuracy, ultra-low-bit inference across diverse hardware.
alichherawalla/off-grid-mobile-ai
A privacy-first, offline AI suite for mobile and desktop, enabling chat, image generation, vision AI, and more without internet access.
withcatai/node-llama-cpp
A Node.js binding for llama.cpp, enabling local execution of large language models with advanced features like JSON schema enforcement and function calling.
sammcj/gollama
A TUI (Text User Interface) tool for macOS and Linux to efficiently manage Ollama large language models.
GaiZhenbiao/ChuanhuChatGPT
A feature-rich web GUI for ChatGPT and various LLMs, offering advanced functionalities like agents, file-based QA, web search, and finetuning with a refined user experience.
CommandCodeAI/langui
An open-source collection of beautifully designed Tailwind CSS components specifically crafted for building user interfaces for AI, GPT, and LLM projects.
Marker-Inc-Korea/AutoRAG
An open-source framework that automates the evaluation and optimization of Retrieval-Augmented Generation (RAG) pipelines using AutoML-style automation for specific datasets.
Kong/kong
A high-performance, extensible API and AI Gateway for orchestrating microservices, LLM, and MCP traffic.
julep-ai/julep
An open-source, serverless platform for building and deploying complex, agent-based AI workflows with persistent memory and tool orchestration.
inkeep/agents
A versatile platform for building and deploying AI agents and multi-agent workflows using a no-code visual builder or a TypeScript SDK, featuring full 2-way synchronization.
rohitg00/pro-workflow
Pro Workflow enhances AI coding assistants like Claude Code with self-correcting memory, enabling them to learn from corrections and improve over time, significantly reducing repetitive guidance.
postgresml/postgresml
PostgresML integrates machine learning and AI capabilities, including LLMs and vector search, directly into PostgreSQL, leveraging GPUs for high-performance in-database inference.
sentient-agi/OML-1.0-Fingerprinting
A framework for embedding secret cryptographic fingerprints into Large Language Models (LLMs) via fine-tuning to verify ownership and prevent unauthorized use.
liucongg/ChatGLM-Finetuning
A toolkit for finetuning ChatGLM series models (ChatGLM-6B, ChatGLM2-6B, ChatGLM3-6B) using various methods like Freeze, Lora, P-tuning, and full parameter training for downstream NLP tasks.
PhoebusSi/Alpaca-CoT
A unified platform simplifying instruction-tuning for Large Language Models by integrating diverse data, LLMs, and parameter-efficient methods.
chenking2020/FindTheChatGPTer
A curated directory of open-source alternatives to ChatGPT and GPT-4, encompassing text and multimodal large language models, designed to assist users in navigating the AI landscape.
FellouAI/eko
A production-ready JavaScript framework for building reliable agentic workflows with natural language, supporting both computer and browser environments.
swyxio/ai-notes
A curated collection of notes and resources for software engineers to quickly get up to speed on new AI developments, focusing on generative AI and large language models.
hegelai/prompttools
An open-source, self-hostable toolkit for testing, experimenting with, and evaluating prompts, large language models (LLMs), and vector databases.
ashishps1/learn-ai-engineering
A comprehensive, curated collection of free resources for learning AI, Machine Learning, LLMs, and AI Engineering from scratch.
langgptai/LangGPT
LangGPT is a structured, reusable prompt design framework that transforms chaotic prompt engineering into a systematic, template-based methodology for creating high-quality prompts for Large Language Models.
canopyai/Orpheus-TTS
Orpheus TTS is a state-of-the-art open-source text-to-speech system built on a Llama-3b backbone, aiming to generate human-sounding, emotionally rich speech with low latency.
PeterH0323/Streamer-Sales
Streamer-Sales is an AI large language model designed for live streaming sales, generating compelling product descriptions and integrating advanced features like digital human generation, RAG, TTS, ASR, and Agent capabilities.
InternLM/HuixiangDou
An LLM-based professional knowledge assistant designed to provide accurate technical support in group chat scenarios without message flooding.
kyegomez/tree-of-thoughts
A plug-and-play Python library implementing the Tree of Thoughts algorithm to significantly enhance Large Language Model reasoning capabilities.
atfortes/Awesome-LLM-Reasoning
A meticulously curated collection of academic papers and resources focused on enhancing and understanding the reasoning abilities of Large Language Models (LLMs) and Multimodal Large Language Models (MLLMs).
X-PLUG/mPLUG-Owl
A family of powerful multi-modal large language models (MLLMs) designed to advance AI's understanding and generation capabilities across various data types.
xtekky/gpt4free
GPT4Free (g4f) is a community-driven project that aggregates various free and accessible LLM providers, offering a unified API, clients, and GUI for flexible AI model interaction.
SciSharp/LLamaSharp
A C#/.NET library for efficient local execution of Large Language Models (LLMs) like LLaMA and LLAVA, leveraging llama.cpp.
Chevey339/kelivo
A versatile Flutter-based LLM chat client supporting multiple AI providers and platforms with a modern design.
SciSharp/BotSharp
BotSharp is an open-source .NET framework for building sophisticated AI multi-agent applications, enabling seamless integration of Large Language Models into enterprise business systems.
The-Pocket/PocketFlow
A minimalist, 100-line LLM framework designed for building AI agents and workflows with zero bloat and maximum expressiveness.
claraverse-space/ClaraVerse
ClaraVerse is an open-source, privacy-focused AI ecosystem designed to replace commercial AI services by allowing users to host their own LLMs, keys, and compute, offering a unified workspace across desktop and mobile.
coleam00/local-ai-packaged
A Docker Compose template for quickly bootstrapping a self-hosted local AI and low-code development environment with integrated tools like Ollama, Supabase, and n8n.
n8n-io/self-hosted-ai-starter-kit
An open-source Docker Compose template for quickly setting up a secure, self-hosted AI and low-code development environment.
nanobrowser/nanobrowser
An open-source Chrome extension for AI-powered web automation, enabling multi-agent workflows with user-provided LLM API keys as a free alternative to commercial solutions.
WeThinkIn/AIGC-Interview-Book
An ultimate interview guide for AIGC/LLM/AI Agent algorithm and development engineers, covering core AI knowledge and practical experience.
NexaAI/nexa-sdk
A high-performance SDK enabling day-0 local inference of frontier LLMs and VLMs across diverse hardware (NPU, GPU, CPU) and platforms (PC, mobile, IoT) with minimal energy.
mylxsw/aidea
An open-source, cross-platform application integrating mainstream large language models and image generation models for unified AI interaction.
nichtdax/awesome-totally-open-chatgpt
A curated list of truly open-source alternatives to ChatGPT, featuring instruction-finetuned language models for chat.
karthink/gptel
A simple, extensible Emacs client for interacting with various Large Language Models directly within the editor.
memovai/mimiclaw
MimiClaw transforms a $5 ESP32-S3 chip into a personal AI assistant, offering Telegram integration, local memory, and ultra-low power consumption without a traditional OS.
LearnPrompt/LearnPrompt
A permanently free and open-source AIGC course platform covering prompt engineering, generative AI tools like ChatGPT, Midjourney, Stable Diffusion, and advanced topics such as LLM fine-tuning and AI agents.
stitionai/devika
An open-source AI agent that acts as a software engineer, capable of understanding instructions, planning, researching, and writing code to build software.
OpenGVLab/InternVL
A pioneering open-source multimodal large language model family aiming to match or exceed commercial models like GPT-4o/GPT-5 in performance.
shanraisshan/claude-code-best-practice
A comprehensive repository of best practices and implementation examples for developing with Claude Code's agentic engineering features.
wassim249/fastapi-langgraph-agent-production-ready-template
A production-ready FastAPI template for building scalable and secure AI agent applications with LangGraph, handling common infrastructure challenges.
trustgraph-ai/trustgraph
A graph-native platform for storing, enriching, and retrieving structured knowledge to power AI applications and intelligent agents.
FireRedTeam/FireRed-OpenStoryline
FireRed-OpenStoryline is an AI video editing agent that transforms manual editing into intention-driven directing through natural language interaction and LLM-powered planning.
campfirein/byterover-cli
ByteRover CLI provides AI coding agents with persistent, structured memory, enabling developers to curate, version, and share project knowledge across tools and teams.
awslabs/agentcore-samples
Amazon Bedrock AgentCore provides a framework-agnostic and model-agnostic infrastructure for securely deploying and operating advanced AI agents at scale, with this repository offering practical samples and tutorials.
mozilla-ai/llamafile
Distribute and run LLMs with a single file, no installation required.
ggozad/oterm
A terminal client for interacting with Ollama and other LLM providers, offering a simple UI and persistent chat sessions.
llm-workflow-engine/llm-workflow-engine
A powerful command-line interface and workflow manager designed to streamline interaction with various Large Language Models, including ChatGPT and GPT-4.
NeoVertex1/SuperPrompt
A sophisticated prompt engineering framework designed to guide LLMs, particularly Claude, into deeper, more novel, and "outside-the-box" thinking using holographic metadata and structured reasoning tags.
SWE-agent/SWE-agent
An AI agent that autonomously fixes GitHub issues, finds cybersecurity vulnerabilities, and performs coding tasks using large language models.
SylphAI-Inc/AdalFlow
AdalFlow is a PyTorch-like open-source library designed to build and automatically optimize large language model (LLM) applications, from chatbots and RAG systems to complex AI agents.
ollama/ollama-python
A Python library providing the easiest way to integrate Python 3.8+ projects with Ollama for local and cloud LLM interactions.
qualcomm/nexa-sdk
A high-performance SDK enabling day-0 local inference of frontier LLMs and VLMs across diverse hardware (NPU, GPU, CPU) and platforms (PC, mobile, IoT) with minimal energy.
Lightning-AI/litgpt
A high-performance, no-abstraction toolkit providing recipes for pretraining, finetuning, and deploying over 20 large language models at scale.
mahseema/awesome-ai-tools
A comprehensive, curated list of top Artificial Intelligence tools, covering various categories from generative AI to LLMs and specialized applications.
haotian-liu/LLaVA
An open-source large language and vision assistant (LLaVA) that achieves GPT-4V level multimodal capabilities through visual instruction tuning.
microsoft/torchscale
A PyTorch library providing advanced foundation architectures to efficiently and effectively scale Transformers for large language models and general-purpose AI.
luban-agi/Awesome-AIGC-Tutorials
A comprehensive, curated collection of tutorials and resources for Artificial Intelligence Generated Content (AIGC), encompassing Large Language Models, AI Painting, and related AI fields.
ucbepic/docetl
DocETL is an agentic LLM-powered framework designed for building and executing complex data processing and ETL pipelines, especially for documents.
microsoft/TypeChat
A library that simplifies building robust natural language interfaces by leveraging types for schema-driven LLM interactions, replacing complex prompt engineering.
Cinnamon/kotaemon
An open-source, customizable RAG UI and framework for secure, multi-modal document Q&A with various LLM support.
cocoindex-io/cocoindex
CocoIndex is an incremental data indexing framework that provides continuously fresh context from diverse enterprise data sources for AI agents and LLM applications.
Open-LLM-VTuber/Open-LLM-VTuber
An open-source, cross-platform AI companion featuring real-time voice interaction, visual perception, and a Live2D avatar, running entirely offline.