Tags: #ai
langgenius/dify
A production-ready platform designed for developing and deploying agentic AI workflows and applications.
ray-project/ray
A unified framework for scaling AI and Python applications from a laptop to a cluster, providing a distributed runtime and AI libraries.
OpenHands/OpenHands
An AI-driven development platform providing an SDK, CLI, GUI, and cloud services to build, run, and scale autonomous software agents for various development tasks.
n8n-io/n8n
A fair-code workflow automation platform offering visual building, custom code, native AI, and extensive integrations for technical teams to self-host or use in the cloud.
mastra-ai/mastra
A TypeScript framework for building, tuning, and scaling reliable AI-powered applications and autonomous agents, integrating seamlessly with modern web stacks.
langchain-ai/langchain
LangChain is an agent engineering platform and framework for building LLM-powered applications by chaining interoperable components and integrating diverse data sources.
libukai/awesome-agent-skills
The ultimate guide to AI Agent Skills, providing quick starts, curated resources, and practical tools to enhance AI's specialized capabilities and streamline task automation.
code-yeongyu/oh-my-openagent
An open-source AI agent harness designed to orchestrate multiple large language models for efficient code generation and complex task automation, avoiding vendor lock-in.
blakeblackshear/frigate
A complete and local Network Video Recorder (NVR) with real-time AI object detection for IP cameras, tightly integrated with Home Assistant.
docling-project/docling
A Python library designed to simplify the processing and parsing of diverse document formats, preparing them for seamless integration with generative AI ecosystems.
koala73/worldmonitor
A real-time, AI-powered global intelligence dashboard offering unified situational awareness through news aggregation, geopolitical monitoring, and infrastructure tracking.
screenpipe/screenpipe
An open-source, local-first AI memory assistant that continuously captures your screen and audio to provide a searchable, automated record of your digital activity.
google-gemini/gemini-cli
An open-source AI agent that brings the power of Google Gemini models, including advanced reasoning and large context windows, directly into your terminal for enhanced developer productivity.
zeroclaw-labs/zeroclaw
A fast, small, and fully autonomous personal AI assistant infrastructure designed to run locally on any device with minimal resources.
chroma-core/chroma
An open-source vector database designed as data infrastructure for AI applications, simplifying embedding management and semantic search.
cloudwego/eino
A Golang framework for building LLM applications, offering reusable components, an Agent Development Kit, and flexible composition for complex AI workflows.
Mintplex-Labs/anything-llm
An all-in-one, privacy-first AI application for chatting with documents and automating workflows using AI agents, designed for easy local deployment.
linshenkx/prompt-optimizer
A powerful AI prompt optimization tool designed to enhance AI output quality by intelligently refining prompts across various models and platforms.
CopilotKit/CopilotKit
A full-stack SDK for building agent-native applications with generative UI, shared state, and human-in-the-loop workflows.
apache/hertzbeat
An AI-powered open-source real-time observability system for unified metrics, logs, alerting, and notification.
continuedev/continue
Automate code quality and security checks with AI agents directly in your CI/CD pipeline, enforced on every pull request.
MODSetter/SurfSense
SurfSense is an open-source, privacy-focused alternative to NotebookLM, offering unlimited data, configurable AI models, and real-time team collaboration for enhanced knowledge management.
browserbase/stagehand
Stagehand is an SDK that combines AI and code for building reliable, flexible, and self-healing web automations.
web-infra-dev/midscene
An AI-powered, vision-driven UI automation framework for every platform, enabling natural language control and scripting.
open-webui/open-webui
A user-friendly, extensible, and feature-rich self-hosted AI platform supporting various LLM runners like Ollama and OpenAI-compatible APIs, with built-in RAG capabilities.
Lightning-AI/pytorch-lightning
A deep learning framework that simplifies PyTorch development by automating boilerplate engineering code, enabling scalable training from CPU to multi-node GPUs with minimal code changes.
milvus-io/milvus
A high-performance, cloud-native vector database built for scalable Approximate Nearest Neighbor (ANN) search and AI applications.
Anionex/banana-slides
An AI-native application that transforms ideas into professional, visually appealing presentations in minutes, eliminating tedious manual design and enabling natural language modifications.
bentoml/BentoML
A Python library for building and deploying high-performance AI model inference APIs and multi-model serving systems with ease.
ValueCell-ai/ClawX
ClawX bridges the gap between powerful AI agents and everyday users by providing an intuitive desktop interface for CLI-based AI orchestration.
YaoApp/yao
Yao is a single-binary, full-stack runtime designed for building and deploying proactive, event-driven autonomous agents without requiring Python or Node.js.
khoj-ai/khoj
A self-hostable AI second brain that integrates with various LLMs and data sources to provide personalized answers, automate research, and create custom AI agents.
carla-simulator/carla
An open-source simulator providing a flexible platform and digital assets for autonomous driving research, development, and validation.
elie222/inbox-zero
An open-source AI personal assistant designed to automate email management, draft replies, and help users achieve inbox zero efficiently.
sansan0/TrendRadar
An AI-driven platform for multi-platform public opinion and trend monitoring, offering smart alerts and news aggregation to combat information overload.
codexu/note-gen
A cross-platform Markdown AI note-taking application that leverages AI to transform fragmented knowledge into organized, readable notes.
RightNow-AI/openfang
An open-source, Rust-built operating system for truly autonomous agents that work 24/7 on schedules, performing complex tasks without human prompting.
Netflix/metaflow
A human-centric Python framework for building, managing, and deploying real-life AI/ML systems from rapid prototyping to reliable production.
ThinkInAIXYZ/deepchat
DeepChat is an open-source desktop AI agent platform that unifies various large language models, tools, and agents for a seamless and private AI interaction experience.
VectifyAI/PageIndex
PageIndex is a vectorless, reasoning-based RAG system that builds a hierarchical tree index from long documents for agentic, context-aware retrieval, simulating human expert navigation.
explosion/spaCy
An industrial-strength Python library for advanced Natural Language Processing, designed for building real-world applications.
udecode/plate
A rich-text editor framework built with AI capabilities and shadcn/ui components, offering a highly customizable and efficient editing experience.
ItzCrazyKns/Vane
Vane is a privacy-focused, self-hosted AI answering engine that integrates local and cloud LLMs with web search to deliver accurate, cited answers.
OpenDCAI/Paper2Any
An AI-driven platform that transforms research papers, text, or topics into editable scientific figures, technical diagrams, and presentation slides with universal file support.
usestrix/strix
Strix leverages autonomous AI agents to dynamically find, validate, and help fix application vulnerabilities, acting like real hackers to provide fast and accurate security testing.
DayuanJiang/next-ai-draw-io
An AI-powered web application that integrates with draw.io, enabling users to create, modify, and enhance diagrams using natural language commands and AI-assisted visualization.
dyad-sh/dyad
A local, open-source AI app builder providing privacy, speed, and full user control as an alternative to cloud-based platforms.
icip-cas/PPTAgent
An AI-powered agentic framework for reflective and autonomous PowerPoint presentation generation, integrating deep research and visual design.
campfirein/cipher
An open-source memory layer for AI coding agents, enhancing context, collaboration, and seamless integration across various IDEs and LLMs.
facefusion/facefusion
An industry-leading open-source platform for advanced face manipulation, offering powerful features for deepfake creation and processing.
golutra/golutra
golutra transforms existing CLI tools into a unified multi-agent AI collaboration hub, enabling parallel execution, automated orchestration, and real-time result tracking.
ageerle/ruoyi-ai
A one-stop enterprise-grade AI application development framework supporting multi-vendor LLM integration, secure knowledge bases, visual workflow orchestration, and multi-agent collaboration to rapidly build AI agent applications.
upscayl/upscayl
Upscayl is the #1 free and open-source AI image upscaler for Linux, macOS, and Windows, designed to enlarge and enhance low-resolution images.
linyqh/NarratoAI
An AI-powered tool that automates video commentary and editing, offering a one-stop solution for scriptwriting, automated video editing, voiceover, and subtitle generation to speed up content creation.
ahmedkhaleel2004/gitdiagram
Turn any GitHub repository into an interactive, AI-generated system design diagram for instant visualization and navigation.
p-e-w/heretic
A tool for automatically removing censorship and safety alignment from transformer-based language models without expensive post-training.
casibase/casibase
An open-source, enterprise-grade AI Cloud OS providing a knowledge base and a management platform for MCP (Model Context Protocol) and A2A (agent-to-agent), complete with admin UI, user management, and SSO.
photoprism/photoprism
An AI-powered, privacy-focused, and self-hosted photo management application designed to automatically organize and browse your entire photo and video collection.
0xPlaygrounds/rig
A Rust library designed for building scalable, modular, and ergonomic applications powered by Large Language Models.
Leonxlnx/taste-skill
Taste Skill enhances AI-generated frontend code, transforming generic outputs into modern, premium designs with proper aesthetics and animations.
metorial/metorial
An open-source platform enabling AI models to connect with thousands of APIs and data sources via a single function call, simplifying agentic AI development.
TauricResearch/TradingAgents
A multi-agent LLM framework designed for financial trading research, simulating real-world trading firm dynamics for market analysis and decision-making.
langchain4j/langchain4j
LangChain4j is an open-source Java library that simplifies the integration of Large Language Models (LLMs) into Java applications.
FlowiseAI/Flowise
Visually build and deploy custom AI agents and LLM applications with a drag-and-drop interface.
kyegomez/swarms
An enterprise-grade, production-ready framework for orchestrating complex multi-agent systems, designed for scalable AI applications and seamless integration.
Arize-ai/phoenix
An open-source platform for comprehensive AI/ML model observability, evaluation, and debugging.
mayooear/ai-pdf-chatbot-langchain
A customizable AI chatbot template built with LangChain and LangGraph, enabling users to ingest PDF documents, store embeddings, and answer queries using an LLM.
meta-llama/llama-cookbook
A comprehensive guide and collection of recipes for building with the Llama model family, covering inference, fine-tuning, RAG, and end-to-end use cases.
argilla-io/argilla
Argilla is a collaboration tool for AI engineers and domain experts to build and maintain high-quality datasets for various AI models, from NLP to LLMs and multimodal systems.
datawhalechina/all-in-rag
A comprehensive, full-stack guide to Retrieval-Augmented Generation (RAG) technology, covering theory, practice, and engineering best practices for building LLM applications.
datawhalechina/llm-universe
A beginner-friendly tutorial for LLM application development, focusing on building a personal knowledge base assistant using cloud services.
run-llama/LlamaIndexTS
A deprecated JavaScript/TypeScript data framework designed to integrate custom data with large language models across various JS runtime environments.
run-llama/rags
A Streamlit app to build and query custom Retrieval-Augmented Generation (RAG) pipelines over your data using natural language.
LearningCircuit/local-deep-research
An AI-powered research assistant for deep, agentic research, emphasizing local control, privacy, and multi-source information synthesis.
control-theory/gonzo
Gonzo is a Go-based terminal UI for real-time log analysis, offering interactive dashboards, AI-powered insights, and advanced filtering capabilities.
ArvinLovegood/go-stock
An AI-driven desktop tool for multi-market stock analysis, picking, and real-time alerts, integrating various large language models.
miurla/morphic
Morphic is an AI-powered search engine that provides dynamic, generative user interfaces for enhanced search experiences, integrating various AI models and search providers.
icereed/paperless-gpt
An AI-powered utility that integrates with paperless-ngx to enhance document digitalization through LLM-enhanced OCR, automatic titling, tagging, and field extraction.
OvidijusParsiunas/deep-chat
A highly customizable AI chatbot component designed for easy integration into any website or UI framework.
JerryZLiu/Dayflow
Dayflow is an open-source, local-first macOS app that automatically generates a private, context-aware timeline of your daily activities using AI, helping you understand how your time is truly spent.
gofireflyio/aiac
An AI-driven command-line tool and library for generating Infrastructure-as-Code, configurations, CI/CD pipelines, and various code snippets using Large Language Models.
clusterzx/paperless-ai
An AI-powered extension for Paperless-ngx that automates document classification, smart tagging, and enables semantic search using various AI models.
deta/surf
Deta Surf is an open-source, local-first AI notebook designed to unify research, note-taking, and knowledge synthesis across diverse media types.
prism-php/prism
A Laravel package providing a fluent interface to integrate and manage Large Language Models (LLMs) from various AI providers.
containers/ramalama
RamaLama simplifies the local serving and production inference of AI models from any source by leveraging familiar container patterns, eliminating complex host system configurations.
vllm-project/semantic-router
A signal-driven intelligent router designed to optimize the efficiency, safety, and adaptability of multi-model AI systems across various environments.
mostlygeek/llama-swap
A high-performance Go-based proxy for hot-swapping and managing multiple local generative AI models compatible with OpenAI and Anthropic APIs.
weaviate/weaviate
An open-source, cloud-native vector database enabling semantic search at scale by combining vector similarity with structured filtering and integrated AI models.
qdrant/qdrant
Qdrant is a high-performance, massive-scale vector database and search engine designed for next-generation AI applications.
Avaiga/taipy
A Python library empowering data scientists and ML engineers to rapidly build and deploy production-ready data and AI-driven web applications.
alvinunreal/awesome-opensource-ai
A meticulously curated list of battle-tested, production-proven open-source AI models, libraries, infrastructure, and developer tools.
feast-dev/feast
An open-source feature store that streamlines the management and serving of features for AI/ML models, ensuring consistency between training and inference.
asgeirtj/system_prompts_leaks
A comprehensive collection of extracted system prompts, messages, and developer instructions from leading AI chatbots and coding assistants.
liaokongVFX/LangChain-Chinese-Getting-Started-Guide
A comprehensive Chinese-language guide to getting started with LangChain, enabling developers to build powerful applications powered by large language models.
shroominic/codeinterpreter-api
An open-source LangChain implementation of the ChatGPT Code Interpreter, enabling sandboxed Python code execution for LLM applications.
buxuku/SmartSub
A cross-platform desktop tool for batch video/audio subtitle generation and multi-service translation, supporting offline processing and hardware acceleration.
Zackriya-Solutions/meetily
A privacy-first, self-hosted AI meeting assistant that provides local transcription, speaker diarization, and summarization without cloud dependency.
AsyncFuncAI/deepwiki-open
An AI-powered tool that automatically generates comprehensive, interactive wikis and documentation for GitHub, GitLab, and Bitbucket repositories.
truefoundry/cognita
A modular, API-driven RAG framework designed for building scalable, production-ready AI applications, addressing the complexities of deploying RAG systems beyond prototyping.
activeloopai/deeplake
Deep Lake is an AI data runtime and database optimized for deep learning, offering multimodal data storage, querying, vector search, and streaming for LLM and deep learning applications.
jamiepine/voicebox
The open-source, local-first voice synthesis studio for voice cloning, speech generation, and audio effects.
alvinreal/awesome-opensource-ai
A meticulously curated list of battle-tested, production-proven open-source AI models, libraries, infrastructure, and developer tools.
camel-ai/owl
A cutting-edge framework for multi-agent collaboration, enabling optimized workforce learning and robust real-world task automation.
campfirein/byterover-cli
ByteRover CLI provides persistent, structured memory and context management for autonomous AI coding agents through an interactive REPL interface.
xszyou/Fay
Fay is an AI agent framework designed to connect digital humans (2.5D, 3D, mobile, PC, web) and large language models (OpenAI compatible, DeepSeek) with various business systems.
RSSNext/Folo
Folo is an AI-powered RSS reader that curates content into a single, noise-free timeline for distraction-free browsing.
chaitin/PandaWiki
PandaWiki is an open-source knowledge base system driven by large AI models that helps you quickly build intelligent product documentation, technical documentation, FAQ, and blog systems, with AI-assisted writing, Q&A, and search.
harry0703/MoneyPrinterTurbo
An AI-powered tool that generates high-definition short videos automatically from a given topic or keywords, including script, footage, subtitles, and background music.
steven2358/awesome-generative-ai
A meticulously curated list of modern Generative Artificial Intelligence projects and services, offering a structured overview of the rapidly evolving AI landscape.
punkpeye/awesome-mcp-servers
A curated list of Model Context Protocol (MCP) servers enabling AI models to securely interact with various local and remote resources.
CoplayDev/unity-mcp
Connects AI assistants like Claude and Cursor directly to the Unity Editor, empowering LLMs to automate game development tasks.
vas3k/TaxHacker
TaxHacker is a self-hosted AI accounting app that automates expense and income tracking for freelancers and small businesses by analyzing receipts, invoices, and transactions with LLMs.
simonw/llm
A command-line tool and Python library for interacting with various large language models, both remote APIs and local models.
umlx5h/LLPlayer
A specialized media player designed for language learners, offering advanced subtitle features like dual subtitles, AI-generated subtitles, and real-time translation.
microsoft/AI-For-Beginners
A comprehensive 12-week, 24-lesson curriculum designed to introduce beginners to the fundamentals of Artificial Intelligence.
tensorchord/Awesome-LLMOps
A comprehensive and curated list of the best LLMOps tools, designed to help developers navigate the complex landscape of Large Language Model operations.
spmallick/learnopencv
A comprehensive repository offering C++ and Python code examples for computer vision, deep learning, and AI research, complementing articles on LearnOpenCV.com.
zjunlp/LLMAgentPapers
A meticulously curated collection of must-read research papers on Large Language Model (LLM) agents, categorized for easy navigation and comprehensive understanding.
ScrapeGraphAI/Scrapegraph-ai
A Python library that leverages LLMs and graph logic to simplify web scraping and data extraction from various sources.
NirDiamant/Prompt_Engineering
A comprehensive repository offering 22 hands-on Jupyter Notebook tutorials on prompt engineering techniques for leveraging large language models.
TheR1D/shell_gpt
An AI-powered command-line tool that streamlines the generation of shell commands, code snippets, and documentation, enhancing developer productivity.
mnfst/awesome-free-llm-apis
A comprehensive, curated list of Large Language Model (LLM) APIs offering permanent free tiers for text inference.
tirth8205/code-review-graph
Optimizes AI code reviews by building a local knowledge graph of your codebase, drastically reducing token usage and providing precise context.
u14app/deep-research
A privacy-focused, AI-powered platform that leverages various large language models and web search to generate comprehensive deep research reports rapidly.
WangRongsheng/awesome-LLM-resources
A comprehensive and continuously updated collection of the world's best resources for Large Language Models (LLMs), covering various aspects from data to advanced applications.
vercel-labs/open-agents
An open-source reference application and template for building and running durable, cloud-based AI coding agents on Vercel, enabling automated code changes from prompts.
huggingface/datasets
A lightweight library providing a vast hub of ready-to-use datasets and efficient tools for data manipulation in AI and machine learning workflows.
zilliztech/claude-context
Enhances AI coding agents like Claude Code with semantic search, providing deep, cost-effective context from the entire codebase.
plastic-labs/honcho
An open-source memory library and managed service for building stateful AI agents that learn and adapt over time.
genkit-ai/genkit
An open-source, multi-language framework by Google for rapidly building and deploying production-ready AI-powered applications.
vespa-engine/vespa
A high-performance AI data platform for real-time search, recommendation, and machine learning inference at any scale.
vearch/vearch
A cloud-native distributed vector database designed for efficient similarity search of embedding vectors in AI applications.
HelixDB/helix-db
HelixDB is an open-source, Rust-built graph-vector database that consolidates multiple data models to simplify AI application development.
pinecone-io/examples
A comprehensive collection of Jupyter Notebooks and sample applications designed to help users learn and experiment with Pinecone vector databases and common AI patterns.
volcengine/MineContext
A proactive, context-aware AI assistant that enhances productivity by understanding your digital environment and proactively delivering insights.
devflowinc/trieve
An all-in-one API platform providing advanced search, recommendations, and Retrieval-Augmented Generation (RAG) capabilities for developers.
athina-ai/rag-cookbooks
A comprehensive repository offering practical implementations and evaluation guidance for advanced and agentic Retrieval-Augmented Generation (RAG) techniques.
Nerogar/OneTrainer
A comprehensive, one-stop solution for training various Diffusion models, offering extensive features for fine-tuning, dataset preparation, and model management.
AIDotNet/OpenDeepWiki
An open-source, AI-driven platform for converting code repositories into intelligent, searchable knowledge bases with conversational interaction.
beam-cloud/beta9
An ultrafast, open-source Pythonic runtime for deploying and scaling serverless GPU inference, sandboxes, and background jobs with zero infrastructure overhead.
stochasticai/xTuring
xTuring simplifies the fine-tuning, evaluation, and deployment of open-source Large Language Models (LLMs) on private data, ensuring privacy and efficiency.
LLMBook-zh/LLMBook-zh.github.io
A comprehensive Chinese technical book and associated course materials providing a systematic framework and roadmap for understanding Large Language Models, authored by leading experts.
nunchaku-ai/nunchaku
Nunchaku is a high-performance inference engine that optimizes 4-bit neural networks, especially diffusion models, for speed and efficiency.
agentheroes/agentheroes
Generate, animate, and schedule AI characters and their content for social media with automated workflows.
opendilab/awesome-RLHF
A continually updated, curated list of essential resources for Reinforcement Learning with Human Feedback (RLHF), encompassing research papers, codebases, datasets, and related materials.
Docta-ai/docta
Docta is an advanced data-centric AI platform that detects and rectifies issues in various data types to improve model performance.
EvolutionAPI/evolution-api
An open-source API for integrating WhatsApp and other messaging services, enabling advanced conversational applications.
promptslab/Awesome-Prompt-Engineering
A comprehensive, hand-curated collection of resources for Prompt Engineering and Context Engineering, covering papers, tools, models, APIs, and courses for Large Language Models.
ai-boost/awesome-prompts
A curated collection of high-quality prompts, advanced engineering frameworks, and research papers to master effective interaction with large language models.
dottxt-ai/outlines
Outlines guarantees structured outputs from Large Language Models during generation, eliminating post-processing headaches and ensuring data integrity.
nidhinjs/prompt-master
A Claude skill designed to generate precise, token-efficient prompts for any AI tool, eliminating wasted credits and re-prompting while maintaining full context.
Sanster/IOPaint
A free and open-source AI-powered tool for advanced image inpainting, outpainting, and object replacement using state-of-the-art models.
Baiyuetribe/paper2gui
Converts advanced AI models into user-friendly desktop applications, making cutting-edge AI accessible to everyone without installation.
HisMax/RedInk
A one-stop AI-powered generator for creating Xiaohongshu image-and-text posts from a single sentence.
SamurAIGPT/AI-Youtube-Shorts-Generator
Automates YouTube Shorts generation from long videos using AI for highlights, subtitles, and vertical cropping.
kuprel/min-dalle
A fast, minimal PyTorch port of DALL·E Mini, optimized for efficient text-to-image generation inference.
OpenBMB/VoxCPM
A tokenizer-free, multilingual Text-to-Speech system offering advanced voice design, controllable cloning, and high-quality audio output.
IAHispano/Applio
Applio is a powerful, user-friendly, and high-performance open-source tool for high-quality voice transformation.
fishaudio/Bert-VITS2
An open-source Text-to-Speech system built on the VITS2 backbone, enhanced with multilingual BERT for improved speech synthesis.
AIDC-AI/Pixelle-Video
An AI-powered engine that fully automates short video creation from a single topic, handling script, visuals, voiceover, and music without editing skills.
pot-app/pot-desktop
A versatile cross-platform desktop application that provides efficient text translation and optical character recognition (OCR) by integrating a wide array of AI and traditional service providers.
fishaudio/fish-speech
A state-of-the-art open-source multilingual text-to-speech system offering natural, expressive, and emotionally rich voice generation.
index-tts/index-tts
IndexTTS2 is an industrial-level, zero-shot text-to-speech system offering precise duration control and disentangled emotional expression for highly natural and controllable speech synthesis.
CorentinJ/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time using a three-stage deep learning framework.
remsky/Kokoro-FastAPI
A Dockerized FastAPI wrapper providing a high-performance, multi-platform (CPU/GPU) and multi-language API for the Kokoro-82M text-to-speech model, compatible with OpenAI's speech endpoint.
WhisperSpeech/WhisperSpeech
An open-source, high-performance text-to-speech system built on Whisper, aiming to be a hackable and commercially safe alternative for speech generation.
Eventual-Inc/Daft
A high-performance data engine for AI and multimodal workloads, processing diverse data types at scale with Python and Rust.
rerun-io/rerun
An open source SDK for logging, storing, querying, and visualizing multimodal and multi-rate data, especially for robotics and AI.
Blaizzy/mlx-audio
A high-performance library built on Apple's MLX framework, offering efficient text-to-speech, speech-to-text, and speech-to-speech capabilities optimized for Apple Silicon.
morphik-org/morphik-core
Morphik Core is an AI-native platform providing accurate document search and storage for building robust AI applications, specifically designed to handle complex, visually rich, and multimodal data, overcoming the limitations of traditional RAG.
emcf/thepipe
A Python library for extracting clean markdown, multimodal media, and structured data from complex documents using vision-language models.
collabora/WhisperLive
A real-time transcription application leveraging OpenAI's Whisper model for converting live or pre-recorded speech into text with optimized backends.
getumbrel/llama-gpt
A private, self-hosted, and offline ChatGPT-like chatbot powered by Llama 2 and Code Llama, ensuring 100% data privacy.
ramon-victor/freegpt-webui
A free, user-friendly web interface for interacting with GPT-3.5/4 models without requiring an API key.
CommandCodeAI/langui
An open-source collection of beautifully designed, Tailwind CSS-powered UI components for building modern AI, GPT, and LLM applications.
ai-for-developers/awesome-ai-coding-tools
A comprehensive, curated list of AI-powered tools designed to enhance various aspects of software development.
EmbeddedLLM/JamAIBase
A collaborative, spreadsheet-like platform for building, experimenting with, and evaluating AI applications, especially those leveraging Retrieval-Augmented Generation (RAG).
alan-ai/alan-sdk-web
Alan AI SDK for Web enables developers to embed a self-coding, generative AI layer into web applications, automating feature creation and UI/logic generation in real-time.
HughYau/qiushi-skill
Qiushi-Skill arms AI agents with a core principle and nine methodological tools derived from classical dialectical materialism and practical philosophy to enhance their problem-solving capabilities.
czlonkowski/n8n-mcp
A Model Context Protocol (MCP) server enabling AI assistants to build n8n workflows with deep node knowledge.
BasedHardware/omi
An open-source AI assistant that captures screen and conversations, transcribes in real-time, generates summaries, and offers an AI chat with comprehensive memory.
postgresml/postgresml
PostgresML integrates machine learning and AI capabilities, including GPU acceleration and large language models, directly into PostgreSQL, eliminating the need for separate systems and data transfers.
reorproject/reor
A private, local-first AI-powered desktop app for personal knowledge management, offering automatic note linking, semantic search, and Q&A on your notes.
tensorchord/pgvecto.rs
A scalable, low-latency PostgreSQL extension written in Rust, enabling advanced vector similarity search directly within your relational database.
swyxio/ai-notes
A comprehensive knowledge base for software engineers to quickly grasp the latest developments in AI, especially generative AI and large language models.
ashishps1/learn-ai-engineering
A comprehensive, curated collection of free resources for learning AI, Machine Learning, LLMs, and AI Engineering from scratch.
jianchang512/clone-voice
A user-friendly web-based tool for voice cloning, text-to-speech, and speech-to-speech conversion, leveraging the Coqui XTTS_v2 model with multi-language support.
myshell-ai/OpenVoice
An AI voice synthesis library offering instant, accurate, and flexible voice cloning with multi-lingual support.
myshell-ai/MeloTTS
A high-quality, multi-lingual text-to-speech library supporting real-time CPU inference across various languages and accents.
wzpan/wukong-robot
A flexible, open-source platform for building personalized Chinese voice assistants and smart speakers, featuring modular design, multi-language ASR/TTS, ChatGPT integration, and BCI wake-up.
coqui-ai/TTS
A deep learning toolkit for Text-to-Speech, offering pretrained models, training tools, and dataset utilities.
netease-youdao/EmotiVoice
An open-source, multi-voice, prompt-controlled text-to-speech engine capable of generating speech with diverse emotions in English and Chinese.
yl4579/StyleTTS2
StyleTTS 2 is a text-to-speech model that achieves human-level speech synthesis by leveraging style diffusion and adversarial training with large speech language models.
metavoiceio/metavoice-src
MetaVoice-1B is an open-source, 1.2B-parameter foundation model for highly expressive, human-like text-to-speech synthesis and zero-shot voice cloning.
Plachtaa/VALL-E-X
An open-source implementation of Microsoft's VALL-E X, enabling zero-shot multilingual text-to-speech synthesis and voice cloning with emotion control.
jaywalnut310/vits
VITS is an end-to-end text-to-speech model that generates highly natural-sounding audio with diverse rhythms, outperforming traditional two-stage TTS systems.
kyegomez/tree-of-thoughts
A plug-and-play Python library implementing the Tree of Thoughts (ToT) algorithm to significantly enhance large language model reasoning capabilities by up to 70%.
X-PLUG/mPLUG-DocOwl
A modularized multimodal large language model designed for OCR-free document understanding.
X-PLUG/mPLUG-Owl
A family of powerful multimodal large language models (MLLMs) designed to advance AI's understanding and generation capabilities across various data types.
Chevey339/kelivo
A versatile Flutter-based LLM chat client supporting multiple AI providers and platforms with advanced features like multimodal input and web search.
HanaokaYuzu/Gemini-API
A reverse-engineered asynchronous Python API for the Google Gemini web app, enabling programmatic interaction with its advanced AI features.
bytedance/flowgram.ai
FlowGram is a composable and visual framework designed to help developers rapidly build extensible AI workflow platforms with integrated canvas, forms, and variable management.
The-Pocket/PocketFlow
A minimalist, 100-line LLM framework designed for building AI agents and workflows with zero bloat and maximum expressiveness.
n8n-io/self-hosted-ai-starter-kit
An open-source Docker Compose template for quickly setting up a secure, self-hosted local AI and low-code development environment.
wassupjay/n8n-free-templates
A comprehensive collection of 200+ plug-and-play n8n workflows, integrating classic automation with modern AI technologies like vector databases, embeddings, and large language models.
ChatAnyTeam/ChatAny
A unified web service providing one-click access to ChatGPT and various AI models like Midjourney and StabilityAI.
wangkai930418/awesome-diffusion-categorized
A meticulously categorized collection of research papers on diffusion models, organized by diverse subareas such as visual illusion, color in generation, image restoration, and text-guided editing.
mylxsw/aidea
An open-source, cross-platform application integrating mainstream large language models and image generation capabilities.
11cafe/jaaz
An open-source, privacy-focused multimodal creative AI assistant that serves as a local alternative to Canva and Manus for generating images and videos.
zombieyang/sd-ppp
Integrates advanced AI image generation capabilities directly into Adobe Photoshop, supporting various models and APIs like Midjourney and Replicate.
MrForExample/ComfyUI-3D-Pack
An extensive node suite that integrates cutting-edge 3D generation algorithms and models into ComfyUI, enabling seamless processing of 3D inputs like meshes and UV textures.
numz/ComfyUI-SeedVR2_VideoUpscaler
Official SeedVR2 Video Upscaler for ComfyUI, enabling high-quality video and image upscaling, also runnable as a standalone CLI.
ZHO-ZHO-ZHO/ComfyUI-Workflows-ZHO
A comprehensive and curated collection of ComfyUI workflows for diverse generative AI tasks, simplifying complex AI art and video creation.
Kanaries/graphic-walker
An open-source, embeddable React component for intuitive exploratory data analysis and visualization using drag-and-drop or natural language queries, positioned as an alternative to Tableau.
javahuang/SurveyKing
An AI-powered, self-hosted open-source platform for creating professional surveys and online exams with advanced features and one-command deployment.
hooram/ownphotos
A self-hosted, open-source alternative to Google Photos, offering AI-powered photo organization and management with a focus on privacy and user control.
nichtdax/awesome-totally-open-chatgpt
A curated list of truly open-source alternatives to ChatGPT, featuring instruction-tuned language models for conversational AI.
eigent-ai/eigent
Eigent is an open-source Cowork desktop application that empowers users to build, manage, and deploy custom AI workforces for automating complex workflows, offering a local and free alternative to commercial AI cowork platforms.
gitroomhq/postiz-app
An AI-powered social media scheduling and management tool for growing an audience, capturing leads, and streamlining content strategy.
digoal/blog
A comprehensive open-source knowledge base offering learning videos, articles, and best practices for PostgreSQL, Greenplum, PolarDB, and related AI/database technologies.
jamesmurdza/awesome-ai-devtools
A comprehensive, categorized list of AI-powered developer tools designed to enhance productivity across various software development tasks.
deepfakes/faceswap
An open-source deep learning tool that enables users to recognize and swap faces in images and videos.
pydantic/monty
A minimal, secure, and high-performance Python interpreter written in Rust, designed for safely executing LLM-generated code within AI agents without container overhead.
AgriciDaniel/claude-seo
A comprehensive AI-powered SEO audit and strategic planning skill for Claude Code, offering deep analysis across technical, content, local, and international SEO with integrated reporting.
JCodesMore/ai-website-cloner-template
A template leveraging AI coding agents to reverse-engineer any website into a modern Next.js codebase with a single command.
PRIS-CV/DemoFusion
DemoFusion democratizes high-resolution AI image generation by unlocking the untapped potential of existing Latent Diffusion Models without requiring extensive computational resources.
Fanghua-Yu/SUPIR
SUPIR is an AI-driven project focused on developing practical algorithms for photo-realistic image restoration and upscaling in real-world scenarios.
philz1337x/clarity-upscaler
A free and open-source AI image upscaler and enhancer, offering a powerful alternative to commercial solutions like Magnific.
CodeWithCJ/SparkyFitness
A self-hosted, privacy-first fitness and nutrition tracking platform with AI features, designed for families.
academic/awesome-datascience
A comprehensive, open-source repository of Data Science learning resources and tools for real-world problem-solving.
dora-rs/dora
DORA is a high-performance, 100% Rust framework for building real-time, low-latency, and distributed AI-based robotic applications using a dataflow-oriented architecture.