Ecosystem & Stack: Python
unslothai/unsloth
Unsloth Studio is a web UI that enables efficient local training and inference of open-source large language models and other AI models with significant VRAM and speed optimizations.
ZhuLinsen/daily_stock_analysis
An LLM-driven system for automated daily stock analysis across A/H/US markets, providing decision dashboards and multi-channel notifications at no cost.
googleapis/genai-toolbox
An open-source MCP server connecting AI agents, IDEs, and applications to enterprise databases for data interaction, schema exploration, and code generation.
ray-project/ray
Ray is a unified framework for scaling AI and Python applications from a laptop to a cluster, simplifying complex ML workloads with a distributed runtime and specialized libraries.
OpenHands/OpenHands
OpenHands is an AI-driven development platform that empowers users to build, run, and scale autonomous software agents for various development tasks.
volcengine/OpenViking
OpenViking is an open-source context database for AI Agents, unifying memory, resources, and skills through a file system paradigm to enable hierarchical context delivery and self-evolving capabilities.
bytedance/deer-flow
DeerFlow is an open-source SuperAgent harness designed to research, code, and create by orchestrating sub-agents, memory, and sandboxes with extensible skills for long-horizon tasks.
comet-ml/opik
An open-source platform for comprehensive tracing, evaluation, and optimization of LLM applications, RAG systems, and agentic workflows.
MemoriLabs/Memori
Memori provides an LLM-agnostic layer that transforms AI agent execution and conversations into structured, persistent memory for production systems.
langchain-ai/langchain
LangChain is a framework for building robust LLM-powered applications and intelligent agents by chaining together interoperable components and third-party integrations.
agno-agi/agno
Agno is a comprehensive platform for building, running, and managing scalable agentic software, addressing the unique challenges of AI agent interaction, governance, and trust in production environments.
mindsdb/mindsdb
MindsDB is an AI-powered query engine that enables AI agents to securely access, unify, and analyze data from diverse sources using natural language and SQL.
pipecat-ai/pipecat
An open-source Python framework for building real-time, voice-first, and multimodal conversational AI agents with composable pipelines.
docling-project/docling
Docling simplifies document processing by parsing diverse formats, with advanced PDF understanding, and integrates with the generative AI ecosystem.
langchain-ai/deepagents
Deep Agents is a batteries-included AI agent harness built with LangChain and LangGraph, offering out-of-the-box capabilities for planning, file system interaction, sub-agent delegation, and intelligent context management to tackle complex agentic tasks efficiently.
D4Vinci/Scrapling
An adaptive Python web scraping framework designed to handle everything from single requests to large-scale crawls, featuring anti-bot bypass and intelligent parsing.
github/spec-kit
An open-source toolkit that enables Spec-Driven Development, transforming executable specifications directly into working software implementations.
crewAIInc/crewAI
A lean, lightning-fast Python framework for orchestrating autonomous, collaborative AI agents to tackle complex tasks efficiently.
BerriAI/litellm
A unified open-source AI Gateway and Python SDK to call over 100 LLM APIs in an OpenAI-compatible format, offering features like cost tracking, load balancing, and guardrails.
deepset-ai/haystack
An open-source AI orchestration framework for building production-ready LLM applications with modular pipelines and agent workflows.
modelscope/ms-swift
A scalable and lightweight infrastructure for fine-tuning, inference, and deployment of over 1000 large language models (LLMs) and multimodal large language models (MLLMs) using advanced techniques.
pydantic/pydantic-ai
A Python agent framework built by the Pydantic team, designed to simplify and accelerate the development of production-grade Generative AI applications with type safety and robust observability.
promptfoo/promptfoo
A CLI and library for testing, evaluating, and red-teaming LLM applications to ensure security, reliability, and performance across various models.
sgl-project/sglang
SGLang is a high-performance serving framework for large language models and multimodal models, optimizing inference throughput and latency.
Chainlit/chainlit
A Python framework for rapidly building and deploying production-ready conversational AI applications.
vllm-project/vllm
vLLM is a high-throughput and memory-efficient open-source library designed for fast and easy serving of large language models.
langbot-app/LangBot
A production-grade, open-source platform for building and deploying AI-powered instant messaging bots across various chat platforms.
AstrBotDevs/AstrBot
AstrBot is an open-source, all-in-one AI agent chatbot platform and development framework that integrates with various instant messaging apps and LLMs, offering scalable conversational AI infrastructure.
GoogleCloudPlatform/generative-ai
Provides sample code, notebooks, and resources for building and managing generative AI workflows on Google Cloud using Vertex AI and Gemini.
agentscope-ai/ReMe
ReMe is a memory management framework for AI agents, addressing limited context windows and stateless sessions by providing persistent, automatically refined memory.
awslabs/amazon-bedrock-agentcore-samples
Practical samples and tutorials for Amazon Bedrock AgentCore, a framework-agnostic and model-agnostic infrastructure for securely deploying and operating advanced AI agents at scale.
MemTensor/MemOS
MemOS is an AI memory operating system designed for LLMs and AI agents, providing persistent, context-aware, and multi-modal memory for enhanced skill reuse and evolution across tasks.
datapizza-labs/datapizza-ai
A Python framework for building reliable, predictable, and observable Generative AI agents with minimal overhead.
Mai-with-u/MaiBot
MaiBot is an LLM-based intelligent agent designed to be a human-like digital companion, prioritizing warmth, authenticity, and genuine connection over perfection or efficiency.
iflytek/astron-rpa
AstronRPA is an open-source, enterprise-grade RPA desktop application offering low-code visual design to automate desktop and web processes, deeply integrating with AI agents for enhanced business automation.
camel-ai/camel
A multi-agent framework for studying emergent behaviors and scaling laws in AI systems through simulation and interaction.
mlflow/mlflow
An open-source AI engineering platform for debugging, evaluating, monitoring, and optimizing production-quality AI applications, including agents, LLMs, and ML models.
ComposioHQ/composio
Composio provides SDKs and a robust platform to empower AI agents with over 1000 toolkits, enabling them to turn intent into action through seamless tool search, context management, and authentication.
Skyvern-AI/skyvern
Automates complex browser-based workflows using LLMs and computer vision, providing a resilient and adaptive solution for web interaction.
open-webui/open-webui
A user-friendly, self-hosted AI platform providing a powerful interface for interacting with various LLMs, including Ollama and OpenAI-compatible APIs, with advanced RAG capabilities.
hiyouga/LlamaFactory
A unified and efficient framework for fine-tuning over 100 large language models (LLMs) and vision-language models (VLMs) with both CLI and Web UI.
OpenBB-finance/OpenBB
An open-source financial data platform that integrates diverse data sources and exposes them to analysts, quants, and AI agents.
topoteretes/cognee
An open-source knowledge engine designed to build personalized and dynamic memory for AI agents, enabling them to learn and provide relevant context.
Lightning-AI/pytorch-lightning
Streamlines complex deep learning engineering, enabling scalable AI model training and fine-tuning across diverse hardware with minimal code changes.
affaan-m/everything-claude-code
A comprehensive system for optimizing the performance, security, and learning capabilities of AI agent harnesses like Claude Code, Codex, and Cursor.
milvus-io/milvus
A high-performance, cloud-native vector database designed for scalable Approximate Nearest Neighbor (ANN) search on massive unstructured data.
1Panel-dev/MaxKB
An open-source platform for building enterprise-grade AI agents, integrating RAG, robust workflows, and multi-modal capabilities to enhance smart Q&A and automate complex business scenarios.
Unstructured-IO/unstructured
An open-source ETL solution for transforming complex documents into clean, structured data formats, optimized for language models.
browser-use/browser-use
Enables AI agents to interact with and automate tasks on websites, making web content accessible for large language models.
HKUDS/LightRAG
LightRAG is a simple and fast framework for Retrieval-Augmented Generation, designed to enhance LLM performance with external knowledge.
run-llama/llama_index
LlamaIndex is an open-source framework designed to build intelligent agentic applications by connecting Large Language Models (LLMs) with private or custom data sources, focusing on document understanding and OCR.
neuml/txtai
An all-in-one AI framework for semantic search, LLM orchestration, and language model workflows, powered by an embeddings database.
bentoml/BentoML
A Python library for building, deploying, and scaling AI/ML model inference APIs and serving systems.
ollama/ollama
Run open-source large language models locally on your machine with a simple CLI, REST API, and client libraries.
dataelement/Clawith
Clawith is an open-source multi-agent collaboration platform that empowers AI agents with persistent identities, long-term memory, and dedicated workspaces, enabling them to function as autonomous digital employees within teams.
trycua/cua
Cua provides open-source infrastructure, including sandboxes and SDKs, to build, train, and evaluate AI agents capable of controlling full desktop environments across macOS, Linux, and Windows.
gptme/gptme
A personal AI agent in your terminal, equipped with local tools for coding, terminal operations, and web browsing, enabling autonomous task execution.
OpenPipe/ART
An open-source framework for training multi-step LLM agents using reinforcement learning (GRPO) to improve their reliability and performance on real-world tasks, with an optional serverless training service.
google/adk-python
An open-source, code-first Python framework for building, evaluating, and deploying sophisticated AI agents with flexibility and control.
e2b-dev/E2B
An open-source infrastructure providing secure, isolated cloud sandboxes for executing AI-generated code with developer-friendly SDKs.
Nixtla/nixtla
A production-ready, pre-trained time series foundation model (TimeGPT) for accurate forecasting and anomaly detection across various domains with minimal code.
microsoft/autogen
A programming framework for building multi-agent AI applications that can operate autonomously or collaboratively with humans.
carla-simulator/carla
An open-source simulator for autonomous driving research, providing a flexible platform and open digital assets for the development, training, and validation of AD systems.
agentscope-ai/agentscope
A production-ready, extensible AI agent framework designed for building, deploying, and understanding intelligent agents powered by advanced LLMs, with built-in fine-tuning and multi-agent orchestration.
Netflix/metaflow
A human-centric Python framework for building, managing, and deploying real-life AI/ML systems from rapid prototyping to reliable production.
leon-ai/leon
An open-source, privacy-aware personal AI assistant designed for local operation and agentic task execution.
mem0ai/mem0
Mem0 provides a universal, intelligent memory layer for AI agents, enabling personalized and context-rich interactions across various applications.
microsoft/semantic-kernel
An enterprise-ready, model-agnostic SDK for building and orchestrating intelligent AI agents and multi-agent systems with cutting-edge LLM technology.
VectifyAI/PageIndex
PageIndex is a vectorless, reasoning-based RAG system that builds hierarchical tree indexes for human-like, context-aware document retrieval, outperforming traditional vector-based methods.
explosion/spaCy
An industrial-strength Python library for advanced Natural Language Processing, offering state-of-the-art models and a production-ready training system.
AlexsJones/llmfit
A terminal tool that detects your hardware and recommends optimal LLM models, providing performance benchmarks for local execution.
xorbitsai/inference
A unified, production-ready inference API for deploying and serving open-source language, speech, and multimodal AI models on various infrastructures.
cft0808/edict
An AI multi-agent orchestration system inspired by ancient Chinese imperial governance, featuring institutional review, real-time dashboards, and full audit trails for reliable and observable AI collaboration.
AAswordman/Operit
Operit AI is the first fully-featured AI assistant application for Android, offering powerful tool-calling, local model support, and a built-in Ubuntu 24 environment for advanced automation.
OpenDCAI/Paper2Any
An AI-driven platform that transforms research papers, text, or topics into editable scientific figures, technical diagrams, and presentation slides with universal file support.
InternLM/xtuner
A next-generation training engine optimized for ultra-large Mixture-of-Experts (MoE) models, offering superior efficiency and scalability.
datawhalechina/hello-agents
A comprehensive tutorial guiding users from foundational theories to practical implementation of AI-native intelligent agent systems.
wandb/wandb
An AI developer platform for tracking, visualizing, and managing machine learning models from experimentation to production.
iii-hq/iii
iii unifies diverse backend services like APIs, task queues, and schedulers into a single engine using Function, Trigger, and Worker primitives for simplified distributed system development.
llmware-ai/llmware
A unified Python framework for building local, private, and secure enterprise RAG pipelines using small, specialized LLMs and a comprehensive model catalog.
rasbt/LLMs-from-scratch
An educational project providing step-by-step code to build a ChatGPT-like Large Language Model (LLM) from scratch using PyTorch.
emcie-co/parlant
A production-ready framework for building controlled, consistent, and predictable customer-facing AI agent interactions with LLMs, optimized for context engineering.
lutzroeder/netron
A universal viewer for neural network, deep learning, and machine learning models, supporting a wide array of formats.
treeverse/dvc
DVC (Data Version Control) is a command-line tool and VS Code extension for managing data, models, and ML experiments, enabling reproducible machine learning projects.
FlagOpen/FlagEmbedding
FlagEmbedding (BGE) is a comprehensive toolkit offering state-of-the-art embedding models for efficient search, Retrieval-Augmented Generation (RAG), and various multimodal AI applications.
memodb-io/Acontext
Acontext is an open-source skill memory layer for AI agents that automatically captures learnings from agent runs and stores them as readable, editable, and shareable skill files.
facefusion/facefusion
An industry-leading, open-source platform for advanced AI-driven face manipulation and deepfake creation.
linyqh/NarratoAI
A tool that uses AI models for one-click video commentary and editing, enabling efficient content creation.
p-e-w/heretic
Heretic is an AI model utility that automatically removes censorship and safety alignment from transformer-based language models without requiring expensive post-training.
evalstate/fast-agent
A flexible CLI-first framework for building, evaluating, and interacting with sophisticated multimodal LLM agents and workflows, offering comprehensive model and skill support.
shaxiu/XianyuAutoAgent
An AI-powered customer service bot for Xianyu, offering 24/7 automated support, multi-expert decision-making, smart negotiation, and context-aware conversations.
casibase/casibase
An open-source, enterprise-grade AI Cloud OS providing a knowledge base and a management platform for MCP (Model Context Protocol) and A2A (agent-to-agent) integrations, complete with admin UI, user management, and SSO.
WEIFENG2333/VideoCaptioner
An AI-powered tool for comprehensive video subtitling, offering speech-to-text, intelligent segmentation, LLM-based optimization, translation, and video synthesis.
GiovanniPasq/agentic-rag-for-dummies
A modular framework built with LangGraph for developing advanced Agentic RAG systems, offering both learning materials and an extensible architecture.
yuruotong1/autoMate
An AI-driven local automation assistant that acts as a personal data warehouse and tool orchestrator for any LLM, enabling cross-vendor memory, file storage, and real-world actions.
yichuan-w/LEANN
LEANN is an innovative on-device vector database enabling private, efficient, and fast Retrieval-Augmented Generation (RAG) across all your personal data with significant storage savings.
xming521/WeClone
A comprehensive platform enabling users to create personalized AI twins by fine-tuning Large Language Models with their unique chat history, capturing individual style and bringing digital selves to life.
TurixAI/TuriX-CUA
TuriX empowers AI models to directly interact with and control your desktop, automating complex tasks across various applications with high accuracy.
metorial/metorial
An open-source integration platform enabling AI models to connect with thousands of APIs, data sources, and tools via simple SDKs and the Model Context Protocol (MCP).
pinpoint-apm/pinpoint
An APM tool for large-scale distributed systems, providing real-time monitoring and code-level visibility across transactions.
snyk/agent-scan
Snyk Agent Scan is a security scanner designed to discover and analyze AI agent components for prompt injections, vulnerabilities, and sensitive data handling issues.
Upsonic/Upsonic
A Python framework for building autonomous and traditional AI agents, offering robust tools, prebuilt components, and integrated OCR capabilities.
xlang-ai/OSWorld
OSWorld is a benchmark and environment for evaluating multimodal AI agents on open-ended tasks within real computer operating systems.
Arize-ai/phoenix
An open-source platform for debugging, evaluating, and monitoring AI/ML models and pipelines.
langchain-ai/langchain-mcp-adapters
A lightweight Python library that enables seamless integration of Anthropic Model Context Protocol (MCP) tools with LangChain and LangGraph.
meta-llama/llama-cookbook
An official guide and collection of recipes for building applications with the Llama model family, covering inference, fine-tuning, and RAG.
huggingface/agents-course
A comprehensive, free online course from Hugging Face designed to teach the fundamentals and advanced techniques of building AI agents.
argilla-io/argilla
Argilla is an open-source collaboration tool for AI engineers and domain experts to build and manage high-quality datasets for various AI models, leveraging human feedback and programmatic workflows.
neo4j-labs/llm-graph-builder
A powerful application that transforms diverse unstructured data sources into structured Neo4j Knowledge Graphs using Large Language Models (LLMs) and LangChain.
AgentOps-AI/agentops
A Python SDK and platform providing comprehensive observability, monitoring, and evaluation tools for AI agents, from prototype to production.
datawhalechina/all-in-rag
A comprehensive, full-stack guide to Retrieval-Augmented Generation (RAG) technology for large language model application development, covering theory, practice, and engineering best practices.
microsoft/markitdown
A lightweight Python utility for converting diverse file formats and office documents into structured Markdown, optimized for Large Language Models (LLMs) and text analysis pipelines.
datawhalechina/llm-universe
A beginner-friendly, hands-on tutorial for large language model (LLM) application development, focusing on practical skills and RAG implementation.
NirDiamant/RAG_Techniques
A repository showcasing various advanced Retrieval-Augmented Generation (RAG) techniques through detailed notebook tutorials.
run-llama/rags
Build and customize RAG pipelines and chatbots over your data using natural language, powered by Streamlit.
LearningCircuit/local-deep-research
An AI-powered research assistant for deep, agentic research, supporting local and cloud LLMs, multiple search sources, and ensuring data privacy with local, encrypted storage.
strands-agents/sdk-python
A Python SDK that simplifies the creation and deployment of AI agents using a model-driven approach, supporting diverse LLMs and advanced features.
CaviraOSS/OpenMemory
A self-hosted, local-first cognitive memory engine providing real long-term memory for LLM applications and AI agents, distinct from RAG or vector databases.
containers/ramalama
RamaLama is an open-source developer tool that simplifies the local serving and production inference of AI models by leveraging familiar container technology.
PaddlePaddle/FastDeploy
A high-performance inference and deployment toolkit for Large Language Models (LLMs) and Vision-Language Models (VLMs) based on PaddlePaddle.
kvcache-ai/Mooncake
A KVCache-centric disaggregated architecture for high-performance LLM serving, powering leading AI services.
OpenRLHF/OpenRLHF
An easy-to-use, scalable, and high-performance open-source framework for Reinforcement Learning from Human Feedback (RLHF), leveraging Ray and vLLM for distributed training of LLMs and VLMs.
katanaml/sparrow
A production-ready platform for structured data extraction and instruction calling using ML, LLM, and Vision LLM technologies.
Orchestra-Research/AI-Research-SKILLs
A comprehensive open-source library providing AI agents with the skills to autonomously conduct the entire AI research lifecycle, from ideation to paper writing.
skyzh/tiny-llm
A hands-on course for systems engineers to build an efficient LLM inference serving system from scratch on Apple Silicon using MLX, mimicking vLLM's core techniques.
xlite-dev/Awesome-LLM-Inference
A comprehensive, curated list of research papers and associated code implementations focused on optimizing Large Language Model (LLM) and Vision-Language Model (VLM) inference.
apconw/Aix-DB
Aix-DB is an intelligent data analysis system leveraging large language models and RAG technology to transform natural language queries into data insights and visualizations.
lance-format/lance
An open lakehouse format for multimodal AI, offering high-performance random access, vector indexing, and data versioning.
apache/airflow
A platform to programmatically author, schedule, and monitor data workflows.
kedro-org/kedro
A Python framework for building reproducible, maintainable, and modular data engineering and data science pipelines using software engineering best practices.
skypilot-org/skypilot
A unified system to run, manage, and scale AI workloads across any infrastructure, including Kubernetes, Slurm, and over 20 cloud providers.
datachain-ai/datachain
DataChain provides a typed, versioned, and queryable data context layer for unstructured data in object storage, empowering AI agents and pipelines with efficient metadata management and incremental computations.
evidentlyai/evidently
An open-source Python library for evaluating, testing, and monitoring ML and LLM systems from experiments to production, supporting tabular and text data with 100+ built-in metrics.
flyteorg/flyte
A dynamic, resilient open-source orchestrator for building scalable and reproducible data and ML pipelines on Kubernetes.
SwanHubX/SwanLab
SwanLab is an open-source platform with a modern design for tracking, visualizing, and analyzing AI/ML training experiments, supporting cloud and self-hosted deployments.
Avaiga/taipy
Taipy is a Python library that empowers data scientists and machine learning engineers to rapidly transform their data and AI algorithms into production-ready web applications.
HumanSignal/label-studio
An open-source, multi-type data labeling and annotation tool with a simple UI and standardized output, designed to prepare and improve data for machine learning models.
kubeflow/pipelines
An open-source platform for building, deploying, and managing end-to-end machine learning workflows on Kubernetes.
zenml-io/zenml
An open-source AI platform that unifies ML pipelines and agentic workflows, abstracting infrastructure complexity for ML/AI engineers.
aws/amazon-sagemaker-examples
A collection of Jupyter notebooks demonstrating how to build, train, and deploy machine learning models using Amazon SageMaker and its new Python SDK, SageMaker-Core.
GoogleCloudPlatform/agent-starter-pack
A Python package offering production-ready templates and infrastructure for deploying GenAI agents on Google Cloud, now superseded by `agents-cli` for ongoing development.
polyaxon/polyaxon
A comprehensive MLOps platform for managing, orchestrating, and scaling the machine learning lifecycle with reproducibility and automation.
aimhubio/aim
An open-source tool designed to log, compare, and observe machine learning training runs and AI metadata with an intuitive UI and programmatic API.
apache/hamilton
Apache Hamilton is a lightweight Python library that enables data scientists and engineers to define testable, modular, and self-documenting dataflows (DAGs) with built-in lineage and metadata, portable across any Python environment.
feast-dev/feast
An open-source feature store for AI/ML that streamlines the management and serving of features for model training and online inference.
great-expectations/great_expectations
A powerful data quality framework that provides expressive unit tests for your data, ensuring reliability and consistency.
dagster-io/dagster
A cloud-native data pipeline orchestrator designed for the development, production, and observation of data assets, featuring integrated lineage, observability, and a declarative programming model.
SWE-agent/mini-swe-agent
A radically simple yet highly performant AI agent for software engineering, capable of solving GitHub issues and assisting in command-line tasks.
chatchat-space/Langchain-Chatchat
An open-source, offline-deployable RAG and Agent application built with LangChain, supporting various LLMs for private, local knowledge base Q&A.
zilliztech/GPTCache
A semantic caching library for LLM queries, designed to drastically cut API costs and accelerate response times.
liaokongVFX/LangChain-Chinese-Getting-Started-Guide
A comprehensive Chinese-language tutorial for LangChain, guiding developers to build powerful applications powered by large language models.
TaskingAI/TaskingAI
An open-source BaaS platform that unifies hundreds of LLM models and provides comprehensive tools for developing, deploying, and managing AI-native applications and LLM-based agents.
shroominic/codeinterpreter-api
An open-source Python library providing a LangChain-compatible implementation of the ChatGPT Code Interpreter for sandboxed code execution.
rag-web-ui/rag-web-ui
An intelligent dialogue system leveraging RAG technology to build custom Q&A systems from diverse knowledge bases.
AsyncFuncAI/deepwiki-open
Automatically generates comprehensive, interactive, and visually rich wikis for GitHub, GitLab, and Bitbucket repositories using AI.
clearml/clearml
ClearML streamlines AI/ML/LLM workflows with integrated experiment tracking, data management, MLOps/LLMOps orchestration, and model serving.
Netflix/maestro
Maestro is Netflix's highly scalable, general-purpose workflow orchestrator, providing a fully managed workflow-as-a-service for data and ML pipelines.
truefoundry/cognita
A production-ready RAG framework that provides modular, API-driven components and a UI to streamline the deployment and management of Retrieval-Augmented Generation applications.
plexe-ai/plexe
Build machine learning models from natural language prompts using an AI-powered multi-agent system.
GokuMohandas/Made-With-ML
A comprehensive educational platform teaching developers how to design, develop, deploy, and iterate on production-grade machine learning applications.
PacktPublishing/LLM-Engineers-Handbook
A comprehensive practical guide and accompanying code repository for LLM engineers, covering the full lifecycle of building, deploying, and monitoring advanced LLM and RAG applications on AWS with LLMOps best practices.
microsoft/agent-lightning
A versatile framework designed to train and optimize AI agents from any framework with minimal code changes, leveraging advanced algorithms like Reinforcement Learning.
activeloopai/deeplake
Deep Lake is an AI data runtime and database optimized for deep learning, offering serverless multimodal data storage, scalable retrieval, and training capabilities.
bitsandbytes-foundation/bitsandbytes
A PyTorch library enabling accessible large language models through k-bit quantization, significantly reducing memory consumption for both inference and training.
ludwig-ai/ludwig
A low-code, declarative framework for building and deploying custom large language models (LLMs) and other deep neural networks with ease and efficiency.
hacksider/Deep-Live-Cam
Deep-Live-Cam enables real-time face swapping and one-click video deepfake generation using just a single image.
letta-ai/letta
A platform for building stateful AI agents equipped with advanced memory, enabling them to learn and self-improve over time.
camel-ai/owl
A cutting-edge framework for multi-agent collaboration to automate real-world tasks using AI.
xszyou/Fay
Fay is an AI agent framework designed to connect digital humans (2.5D, 3D, mobile, PC, web) and large language models (OpenAI-compatible, DeepSeek) with various business systems.
jundot/omlx
An LLM inference server optimized for Apple Silicon, featuring continuous batching, tiered KV caching, and macOS menu bar management for efficient local AI.
microsoft/RD-Agent
Automates data-driven AI R&D processes using AI agents, excelling in machine learning engineering tasks.
harry0703/MoneyPrinterTurbo
An AI-powered tool that generates high-definition short videos automatically from a given topic or keywords, including script, footage, subtitles, and background music.
CoplayDev/unity-mcp
Empowers AI assistants to directly interact with and automate tasks within the Unity Editor, streamlining game development workflows.
aiming-lab/SimpleMem
SimpleMem offers an efficient, lifelong, and multimodal memory solution for LLM agents, featuring semantic lossless compression for diverse data types.
FareedKhan-dev/all-agentic-architectures
A comprehensive repository offering practical implementations of 17+ state-of-the-art AI agent architectures using LangChain and LangGraph for hands-on learning and development.
openvinotoolkit/openvino
OpenVINO is an open-source toolkit designed to optimize and deploy deep learning models for efficient AI inference across a wide range of hardware platforms.
simonw/llm
A command-line tool and Python library to interact with various large language models, both remote APIs and local models.
decodingai-magazine/second-brain-ai-assistant-course
Learn to build a personalized AI assistant leveraging your 'Second Brain' knowledge base using advanced LLM, RAG, and agentic techniques with MLOps best practices.
microsoft/AI-For-Beginners
A 12-week, 24-lesson curriculum from Microsoft that teaches Artificial Intelligence to beginners, including practical lessons, quizzes, and labs.
spmallick/learnopencv
A comprehensive repository offering C++ and Python code examples that accompany the computer vision, deep learning, and AI articles on LearnOpenCV.com.
AIDC-AI/ComfyUI-Copilot
An AI-powered custom node for ComfyUI that automates workflow creation, debugging, and optimization, acting as an intelligent development partner for AIGC.
coderamp-labs/gitingest
Gitingest transforms any Git repository into a structured, prompt-friendly text digest, optimized for large language models to understand codebases efficiently.
ScrapeGraphAI/Scrapegraph-ai
A Python library that leverages LLMs and graph logic to simplify web scraping and data extraction from various sources.
recommenders-team/recommenders
A comprehensive toolkit providing best practices, examples, and state-of-the-art algorithms to assist in prototyping, experimenting with, and operationalizing recommendation systems.
hpcaitech/ColossalAI
An open-source framework designed to make large AI model training and inference cheaper, faster, and more accessible through advanced distributed computing and memory optimization techniques.
MemPalace/mempalace
An open-source, local-first AI memory system designed for verbatim storage and high-recall semantic retrieval of conversation history.
HKUDS/nanobot
An ultra-lightweight, open-source personal AI agent platform designed for efficiency and broad compatibility across various LLM providers and communication channels.
danielmiessler/Fabric
Fabric is an open-source framework that organizes AI prompts by task, simplifying AI integration and augmenting human capabilities across various tools and interfaces.
agentscope-ai/QwenPaw
QwenPaw is a personal, self-hostable AI assistant platform designed for privacy, extensibility, and multi-channel integration, empowering users with full control over their data and capabilities.
NirDiamant/Prompt_Engineering
A comprehensive GitHub repository offering 22 hands-on Jupyter Notebook tutorials on prompt engineering techniques, from basic to advanced, for leveraging large language models.
TheR1D/shell_gpt
A command-line tool that uses large language models to quickly generate shell commands, code snippets, and documentation.
Integuru-AI/Integuru
An AI agent that automates the creation of permissionless integrations by reverse-engineering platforms' internal APIs.
tirth8205/code-review-graph
Optimizes AI coding tools by building a local knowledge graph of your codebase, significantly reducing token usage for code reviews and daily coding tasks.
google/langextract
A Python library leveraging LLMs to extract structured information from unstructured text with precise source grounding and interactive visualization.
microsoft/UFO
UFO³ is a framework that orchestrates intelligent agents across multiple heterogeneous devices to automate complex, cross-device workflows and tasks.
livekit/agents
A framework for building realtime, programmable voice AI agents that can see, hear, and understand.
nesquena/hermes-webui
A lightweight, self-hosted web interface providing full CLI parity for the persistent and self-improving Hermes AI Agent, accessible from any device.
huggingface/datasets
A lightweight library providing one-line dataloaders and efficient pre-processing tools for a vast hub of AI datasets, supporting various ML frameworks.
Pythagora-io/gpt-pilot
GPT Pilot is an AI developer companion designed to build complete, production-ready applications, automating up to 95% of the coding process with human oversight.
microsoft/fara
A compact 7B-parameter AI agent designed by Microsoft to automate multi-step computer tasks through visual perception and direct interface interaction.
BoundaryML/baml
BAML is an AI framework that transforms prompt engineering into schema engineering, enabling developers to build reliable, type-safe AI workflows and agents across multiple programming languages.
lancedb/lancedb
An open-source, developer-friendly embedded retrieval library and multimodal AI lakehouse for fast, scalable vector search and data management.
plastic-labs/honcho
An open-source memory library and managed service for building stateful AI agents that learn and adapt over time.
genkit-ai/genkit
A Google-built open-source framework simplifying the development and deployment of full-stack AI applications across JavaScript, Go, and Python.
infiniflow/infinity
An AI-native database designed for LLM applications, offering fast hybrid search across dense vector, sparse vector, tensor, and full-text data.
databendlabs/databend
A unified, open-source enterprise data warehouse built in Rust, offering analytics, vector search, and full-text search, with agent-ready capabilities for AI workloads.
volcengine/MineContext
MineContext is an open-source, proactive context-aware AI partner that enhances productivity by understanding your digital world and delivering timely insights.
zilliztech/deep-searcher
DeepSearcher is an open-source platform that leverages LLMs and vector databases to enable deep research, intelligent Q&A, and comprehensive reporting on private enterprise data.
towhee-io/towhee
A cutting-edge framework for building fast and simple neural data processing pipelines, especially for unstructured multi-modal data using LLMs.
docarray/docarray
A Python library for representing, transmitting, storing, and retrieving multimodal data, designed for AI applications.
decodingai-magazine/llm-twin-course
A free, hands-on course to build a production-ready LLM & RAG system, including a personalized AI replica, applying LLMOps best practices.
AI-Hypercomputer/maxtext
A high-performance, scalable JAX-based open-source library for training large language models on Google Cloud TPUs and GPUs.
oumi-ai/oumi
An end-to-end platform for fine-tuning, evaluating, and deploying open-source Large Language Models (LLMs) and Vision Language Models (VLMs).
dstackai/dstack
A vendor-agnostic unified control plane for GPU provisioning and orchestration across clouds, Kubernetes, and on-prem for AI/ML workloads.
Nerogar/OneTrainer
A comprehensive, one-stop solution for training various diffusion models with advanced features and a user-friendly interface.
roboflow/maestro
A streamlined tool to accelerate the fine-tuning of popular multimodal models like Florence-2, PaliGemma 2, and Qwen2.5-VL.
bespokelabsai/curator
A Python library for generating and curating high-quality synthetic data for AI model training and structured data extraction.
beam-cloud/beta9
An ultrafast, open-source Pythonic runtime for deploying and scaling serverless GPU inference, sandboxes, and background jobs with zero infrastructure overhead.
FunAudioLLM/CosyVoice
CosyVoice is an advanced multi-lingual large language model-based text-to-speech system offering state-of-the-art voice generation, cloning, and full-stack deployment capabilities.
stochasticai/xTuring
xTuring simplifies the process of fine-tuning and deploying open-source Large Language Models (LLMs) on private data, ensuring privacy, efficiency, and scalability.
adobe-research/custom-diffusion
Enables fast and efficient multi-concept customization of text-to-image diffusion models like Stable Diffusion using a few images.
tianrun-chen/SAM-Adapter-PyTorch
A PyTorch-based framework to adapt Meta AI's Segment Anything Model (SAM) for improved performance on challenging downstream computer vision tasks using adapters and prompts.
ashishpatel26/LLM-Finetuning
A collection of guides and code for efficiently fine-tuning large language models using PEFT (LoRA) and Hugging Face transformers.
ModelCloud/GPTQModel
A toolkit for quantizing (compressing) Large Language Models (LLMs) with hardware acceleration across various GPUs and CPUs, integrating with popular inference frameworks.
lxe/simple-llm-finetuner
A beginner-friendly UI for fine-tuning large language models (LLMs) using the LoRA method on commodity NVIDIA GPUs.
mymusise/ChatGLM-Tuning
A cost-effective solution for fine-tuning ChatGLM-6B with LoRA, enabling personalized large language models.
markqvist/Reticulum
A cryptography-based networking stack enabling the creation of resilient, decentralized, and censorship-resistant networks over diverse hardware, independent of traditional IP infrastructure.
datawhalechina/self-llm
A comprehensive Linux-based guide for beginners to quickly fine-tune and deploy open-source LLMs and MLLMs, tailored for Chinese learners.
markqvist/NomadNet
Nomad Network enables off-grid, private, and resilient mesh communication with strong encryption, operating independently of the public internet.
adapter-hub/adapters
A unified library extending HuggingFace Transformers for parameter-efficient and modular transfer learning in NLP.
lyogavin/airllm
Optimizes large language model inference to run 70B models on a single 4GB GPU without quantization, enabling efficient deployment on resource-constrained hardware.
labmlai/annotated_deep_learning_paper_implementations
A comprehensive collection of PyTorch implementations for over 60 deep learning papers, featuring side-by-side annotated notes for enhanced understanding.
Akegarasu/lora-scripts
A comprehensive GUI and script collection for training LoRA and DreamBooth models for Stable Diffusion, built upon kohya-ss's sd-scripts.
ymcui/Chinese-LLaMA-Alpaca
An open-source project providing Chinese LLaMA and instruction-tuned Alpaca large language models, optimized for Chinese NLP and local deployment on CPU/GPU.
microsoft/LoRA
A Python library implementing LoRA (Low-Rank Adaptation) to efficiently fine-tune large language models by significantly reducing trainable parameters and storage requirements.
wenge-research/YAYI
YaYi is an open-source Chinese Large Language Model, built on LLaMA 2 & BLOOM, designed for secure, reliable, and domain-specific applications through extensive instruction tuning.
natolambert/rlhf-book
A comprehensive open-source textbook and code repository dedicated to Reinforcement Learning from Human Feedback (RLHF) and post-training language models.
argilla-io/distilabel
Distilabel is a framework for generating synthetic data and AI feedback, enabling engineers to build fast, reliable, and scalable AI pipelines based on verified research.
PKU-Alignment/align-anything
A modular framework for aligning any-modality large models with human intentions and values using various fine-tuning and reinforcement learning methods.
InternLM/InternLM
A series of high-performance, cost-effective open-source large language models (LLMs) designed for general-purpose usage and advanced reasoning.
zai-org/ImageReward
A human preference reward model for evaluating and improving text-to-image generation models.
tatsu-lab/alpaca_eval
An automatic, fast, and cost-effective evaluation framework for instruction-following language models, highly correlated with human judgments.
ymcui/Chinese-LLaMA-Alpaca-2
An open-source project providing Chinese LLaMA-2 and Alpaca-2 large language models with expanded Chinese vocabulary, enhanced capabilities, and support for ultra-long contexts up to 64K.
RLHFlow/RLHF-Reward-Modeling
A comprehensive collection of recipes and code for training various reward models crucial for Reinforcement Learning from Human Feedback (RLHF) in large language models.
THUDM/WebGLM
An efficient and cost-effective web-enhanced question answering system that integrates web search and retrieval with a large language model, optimized with human preferences.
Docta-ai/docta
Docta is an advanced data-centric AI platform that detects and rectifies issues in various data types to improve model performance.
OpenLMLab/MOSS-RLHF
An open-source framework providing code, models, and insights for stable Reinforcement Learning from Human Feedback (RLHF) training in Large Language Models, focusing on the PPO algorithm and reward modeling.
IBM/mcp-context-forge
A unified open-source AI gateway and proxy for federating MCP, A2A, and REST/gRPC APIs, offering centralized governance, discovery, and observability for AI clients and agents.
dottxt-ai/outlines
Outlines is a Python library that guarantees structured outputs from Large Language Models (LLMs) during generation, eliminating the need for post-processing and ensuring data validity.
mufeedvh/code2prompt
A powerful CLI tool and ecosystem to convert entire codebases into structured, token-counted prompts for Large Language Models.
microsoft/promptflow
A comprehensive development suite for building, testing, evaluating, deploying, and monitoring high-quality LLM-based AI applications.
alirezarezvani/claude-skills
A comprehensive open-source library of modular skills and plugins to enhance AI coding agents with domain-specific expertise.
AI4Finance-Foundation/FinGPT
FinGPT democratizes access to large language models tailored for finance, offering cost-effective and rapidly adaptable solutions to overcome the limitations of proprietary financial AI.
AI4Finance-Foundation/FinRobot
An open-source AI agent platform leveraging large language models for automated financial analysis, investment research, and algorithmic trading.
algorithmicsuperintelligence/optillm
OptiLLM is an OpenAI API-compatible inference proxy that uses 20+ state-of-the-art techniques to significantly boost LLM accuracy and performance on reasoning tasks without requiring any training.
huggingface/diffusers
A modular PyTorch library for state-of-the-art diffusion models, enabling easy inference and training for image, video, and audio generation.
AUTOMATIC1111/stable-diffusion-webui
A comprehensive web interface for Stable Diffusion, enabling users to generate and manipulate images with advanced AI features.
carson-katri/dream-textures
Integrates Stable Diffusion directly into Blender for seamless AI-powered texture generation, concept art creation, and image manipulation within 3D workflows.
ashawkey/stable-dreamfusion
A PyTorch implementation for generating 3D models from text or images, leveraging NeRF and diffusion models like Stable Diffusion.
SamurAIGPT/Generative-Media-Skills
Provides a multimodal toolset for AI agents to generate, edit, and display professional-grade images, videos, and audio using a CLI-powered architecture.
HisMax/RedInk
RedInk is an AI-powered tool that streamlines Xiaohongshu post creation, generating complete image and text content from a single sentence input.
SamurAIGPT/AI-Youtube-Shorts-Generator
An AI-powered Python tool that automatically generates engaging YouTube Shorts from long-form videos by identifying viral-worthy moments and vertically cropping them.
Hunyuan-PromptEnhancer/PromptEnhancer
A prompt rewriting tool that refines user prompts into clearer, structured versions to enhance the quality of text-to-image generation and image-to-image editing.
kuprel/min-dalle
A fast, minimal PyTorch port of DALL·E Mini for efficient text-to-image generation.
lucidrains/DALLE-pytorch
An open-source PyTorch implementation and replication of OpenAI's DALL-E, a text-to-image transformer, including CLIP for generation ranking.
XavierXiao/Dreambooth-Stable-Diffusion
Implements Google's DreamBooth technique on Stable Diffusion, enabling users to fine-tune a text-to-image model with a few custom examples for personalized image generation.
lucidrains/deep-daze
A command-line tool for generating images from text descriptions using OpenAI's CLIP together with a SIREN neural network.
NVIDIA-NeMo/NeMo
A scalable generative AI framework for researchers and developers focused on Large Language Models, Multimodal, and Speech AI (ASR, TTS).
OpenBMB/VoxCPM
VoxCPM2 is a tokenizer-free, 2B-parameter Text-to-Speech system supporting 30 languages, creative voice design, and controllable voice cloning with 48kHz studio-quality audio output.
GetStream/Vision-Agents
Build low-latency, multi-modal AI agents that process real-time video and audio using various LLMs and vision models.
snakers4/silero-models
A collection of pre-trained, end-to-end text-to-speech models designed for simplicity, speed, and natural-sounding speech across multiple languages.
PaddlePaddle/PaddleSpeech
An easy-to-use open-source toolkit built on PaddlePaddle, offering state-of-the-art models for diverse speech and audio tasks like ASR, TTS, translation, and speaker verification.
IAHispano/Applio
A user-friendly, high-quality AI-powered tool for transforming voices with a focus on performance and customization.
fishaudio/Bert-VITS2
An open-source text-to-speech model that combines the VITS2 backbone with multilingual BERT for high-quality, multi-language speech synthesis.
2noise/ChatTTS
A generative speech model optimized for natural, expressive dialogue in LLM assistants, featuring fine-grained prosodic control.
rsxdalv/TTS-WebUI
A unified Gradio and React web interface integrating a vast collection of open-source Text-to-Speech, audio generation, and voice conversion AI models.
rany2/edge-tts
Access Microsoft Edge's online text-to-speech service from Python without needing Edge, Windows, or an API key.
CorentinJ/Real-Time-Voice-Cloning
A deep learning framework for real-time voice cloning and text-to-speech synthesis from short audio samples.
denizsafak/abogen
Generate high-quality audiobooks and voiceovers from various text formats with synchronized captions.
babysor/MockingBird
A real-time voice cloning toolkit that allows users to replicate a voice in 5 seconds and generate arbitrary speech.
santinic/audiblez
A Python-based tool to convert e-books (EPUB) into high-quality M4B audiobooks using advanced text-to-speech models.
RVC-Boss/GPT-SoVITS
A powerful web-based tool for few-shot voice cloning and text-to-speech, enabling high-quality voice generation from minimal audio data.
Eventual-Inc/Daft
A high-performance data engine for AI and multimodal workloads, processing diverse data types at scale with Python and Rust.
vortex-data/vortex
Vortex is an extensible open-source columnar file format and toolkit designed for high-performance data processing and storage, especially on object storage.
rerun-io/rerun
An open source SDK for logging, storing, querying, and visualizing multimodal and multi-rate data, especially for robotics and AI.
embeddings-benchmark/mteb
MTEB is a comprehensive benchmark and evaluation framework designed to assess the performance of text embedding models and retrieval systems across a wide range of tasks.
NVIDIA-NeMo/DataDesigner
A flexible framework by NVIDIA NeMo for generating high-quality synthetic datasets with diverse distributions, meaningful correlations, and robust validation.
pixeltable/pixeltable
A declarative, transactional Python library for building multimodal AI applications with incremental data storage, transformation, indexing, and orchestration.
yzhao062/pyod
A comprehensive Python library for multi-modal anomaly detection, featuring 60+ algorithms and agentic AI capabilities for scalable, expert-level investigations.
EvolvingLMMs-Lab/lmms-eval
A unified, reproducible, and efficient evaluation toolkit for large multimodal models across text, image, video, and audio tasks.
Blaizzy/mlx-audio
An efficient audio processing library built on Apple's MLX framework, enabling fast text-to-speech, speech-to-text, and speech-to-speech capabilities on Apple Silicon devices.
kyegomez/BitNet
A PyTorch implementation of BitNet, enabling highly efficient 1-bit transformers for large language models.
fikrikarim/parlor
Parlor is an on-device, real-time multimodal AI that enables natural voice and vision conversations, running entirely on your local machine.
morphik-org/morphik-core
A comprehensive AI-native toolset for accurate document search and storage, designed to integrate complex context from visually rich and multimodal data into AI applications.
rom1504/clip-retrieval
A comprehensive toolkit for computing CLIP embeddings and building scalable semantic search and retrieval systems for multimodal data.
emcf/thepipe
A Python library for extracting clean markdown, multimodal media, and structured data from complex documents using vision-language models.
collabora/WhisperLive
A highly optimized, nearly-live speech-to-text application leveraging OpenAI's Whisper model for real-time audio transcription.
intel/auto-round
AutoRound is an advanced quantization toolkit for Large Language Models (LLMs) and Vision-Language Models (VLMs), enabling high-accuracy, ultra-low-bit inference across diverse hardware.
edwko/OuteTTS
A versatile interface for OuteTTS models, providing flexible text-to-speech generation capabilities across various AI inference backends and hardware platforms.
datawhalechina/handy-ollama
A comprehensive tutorial guiding users to deploy large language models locally on CPU using Ollama, making LLM inference accessible without dedicated GPU resources.
xtekky/chatgpt-clone
A self-hosted, enhanced user interface for the ChatGPT API, offering a cleaner design and customizable features.
ramon-victor/freegpt-webui
A free web UI for GPT-3.5/4 models, requiring no API key and offering enhanced jailbreaks.
GaiZhenbiao/ChuanhuChatGPT
A feature-rich web GUI for ChatGPT and various LLMs, offering advanced functionalities like agents, file-based QA, web search, and fine-tuning with a refined user experience.
Marker-Inc-Korea/AutoRAG
An open-source framework that automates the evaluation and optimization of Retrieval-Augmented Generation (RAG) pipelines using AutoML-style automation for specific datasets.
julep-ai/julep
An open-source, serverless platform for building and deploying complex, agent-based AI workflows with persistent memory and tool orchestration.
lastmile-ai/aiconfig
A config-based framework for building, managing, and iterating on generative AI applications by separating AI behavior from application code.
ZHangZHengEric/Sage
A production-ready multi-agent system framework designed for executing complex tasks, automating workflows, and integrating with various communication channels.
PipedreamHQ/pipedream
Pipedream is a free, event-driven integration platform for developers to connect APIs and build powerful automations with pre-built components or custom code.
PrefectHQ/prefect
Prefect is a Python-based workflow orchestration framework designed to build resilient, dynamic data pipelines that automate processes and recover from unexpected changes.
galaxyproject/galaxy
A web-based platform for accessible, reproducible, and transparent data-intensive science, especially in bioinformatics.
astronomer/astronomer-cosmos
Integrate dbt Core projects seamlessly into Apache Airflow DAGs and Task Groups, enabling robust data transformation orchestration.
BasedHardware/omi
An open-source AI assistant that captures screen and conversations, transcribes in real-time, generates summaries, and offers an AI chat with comprehensive memory.
sentient-agi/OML-1.0-Fingerprinting
A framework for embedding secret cryptographic fingerprints into Large Language Models (LLMs) via fine-tuning to verify ownership and prevent unauthorized use.
dbiir/UER-py
An open-source PyTorch-based framework for NLP pre-training and fine-tuning, offering modularity, reproducibility, and a comprehensive model zoo for various downstream tasks.
liucongg/ChatGLM-Finetuning
A toolkit for fine-tuning ChatGLM series models (ChatGLM-6B, ChatGLM2-6B, ChatGLM3-6B) using methods such as Freeze, LoRA, P-tuning, and full-parameter training for downstream NLP tasks.
hegelai/prompttools
An open-source, self-hostable toolkit for testing, experimenting with, and evaluating prompts, large language models (LLMs), and vector databases.
canopyai/Orpheus-TTS
Orpheus TTS is a state-of-the-art open-source text-to-speech system built on a Llama-3b backbone, aiming to generate human-sounding, emotionally rich speech with low latency.
jianchang512/ChatTTS-ui
Provides a local web interface and API for ChatTTS to synthesize text into speech, supporting mixed Chinese, English, and numbers.
jianchang512/clone-voice
A user-friendly, open-source tool that clones any human voice to generate speech from text or convert existing audio, featuring a web interface and multi-language support.
myshell-ai/OpenVoice
An open-source AI model for instant, accurate, and flexible voice cloning, supporting cross-lingual synthesis and granular style control.
MoonInTheRiver/DiffSinger
The official PyTorch implementation of DiffSinger, a singing voice synthesis (SVS) and text-to-speech (TTS) system that uses a shallow diffusion mechanism for high-quality audio generation.
myshell-ai/MeloTTS
A high-quality, multi-lingual text-to-speech library supporting various languages and accents, optimized for real-time CPU inference.
wzpan/wukong-robot
A flexible, open-source Chinese voice assistant/smart speaker project with ChatGPT integration and brain-computer interface support.
coqui-ai/TTS
A deep learning toolkit for advanced, multi-language Text-to-Speech generation and voice cloning, suitable for research and production.
netease-youdao/EmotiVoice
EmotiVoice is an open-source, multi-voice, and prompt-controlled text-to-speech engine supporting English and Chinese with emotional synthesis capabilities.
yl4579/StyleTTS2
StyleTTS 2 is a cutting-edge text-to-speech model achieving human-level speech synthesis through style diffusion and adversarial training with large speech language models.
metavoiceio/metavoice-src
MetaVoice-1B is an open-source 1.2B-parameter foundation model for highly expressive, human-like text-to-speech synthesis with advanced voice cloning capabilities.
Plachtaa/VALL-E-X
An open-source implementation of Microsoft's VALL-E X, enabling zero-shot multilingual text-to-speech synthesis and voice cloning with emotion control.
jaywalnut310/vits
VITS is an end-to-end text-to-speech model that generates highly natural-sounding audio with diverse rhythms, outperforming traditional two-stage TTS systems.
Stability-AI/stability-sdk
A Python SDK for interacting with Stability AI's APIs, enabling programmatic access to AI models like Stable Diffusion for image generation and upscaling.
kyegomez/tree-of-thoughts
A plug-and-play Python library implementing the Tree of Thoughts algorithm to significantly enhance Large Language Model reasoning capabilities.
PySpur-Dev/pyspur
PySpur is a visual playground designed to accelerate the iteration, debugging, and deployment of AI agents, helping engineers overcome common challenges like prompt hell and workflow blindspots.
autodistill/autodistill
Autodistill automates the process of training small, fast supervised models from unlabeled images by leveraging large foundation models, eliminating the need for manual data labeling.
om-ai-lab/OmAgent
A Python library simplifying the development of multimodal language agents by abstracting complex engineering and providing native multimodal support.
nonebot/nonebot2
A powerful asynchronous multi-platform chatbot framework written in Python, designed for building highly customizable and extensible bots.
xtekky/gpt4free
GPT4Free (g4f) is a community-driven project that aggregates various free and accessible LLM providers, offering a unified API, clients, and GUI for flexible AI model interaction.
2FastLabs/agent-squad
A flexible, open-source framework for orchestrating multiple AI agents to manage complex conversations and tasks efficiently.
python-telegram-bot/python-telegram-bot
A comprehensive, asynchronous Python wrapper for the Telegram Bot API, simplifying bot creation.
HanaokaYuzu/Gemini-API
A reverse-engineered asynchronous Python API for the Google Gemini web app, enabling programmatic interaction with its advanced AI features.
Cog-Creators/Red-DiscordBot
A highly customizable, self-hosted Discord bot offering a wide range of features through a modular plugin system.
pdm-project/pdm
A modern and fast Python package and dependency manager that adheres to the latest PEP standards, offering flexible environment management and a powerful plugin system.
blackholll/loonflow
An intelligent, visual, and extensible open-source platform for enterprise-grade process automation and workflow management.
The-Pocket/PocketFlow
A minimalist, 100-line LLM framework designed for building AI agents and workflows with zero bloat and maximum expressiveness.
NexaAI/nexa-sdk
A high-performance SDK enabling day-0 local inference of frontier LLMs and VLMs across diverse hardware (NPU, GPU, CPU) and platforms (PC, mobile, IoT) with minimal energy.
LykosAI/StabilityMatrix
A multi-platform package manager and inference UI designed to simplify the installation, management, and updating of various Stable Diffusion web UIs and their associated components.
pydn/ComfyUI-to-Python-Extension
A powerful tool that translates ComfyUI visual workflows into executable Python code for automation and repeatable generation.
KohakuBlueleaf/LyCORIS
LyCORIS is a library implementing various parameter-efficient fine-tuning (PEFT) algorithms for Stable Diffusion, extending beyond conventional LoRA methods to enhance model adaptation.
s0md3v/sd-webui-roop
An extension for the AUTOMATIC1111 Stable Diffusion web UI that enables face replacement in generated images, now archived due to evolving ethical considerations regarding generative media.
nateraw/stable-diffusion-videos
Create dynamic and visually captivating videos by smoothly morphing between different text prompts using Stable Diffusion.
Comfy-Org/desktop
A packaged desktop application for Windows and macOS that bundles ComfyUI, simplifying its installation and management for local AI workflow generation.
pythongosssss/ComfyUI-Custom-Scripts
A collection of custom scripts and UI enhancements designed to improve the user experience and workflow efficiency within ComfyUI.
MrForExample/ComfyUI-3D-Pack
An extensive node suite that integrates advanced 3D input processing and asset generation into ComfyUI using cutting-edge AI algorithms and models.
AIGODLIKE/AIGODLIKE-ComfyUI-Translation
A ComfyUI plugin that provided multilingual translation for its user interface and nodes, now being deprecated in favor of ComfyUI's native localization.
eigent-ai/eigent
Eigent is an open-source desktop application that empowers users to build, manage, and deploy custom AI workforces for automating complex workflows, offering a local and free alternative to commercial AI coworking platforms.
pydantic/monty
A minimal, secure Python interpreter written in Rust, designed for fast and safe execution of LLM-generated code within AI agents, bypassing traditional container overhead.
gristlabs/grist-core
Grist is a modern open-source relational spreadsheet that combines the flexibility of a spreadsheet with the robustness of a database, featuring Python formulas and a portable SQLite format.
openrecall/openrecall
OpenRecall is an open-source, privacy-first cross-platform alternative to proprietary digital memory solutions, allowing users to capture, search, and revisit their digital history locally.
GaParmar/img2img-turbo
A one-step image-to-image translation framework leveraging Stable Diffusion Turbo for rapid generation across various tasks like sketch-to-image and day-to-night transformations.
Fanghua-Yu/SUPIR
SUPIR is an AI-driven project focused on developing practical algorithms for photo-realistic image restoration and upscaling in real-world scenarios.
Stability-AI/StableSwarmUI
A modular, high-performance web user interface for Stable Diffusion, emphasizing accessible powertools and extensibility for both beginners and advanced users.
stitionai/devika
An open-source AI agent that acts as a software engineer, capable of understanding instructions, planning, researching, and writing code to build software.
dora-rs/dora
DORA is a high-performance, 100% Rust framework for building real-time, low-latency, and distributed AI-based robotic applications using a dataflow-oriented architecture.
3b1b/manim
A Python-based animation engine for creating precise, explanatory mathematical videos.
flet-dev/flet
Flet is a Python framework that empowers developers to build real-time web, mobile, and desktop applications from a single codebase, eliminating the need for prior frontend experience.
ArchiveBox/ArchiveBox
An open-source, self-hosted web archiving tool designed to preserve web content from various sources in multiple durable formats for long-term access.
mustbeperfect/definitive-opensource
A meticulously curated, community-driven directory of high-quality, consumer-facing open-source applications across various platforms.
wilsonfreitas/awesome-quant
A meticulously curated list of open-source libraries, packages, and resources for quantitative finance professionals.
lfnovo/open-notebook
An open-source, privacy-focused alternative to Google's NotebookLM, offering flexible AI model choices, multi-modal content organization, and advanced features like multi-speaker podcast generation.
voicepaw/so-vits-svc-fork
A fork of so-vits-svc offering real-time singing voice conversion with an enhanced interface, improved accuracy, and simplified installation.
Kludex/uvicorn
Uvicorn is a high-performance ASGI web server for Python, enabling asynchronous web applications with support for HTTP/1.1 and WebSockets.
Usagi-org/ai-goofish-monitor
An AI-powered, Playwright-based system for real-time and scheduled monitoring and intelligent analysis of Xianyu (Goofish) e-commerce listings, featuring a comprehensive web management UI.
alibaba/zvec
Zvec is a lightweight, lightning-fast, in-process vector database built on Alibaba's Proxima, enabling scalable and low-latency similarity search directly within applications.
FireRedTeam/FireRed-OpenStoryline
FireRed-OpenStoryline is an AI video editing agent that transforms manual editing into intention-driven directing through natural language interaction and LLM-powered planning.
crosspoint-reader/crosspoint-reader
Open-source firmware for the Xteink X4 e-paper display reader, offering an enhanced EPUB reading experience and replacing the official closed-source software.
9001/copyparty
A portable, all-in-one file server supporting multiple protocols (HTTP, WebDAV, SFTP, FTP) with accelerated resumable uploads, media indexing, and sharing capabilities.
kyegomez/OpenMythos
An open-source, theoretical reconstruction of the Claude Mythos LLM architecture, featuring a Recurrent-Depth Transformer and sparse Mixture of Experts for advanced reasoning.
BIT-DataLab/Edit-Banana
Edit Banana transforms static, uneditable content like images of diagrams into fully manipulatable and editable assets using advanced AI.
tslearn-team/tslearn
A comprehensive Python toolkit for machine learning tasks specifically tailored for time series analysis.
awslabs/agentcore-samples
Practical samples and tutorials for Amazon Bedrock AgentCore, a framework-agnostic and model-agnostic infrastructure for securely deploying and operating advanced AI agents at scale.
ipython/ipython
IPython is a powerful interactive command shell for Python, offering enhanced introspection, rich media support, and advanced features for productive interactive computing.
ashleve/lightning-hydra-template
A user-friendly template integrating PyTorch Lightning and Hydra to streamline deep learning experimentation and development.
google/magika
Magika is a fast and highly accurate AI-powered tool for identifying file content types, crucial for security and content routing.
llm-workflow-engine/llm-workflow-engine
A powerful command-line interface and workflow manager designed to streamline interaction with various Large Language Models, including ChatGPT and GPT-4.
eternnoir/pyTelegramBotAPI
A simple yet extensible Python library for interacting with the Telegram Bot API, supporting both synchronous and asynchronous operations.
GeeeekExplorer/nano-vllm
A lightweight and optimized Python library for fast offline large language model inference, offering performance comparable to or better than vLLM with a more readable codebase.
the-open-agent/openagent
An open-source enterprise-level AI Cloud OS providing a knowledge base and management platform for various large language models and AI agents.
SWE-agent/SWE-agent
An AI agent that autonomously fixes GitHub issues, finds cybersecurity vulnerabilities, and performs coding tasks using large language models.
pycaret/pycaret
An open-source, low-code AutoML platform for Python, offering a sklearn-native engine and a React-based control plane for end-to-end machine learning workflows.
stepfun-ai/gelab-zero
GELab-Zero is the first fully open-source GUI Agent solution, offering a plug-and-play infrastructure and a local 4B model for privacy-controlled, cloud-independent mobile AI agent development.
SylphAI-Inc/AdalFlow
AdalFlow is a PyTorch-like open-source library designed to build and automatically optimize large language model (LLM) applications, from chatbots and RAG systems to complex AI agents.
ollama/ollama-python
A Python library providing the easiest way to integrate Python 3.8+ projects with Ollama for local and cloud LLM interactions.
microsoft/agent-framework
A comprehensive, multi-language framework for building, orchestrating, and deploying AI agents and complex multi-agent workflows.
errbotio/errbot
Errbot is an open-source, Python-based chatbot framework designed to integrate with various chat services, enabling interactive scripting and automation directly from chatrooms.
qualcomm/nexa-sdk
A high-performance SDK enabling day-0 local inference of frontier LLMs and VLMs across diverse hardware (NPU, GPU, CPU) and platforms (PC, mobile, IoT) with minimal energy.
Lightning-AI/litgpt
A high-performance, no-abstraction toolkit providing recipes for pretraining, finetuning, and deploying over 20 large language models at scale.
deepchecks/deepchecks
An open-source solution for continuous validation of ML models and data, ensuring quality from research to production.
DataTalksClub/mlops-zoomcamp
A free 9-week online course from DataTalks.Club, designed to teach the fundamentals of MLOps, from experimentation to deployment and monitoring.
IDEA-CCNL/Fengshenbang-LM
Fengshenbang-LM is an open-source large model system by IDEA Research Institute, serving as infrastructure for Chinese AIGC and cognitive intelligence.
jina-ai/discoart
A library for creating Disco Diffusion artworks with a single line of Python code, offering a professional API and robust integration capabilities.
ucbepic/docetl
DocETL is an agentic LLM-powered framework designed for building and executing complex data processing and ETL pipelines, especially for documents.
firerpa/lamda
A powerful, AI-ready Android RPA agent framework for next-generation mobile automation, offering robust on-device services and extensive APIs.
riffusion/riffusion-hobby
A library for real-time music and audio generation leveraging Stable Diffusion, offering CLI, interactive app, and API capabilities.
souzatharsis/podcastfy
An open-source Python package that transforms multi-modal content into captivating multilingual audio conversations using GenAI, serving as a programmatic alternative to tools like NotebookLM.
microsoft/TypeChat
A library that simplifies building robust natural language interfaces by leveraging types for schema-driven LLM interactions, replacing complex prompt engineering.
CodeGraphContext/CodeGraphContext
A tool that transforms code repositories into a queryable graph database, providing deep contextual understanding for AI assistants and developers.
plotly/dash
A Python framework for building interactive data science web applications and dashboards without requiring JavaScript.
Cinnamon/kotaemon
An open-source, customizable RAG UI and framework for secure, multi-modal document Q&A with various LLM support.
CloakHQ/CloakBrowser
CloakBrowser is a stealth Chromium browser engineered with C++ source-level fingerprint patches to bypass advanced bot detection, serving as a drop-in replacement for Playwright and Puppeteer.
cocoindex-io/cocoindex
CocoIndex is an incremental data indexing framework that provides continuously fresh context from diverse enterprise data sources for AI agents and LLM applications.
RhetTbull/osxphotos
A Python application and library to programmatically interact with Apple Photos on macOS, enabling querying of metadata and flexible photo export.
GoogleCloudPlatform/professional-services
A repository of common solutions and tools developed by Google Cloud's Professional Services team to address various challenges on Google Cloud Platform.
Fosowl/agenticSeek
A 100% local and private AI assistant that autonomously browses the web, writes code, and plans tasks, eliminating cloud dependencies and recurring costs.
brokermr810/QuantDinger
A self-hosted, AI-powered quantitative trading platform offering a complete stack for market research, strategy development, backtesting, and live execution across crypto, stocks, and forex.
mindsdb/minds-platform
An open-source AI platform providing foundational capabilities for automation and semantic search, enabling developers to build, control, and deploy production-ready AI systems anywhere.
supertone-inc/supertonic
Supertonic is a lightning-fast, on-device, multilingual text-to-speech system offering high-quality audio and privacy without cloud dependencies.