Tags: #generative-ai
googleapis/genai-toolbox
An open-source MCP server connecting AI agents, IDEs, and applications to enterprise databases for data interaction, schema exploration, and code generation.
docling-project/docling
Docling simplifies document processing, parsing diverse formats including advanced PDF understanding, and provides seamless integrations with the generative AI ecosystem.
pydantic/pydantic-ai
A Python agent framework built by the Pydantic team, designed to simplify and accelerate the development of production-grade Generative AI applications with type safety and robust observability.
GoogleCloudPlatform/generative-ai
Provides sample code, notebooks, and resources for building and managing generative AI workflows on Google Cloud using Vertex AI and Gemini.
Comfy-Org/ComfyUI
A powerful and modular visual interface for designing and executing advanced Stable Diffusion and other generative AI pipelines.
microsoft/generative-ai-for-beginners
A comprehensive 21-lesson curriculum designed to introduce beginners to Generative AI and guide them through building AI applications.
google-gemini/genai-processors
A lightweight Python library for building modular, asynchronous, and composable AI pipelines, unifying content processing for Generative AI models and tools.
NirDiamant/GenAI_Agents
A comprehensive repository offering over 50 tutorials and implementations for building Generative AI agents, from basic conversational bots to complex multi-agent systems.
NirDiamant/RAG_Techniques
A repository showcasing various advanced Retrieval-Augmented Generation (RAG) techniques through detailed notebook tutorials.
kserve/kserve
A standardized, scalable, multi-framework platform for deploying generative and predictive AI models on Kubernetes.
steven2358/awesome-generative-ai
A comprehensive, curated list of modern Generative Artificial Intelligence projects, services, and learning resources.
calesthio/OpenMontage
An open-source, agentic AI system that transforms plain language descriptions into complete video productions, handling research, scripting, asset generation, editing, and final composition.
bghira/SimpleTuner
A user-friendly, versatile fine-tuning kit for image, video, and audio diffusion models, emphasizing simplicity and cutting-edge features.
adobe-research/custom-diffusion
Enables fast and efficient multi-concept customization of text-to-image diffusion models like Stable Diffusion using a few images.
agentheroes/agentheroes
Generate, animate, and schedule AI characters and their content for social media.
InternLM/InternLM
A series of high-performance, cost-effective open-source large language models (LLMs) designed for general-purpose usage and advanced reasoning.
promptslab/Awesome-Prompt-Engineering
A comprehensive, hand-curated collection of resources for Prompt Engineering and Context Engineering, specifically for Large Language Models.
huggingface/diffusers
A modular PyTorch library for state-of-the-art diffusion models, enabling easy inference and training for image, video, and audio generation.
AUTOMATIC1111/stable-diffusion-webui
A comprehensive web interface for Stable Diffusion, enabling users to generate and manipulate images with advanced AI features.
hua1995116/awesome-ai-painting
A comprehensive collection of resources and tutorials for AI painting, covering platforms, usage guides, deployment, and industry news.
Stability-AI/StableStudio
An open-source web-based interface for generative AI, enabling users to create and edit AI-generated images with a flexible plugin system.
SamurAIGPT/Generative-Media-Skills
Provides a multimodal toolset for AI agents to generate, edit, and display professional-grade images, videos, and audio using a CLI-powered architecture.
filipecalegario/awesome-generative-ai
A comprehensive and curated list of Generative AI tools, models, artworks, and references, organized for easy navigation and discovery.
YangLing0818/Diffusion-Models-Papers-Survey-Taxonomy
A comprehensive and continuously updated collection and categorization of research papers on diffusion models.
lucidrains/imagen-pytorch
A PyTorch implementation of Google's Imagen, a state-of-the-art text-to-image neural network, enabling advanced generative AI capabilities.
lucidrains/DALLE2-pytorch
A PyTorch implementation of OpenAI's DALL-E 2, a state-of-the-art neural network for text-to-image synthesis.
lucidrains/DALLE-pytorch
An open-source PyTorch implementation and replication of OpenAI's DALL-E, a text-to-image transformer, including CLIP for generation ranking.
XavierXiao/Dreambooth-Stable-Diffusion
This project implements Google's Dreambooth technique on Stable Diffusion, enabling users to fine-tune a text-to-image model with a few custom examples for personalized image generation.
NVIDIA-NeMo/NeMo
A scalable generative AI framework for researchers and developers focused on Large Language Models, Multimodal, and Speech AI (ASR, TTS).
2noise/ChatTTS
A generative speech model optimized for natural, expressive dialogue in LLM assistants, featuring fine-grained prosodic control.
genieincodebottle/generative-ai
A comprehensive repository offering structured learning paths, practical projects, and career preparation resources for Generative AI and Machine Learning.
Dooy/chatgpt-web-midjourney-proxy
A unified web interface and multi-platform client for various AI services including ChatGPT, Midjourney, Suno, Luma, and more, offering a seamless multi-modal AI experience.
lastmile-ai/aiconfig
A config-based framework for building, managing, and iterating on generative AI applications by separating AI behavior from application code.
alan-ai/alan-sdk-web
Alan AI SDK for Web enables developers to embed a self-coding, generative AI layer into web applications, automating feature creation and UI/logic generation in real-time.
swyxio/ai-notes
A curated collection of notes and resources for software engineers to quickly get up to speed on new AI developments, focusing on generative AI and large language models.
Yutong-Zhou-cv/Awesome-Text-to-Image
A comprehensive curated list of resources, papers, datasets, and projects related to text-to-image generation and manipulation.
NExT-GPT/NExT-GPT
The first end-to-end multimodal large language model (MM-LLM) capable of perceiving and generating content in arbitrary combinations of text, image, video, and audio.
FurkanGozukara/Stable-Diffusion
A comprehensive repository offering expert-level tutorials, guides, and courses on Generative AI, focusing on Stable Diffusion, SDXL, LoRA, DreamBooth, and related technologies.
rgthree/rgthree-comfy
An essential collection of nodes and improvements designed to make ComfyUI workflows cleaner, easier, and faster for generative AI artists.
ZHO-ZHO-ZHO/ComfyUI-Workflows-ZHO
A comprehensive collection of ComfyUI workflows designed to streamline various AI image and video generation tasks.
GaParmar/img2img-turbo
A one-step image-to-image translation framework leveraging Stable Diffusion Turbo for rapid generation across various tasks like sketch-to-image and day-to-night transformations.
LearnPrompt/LearnPrompt
A permanently free and open-source AIGC course platform covering prompt engineering, generative AI tools like ChatGPT, Midjourney, Stable Diffusion, and advanced topics such as LLM fine-tuning and AI agents.
mahseema/awesome-ai-tools
A comprehensive, curated list of top Artificial Intelligence tools, covering various categories from generative AI to LLMs and specialized applications.
jina-ai/discoart
Create stunning Disco Diffusion artworks with a single line of Python code, offering a professional API and robust integration capabilities.
AbdBarho/stable-diffusion-webui-docker
Simplifies the local deployment and usage of Stable Diffusion with various user-friendly web interfaces via Docker.