Tags: #generative-ai
googleapis/genai-toolbox
An open-source MCP server connecting AI agents, IDEs, and applications to enterprise databases for data interaction, schema exploration, and code generation.
pydantic/pydantic-ai
A Python agent framework built by the Pydantic team to quickly and confidently develop production-grade Generative AI applications with a focus on type safety and observability.
GoogleCloudPlatform/generative-ai
Provides sample code, notebooks, and resources for building and managing generative AI workflows on Google Cloud using Vertex AI and Gemini.
Comfy-Org/ComfyUI
A powerful and modular visual interface for designing and executing advanced Stable Diffusion and other generative AI model pipelines.
microsoft/generative-ai-for-beginners
A comprehensive 21-lesson curriculum from Microsoft to help beginners learn and build Generative AI applications.
google-gemini/genai-processors
A lightweight Python library for building modular, asynchronous, and composable AI pipelines, enabling efficient, parallel, and multimodal content processing for Generative AI applications.
NirDiamant/RAG_Techniques
A comprehensive repository showcasing advanced Retrieval-Augmented Generation (RAG) techniques through detailed, practical notebook tutorials.
tmc/langchaingo
A Go framework for building applications with Large Language Models (LLMs) through composability.
kserve/kserve
KServe is a standardized, scalable, and multi-framework platform for deploying and serving both generative and predictive AI models on Kubernetes.
steven2358/awesome-generative-ai
A meticulously curated list of modern Generative Artificial Intelligence projects and services, offering a structured overview of the rapidly evolving AI landscape.
openvinotoolkit/openvino
OpenVINO is an open-source toolkit designed to optimize and deploy deep learning models for efficient AI inference across diverse hardware platforms, from edge to cloud.
calesthio/OpenMontage
An open-source, agentic AI system that transforms plain language descriptions into complete videos, handling research, scripting, asset generation, editing, and composition.
bghira/SimpleTuner
A user-friendly, versatile fine-tuning kit for image, video, and audio diffusion models, emphasizing simplicity and cutting-edge features.
adobe-research/custom-diffusion
Enables fast and efficient multi-concept customization of text-to-image diffusion models like Stable Diffusion using a few images.
InternLM/InternLM
A series of high-performance, cost-efficient large language models (LLMs) designed for general-purpose usage and advanced reasoning.
huggingface/diffusers
A modular PyTorch library for state-of-the-art diffusion models, enabling easy generation of images, audio, and more.
Stability-AI/StableStudio
A web-based open-source interface for creating and editing generative AI images, serving as the community version of DreamStudio.
SamurAIGPT/Generative-Media-Skills
A multimodal toolset enabling AI agents to generate, edit, and display professional-grade images, videos, and audio using a CLI-powered architecture.
filipecalegario/awesome-generative-ai
A comprehensive and curated list of Generative AI tools, models, research, and educational resources, covering various modalities and applications.
YangLing0818/Diffusion-Models-Papers-Survey-Taxonomy
A comprehensive collection and categorization of diffusion model papers, accompanied by a detailed survey and taxonomy.
lucidrains/imagen-pytorch
A PyTorch implementation of Google's Imagen, a state-of-the-art text-to-image neural network that surpasses DALL-E 2 in synthesis quality.
lucidrains/DALLE2-pytorch
A PyTorch implementation of OpenAI's DALL-E 2, enabling advanced text-to-image synthesis through a diffusion-based neural network architecture.
lucidrains/DALLE-pytorch
An open-source PyTorch implementation of OpenAI's DALL-E, a text-to-image transformer, including CLIP for generation ranking.
lucidrains/deep-daze
A simple command-line tool for generating artistic images from text descriptions using OpenAI's CLIP and SIREN, an implicit neural representation network.
NVIDIA-NeMo/NeMo
A scalable generative AI framework for building, customizing, and deploying Large Language Models, multimodal models, and speech AI models (ASR, TTS).
2noise/ChatTTS
A generative speech model optimized for natural and expressive daily dialogue, especially for LLM assistants.
genieincodebottle/generative-ai
A comprehensive repository offering roadmaps, projects, use cases, and interview preparation materials for mastering Generative AI.
lastmile-ai/aiconfig
A config-based framework for building, managing, and iterating on generative AI applications by separating AI behavior from application code.
alan-ai/alan-sdk-web
Alan AI SDK for Web enables developers to embed a self-coding, generative AI layer into web applications, automating feature creation and UI/logic generation in real time.
swyxio/ai-notes
A comprehensive knowledge base for software engineers to quickly grasp the latest developments in AI, especially generative AI and large language models.
Yutong-Zhou-cv/Awesome-Text-to-Image
A comprehensive curated list of resources, papers, datasets, and projects focused on text-to-image generation and manipulation.
NExT-GPT/NExT-GPT
The first end-to-end multimodal large language model (MM-LLM) that perceives input and generates output in arbitrary combinations (any-to-any) of text, image, video, and audio.
FurkanGozukara/Stable-Diffusion
A comprehensive repository offering expert-level tutorials, guides, and courses on various Generative AI technologies, primarily focusing on Stable Diffusion and its ecosystem.
rgthree/rgthree-comfy
Enhances ComfyUI workflows with intuitive custom nodes and quality-of-life improvements for a cleaner, easier, and faster experience.
Comfy-Org/desktop
A packaged desktop application for Windows and macOS that bundles ComfyUI, ComfyUI-Manager, and necessary dependencies for easy local AI workflow execution.
ZHO-ZHO-ZHO/ComfyUI-Workflows-ZHO
A comprehensive and curated collection of ComfyUI workflows for diverse generative AI tasks, simplifying complex AI art and video creation.
GaParmar/img2img-turbo
A one-step image-to-image translation framework leveraging Stable Diffusion Turbo for rapid generation across various tasks like sketch-to-image and day-to-night transformations.
LearnPrompt/LearnPrompt
A permanently free and open-source AIGC course platform covering prompt engineering, generative AI tools such as ChatGPT, Midjourney, and Stable Diffusion, and advanced topics such as LLM fine-tuning and AI agents.