AI Agent Toolset / Generative Media Library
3.2k 2026-05-01
SamurAIGPT/Generative-Media-Skills
Provides a multimodal toolset for AI agents to generate, edit, and display professional-grade images, videos, and audio using a CLI-powered architecture.
Core Features
Agent-native design with CLI scripts and structured JSON outputs.
Access to over 100 AI models including Midjourney v7, Kling 3.0, and Seedance 2.0.
Expert knowledge layer for professional cinematography, design, and branding.
Direct media display and local file support for seamless workflows.
MCP Server for exposing tools to Claude Desktop, Cursor, and other compatible agents.
Quick Start
npm install -g muapi-cliDetailed Introduction
This project offers a comprehensive, schema-driven architecture enabling AI agents like Claude Code, Cursor, and Gemini CLI to perform advanced multimodal media generation. It leverages the `muapi-cli` to provide high-performance tools for creating, editing, and displaying professional-grade images, videos, and audio. With an expert knowledge layer embedding domain-specific logic and access to over 100 cutting-edge AI models, it streamlines complex creative tasks into agent-native, CLI-powered workflows, significantly enhancing AI agents' creative capabilities.