Ecosystem & Stack: rocm

Multimodal AI Inference and Serving Framework

4.4k

vllm-project/vllm-omni

vLLM-Omni is an efficient, flexible, and easy-to-use framework extending vLLM to serve omni-modality models (text, image, video, audio) with high throughput and an OpenAI-compatible API.

multimodal-ai model-serving inference-framework

Details

Local Web Interface for Text-to-Speech

Python

7.5k

jianchang512/ChatTTS-ui

Provides a local web interface and API for the ChatTTS model, enabling text-to-speech synthesis with support for mixed languages and numbers.

chattts text-to-speech web-ui

Replaces:

Cloud Text-to-Speech Services

Details

AI Generative WebUI

stable diffusion

7.1k

vladmandic/sdnext

An all-in-one open-source WebUI for AI generative image and video creation, captioning, and processing, built on Stable Diffusion.

ai-art-generation stable-diffusion web-ui

Details