Ecosystem & Stack: rocm
Multimodal AI Inference and Serving Framework
python
4.4k
vllm-project/vllm-omni
vLLM-Omni is an efficient, flexible, and easy-to-use framework extending vLLM to serve omni-modality models (text, image, video, audio) with high throughput and an OpenAI-compatible API.
Local Web Interface for Text-to-Speech
Python
7.5k
jianchang512/ChatTTS-ui
Provides a local web interface and API for the ChatTTS model, enabling text-to-speech synthesis with support for mixed languages and numbers.
Replaces:
Details AI Generative WebUI
stable diffusion
7.1k
vladmandic/sdnext
An all-in-one open-source WebUI for AI generative image and video creation, captioning, and processing, built on Stable Diffusion.