Tags: #multi-modal-ai
AI Agent Development Framework
Python
7.7k
GetStream/Vision-Agents
A framework for building intelligent, low-latency multi-modal AI agents that can process real-time video and audio using various LLMs and vision models.
Unified AI Interface / Cross-platform AI Client
Vercel
6.7k
Dooy/chatgpt-web-midjourney-proxy
A unified web interface and cross-platform client for various AI services including ChatGPT, Midjourney, Suno, Luma, and more, offering a seamless multi-modal AI experience.
Replaces:
Details AI Automation Platform
2.1k
heshengtao/super-agent-party
An all-in-one, self-hosted AI companion platform enabling desktop automation, multi-agent chat, and versatile bot deployments.