Tags: #video-understanding
Multimodal AI Chatbot Framework
3.3k
OpenGVLab/Ask-Anything
An advanced multimodal AI chatbot framework that enables conversational interaction and deep understanding of video and image content, integrating various large language models.
Multimodal AI System
2.9k
InternLM/InternLM-XComposer
A comprehensive multimodal AI system specializing in long-term streaming video and audio interactions, offering advanced vision-language understanding and composition.