Tags: #multi-modal
MemTensor/MemOS
An AI memory operating system for LLM and agent systems, enabling persistent skill memory for cross-task reuse and evolution.
Portkey-AI/gateway
A blazing fast, open-source AI Gateway for routing requests to 1600+ LLMs and multi-modal models with integrated guardrails, ensuring reliability and scalability.
xszyou/Fay
Fay is an AI agent framework designed to connect digital humans (2.5D, 3D, mobile, PC, web) and large language models (OpenAI compatible, DeepSeek) with various business systems.
bghira/SimpleTuner
A user-friendly, versatile fine-tuning kit for image, video, and audio diffusion models, emphasizing simplicity and cutting-edge features.
yzhao062/pyod
A comprehensive Python library offering 60+ anomaly detectors for multi-modal data, featuring an agentic workflow for AI agents and benchmark-backed orchestration.
X-PLUG/mPLUG-Owl
A family of powerful multi-modal large language models (MLLMs) designed to advance AI's understanding and generation capabilities across various data types.