Ecosystem & Stack: xllm
Large Vision-Language Model Framework
xllm
5.9k
om-ai-lab/VLM-R1
A stable and generalizable R1-style Large Vision-Language Model (VLM) framework that improves visual understanding through reinforcement learning and generalizes better than supervised fine-tuning (SFT) models.