vlm-r1

Here are 4 public repositories matching this topic...

Solve Visual Understanding with Reinforced VLMs

reinforcement-learning vlm multimodal llm qwen deepseek-r1 grpo r1-zero vlm-r1 multimodal-r1

Skywork-R1V is an advanced multimodal AI model series developed by Skywork AI (Kunlun Inc.), specializing in vision-language reasoning.

reinforcement-learning reasoning vlm llm multimodal-understanding deepseek-r1 grpo vlm-r1 multimodal-r1 r1v skywork-r1v

(ICCV-2025 Official Code)) Improving Generalist Model with Domain-Specific Experts

Proposed fuzzy reward model with GRPO to improve VLM's abilities in crowd counting task.

reinforcement-learning vlm crowdcounting llm reward-model r1-zero vlm-r1 multimodal-r1

Add a description, image, and links to the vlm-r1 topic page so that developers can more easily learn about it.

To associate your repository with the vlm-r1 topic, visit your repo's landing page and select "manage topics."