When Does RL Help Medical VLMs? Disentangling Vision, SFT, and RL Gains • Paper • 2603.01301 • Published 3 days ago • 8
LoopFormer: Elastic-Depth Looped Transformers for Latent Reasoning via Shortcut Modulation • Paper • 2602.11451 • Published 21 days ago • 15
PC-GRPO • Collection • Qwen2.5-VL-3B & 7B models trained with PC-GRPO in the paper: Puzzle Curriculum GRPO for Vision-Centric Reasoning • 9 items • Updated 20 days ago • 3
LoopFormer • Collection • Models trained in the ICLR 2026 paper: LoopFormer: Elastic-Depth Looped Transformers for Latent Reasoning via Shortcut Modulation • 17 items • Updated 13 days ago • 2
PickStyle: Video-to-Video Style Transfer with Context-Style Adapters • Paper • 2510.07546 • Published Oct 8, 2025 • 22