MM-ACT: Learn from Multimodal Parallel Generation to Act Paper • 2512.00975 • Published 10 days ago • 6
ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning Paper • 2512.05111 • Published 6 days ago • 45