
Haomin Wang

KiyotakaWang

https://hmwang2002.github.io/

hmwang2002

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

MM-ACT: Learn from Multimodal Parallel Generation to Act

Paper • 2512.00975 • Published 7 days ago • 6

upvoted a paper 2 days ago

ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning

Paper • 2512.05111 • Published 3 days ago • 40

upvoted a paper about 1 month ago

Human-Agent Collaborative Paper-to-Page Crafting for Under $0.1

Paper • 2510.19600 • Published Oct 22 • 68

updated a Space about 2 months ago

README

🏃

published a Space about 2 months ago

README

🏃

liked a dataset about 2 months ago

InternSVG/SArena

Viewer • Updated Nov 4 • 14k • 2.53k • 8

updated a dataset about 2 months ago

InternSVG/SArena

Viewer • Updated Nov 4 • 14k • 2.53k • 8

authored a paper about 2 months ago

InternSVG: Towards Unified SVG Tasks with Multimodal Large Language Models

Paper • 2510.11341 • Published Oct 13 • 34

published a dataset about 2 months ago

InternSVG/SArena

Viewer • Updated Nov 4 • 14k • 2.53k • 8

upvoted 2 papers about 2 months ago

Vlaser: Vision-Language-Action Model with Synergistic Embodied Reasoning

Paper • 2510.11027 • Published Oct 13 • 21

InternSVG: Towards Unified SVG Tasks with Multimodal Large Language Models

Paper • 2510.11341 • Published Oct 13 • 34

commented a paper about 2 months ago

InternSVG: Towards Unified SVG Tasks with Multimodal Large Language Models

Paper • 2510.11341 • Published Oct 13 • 34
