Henry Hengyuan Zhao

hhenryz

https://zhaohengyuan1.github.io/

AI & ML interests

Multimodal Reasoning, Human-AI Interaction, GUI Automation

Recent Activity

upvoted a paper 15 days ago

Computer-Use Agents as Judges for Generative User Interface

Paper • 2511.15567 • Published 21 days ago • 51

upvoted a paper 28 days ago

Grounding Computer Use Agents on Human Demonstrations

Paper • 2511.07332 • Published 30 days ago • 104

liked a dataset 29 days ago

open-thoughts/OpenThoughts3-1.2M

Viewer • Updated Jun 9 • 1.2M • 13.9k • 187

upvoted 2 papers about 1 month ago

Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs

Paper • 2506.14245 • Published Jun 17 • 44

VCode: a Multimodal Coding Benchmark with SVG as Symbolic Visual Representation

Paper • 2511.02778 • Published Nov 4 • 101

liked a dataset about 1 month ago

CSU-JPG/Chart2Code

Updated 2 days ago • 263 • 4

updated a collection about 2 months ago

Personal Interest

Collection

5 items • Updated Oct 23

upvoted a paper about 2 months ago

From Charts to Code: A Hierarchical Benchmark for Multimodal Models

Paper • 2510.17932 • Published Oct 20 • 7

commented a paper about 2 months ago

From Charts to Code: A Hierarchical Benchmark for Multimodal Models

Paper • 2510.17932 • Published Oct 20 • 7

upvoted a collection about 2 months ago

Qwen3-VL

Collection

37 items • Updated Nov 1 • 502

upvoted 2 papers 2 months ago

Paper2Video: Automatic Video Generation from Scientific Papers

Paper • 2510.05096 • Published Oct 6 • 117

LongLive: Real-time Interactive Long Video Generation

Paper • 2509.22622 • Published Sep 26 • 184

upvoted a paper 3 months ago

BaseReward: A Strong Baseline for Multimodal Reward Model

Paper • 2509.16127 • Published Sep 19 • 21

liked a dataset 4 months ago

lmms-lab/TempCompass

Viewer • Updated Jun 10, 2024 • 7.54k • 1.75k • 6

upvoted a collection 5 months ago

NVILA

Collection

11 items • Updated Sep 13 • 16

upvoted a paper 5 months ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14 • 303

updated a dataset 5 months ago

hhenryz/WorldGUI-Bench

Updated Jun 28 • 163

upvoted a paper 7 months ago

Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers

Paper • 2505.21497 • Published May 27 • 109

published a dataset 7 months ago

hhenryz/WorldGUI-Bench

Updated Jun 28 • 163

upvoted a paper 7 months ago

Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models

Paper • 2505.04921 • Published May 8 • 186
