arxiv:2509.24897
Yuran Wang
Ryann829
·
AI & ML interests
Multimodal Large Language Model
Recent Activity
upvoted
a
paper
about 1 month ago
When Modalities Conflict: How Unimodal Reasoning Uncertainty Governs
Preference Dynamics in MLLMs
authored
a paper
about 1 month ago
Ocean-OCR: Towards General OCR Application via a Vision-Language Model
authored
a paper
about 1 month ago
DualToken: Towards Unifying Visual Understanding and Generation with
Dual Visual Vocabularies