arxiv:2509.22186
Bin Wang
wanderkid
AI & ML interests
Computer Vision, Multimodal Large Language Model
Recent Activity
liked
a Space
1 day ago
opendatalab/TRivia-3B
liked
a dataset
6 days ago
opendatalab/AICC
authored
a paper
2 months ago
MinerU2.5: A Decoupled Vision-Language Model for Efficient
High-Resolution Document Parsing