jingyun

hjy

huajingyun

AI & ML interests

NLP

Recent Activity

upvoted a paper about 1 month ago

AVoCaDO: An Audiovisual Video Captioner Driven by Temporal Orchestration

upvoted a paper about 1 month ago

When Modalities Conflict: How Unimodal Reasoning Uncertainty Governs Preference Dynamics in MLLMs

authored a paper about 2 months ago

AVoCaDO: An Audiovisual Video Captioner Driven by Temporal Orchestration

View all activity

Organizations

None yet

upvoted 2 papers about 1 month ago

AVoCaDO: An Audiovisual Video Captioner Driven by Temporal Orchestration

Paper • 2510.10395 • Published Oct 12 • 29

When Modalities Conflict: How Unimodal Reasoning Uncertainty Governs Preference Dynamics in MLLMs

Paper • 2511.02243 • Published Nov 4 • 24

authored a paper about 2 months ago

AVoCaDO: An Audiovisual Video Captioner Driven by Temporal Orchestration

Paper • 2510.10395 • Published Oct 12 • 29

authored a paper 3 months ago

Kwai Keye-VL 1.5 Technical Report

Paper • 2509.01563 • Published Sep 1 • 37

liked a Space 3 months ago

FineVision: Open Data is All You Need

📝

210

A new open-source dataset for training VLMs

liked a model 3 months ago

Kwai-Keye/Keye-VL-1_5-8B

Video-Text-to-Text • 9B • Updated Sep 4 • 51.7k • 59

liked 2 models 4 months ago

deepseek-ai/DeepSeek-V3.1

Text Generation • 685B • Updated Sep 5 • 78.8k • • 808

facebook/dinov3-vit7b16-pretrain-lvd1689m

Image Feature Extraction • 7B • Updated Aug 19 • 26.4k • 194

upvoted a paper 5 months ago

Mengzi: Towards Lightweight yet Ingenious Pre-trained Models for Chinese

Paper • 2110.06696 • Published Oct 13, 2021 • 2

authored 4 papers 5 months ago

Mengzi: Towards Lightweight yet Ingenious Pre-trained Models for Chinese

Paper • 2110.06696 • Published Oct 13, 2021 • 2

HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language Models

Paper • 2502.20811 • Published Feb 28 • 3

Mavors: Multi-granularity Video Representation for Multimodal Large Language Model

Paper • 2504.10068 • Published Apr 14 • 30

Kwai Keye-VL Technical Report

Paper • 2507.01949 • Published Jul 2 • 131

upvoted a paper 5 months ago

Kwai Keye-VL Technical Report

Paper • 2507.01949 • Published Jul 2 • 131

liked 2 models 6 months ago

moonshotai/Kimi-VL-A3B-Thinking-2506

Image-Text-to-Text • 16B • Updated Aug 18 • 151k • 325

deepseek-ai/DeepSeek-R1-0528-Qwen3-8B

Text Generation • 8B • Updated May 29 • 505k • • 994

upvoted 2 papers 6 months ago

HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language Models

Paper • 2502.20811 • Published Feb 28 • 3

OneRec Technical Report

Paper • 2506.13695 • Published Jun 16 • 17

liked 2 datasets 6 months ago

omni-research/Tarsier2-Recap-585K

Preview • Updated Jan 24 • 31.8k • 19

yys/OpenOrca-Chinese

Viewer • Updated Sep 8, 2023 • 3.25M • 199 • 100

jingyun

AI & ML interests

Recent Activity

Organizations

hjy's activity

FineVision: Open Data is All You Need