Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective Resampling Paper • 2504.13169 • Published Apr 17 • 39
Puzzled by Puzzles: When Vision-Language Models Can't Take a Hint Paper • 2505.23759 • Published May 29 • 5
Learning Adaptive Parallel Reasoning with Language Models Paper • 2504.15466 • Published Apr 21 • 44
Describe Anything: Detailed Localized Image and Video Captioning Paper • 2504.16072 • Published Apr 22 • 63
Atlas: Multi-Scale Attention Improves Long Context Image Modeling Paper • 2503.12355 • Published Mar 16 • 12
Diffusion Hyperfeatures: Searching Through Time and Space for Semantic Correspondence Paper • 2305.14334 • Published May 23, 2023 • 1
Readout Guidance: Learning Control from Diffusion Features Paper • 2312.02150 • Published Dec 4, 2023 • 3
Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition Paper • 2403.19822 • Published Mar 28, 2024
ALOHa: A New Measure for Hallucination in Captioning Models Paper • 2404.02904 • Published Apr 3, 2024
Virtual Personas for Language Models via an Anthology of Backstories Paper • 2407.06576 • Published Jul 9, 2024 • 1
Visual Haystacks: Answering Harder Questions About Sets of Images Paper • 2407.13766 • Published Jul 18, 2024 • 2
Post: 🚨 Launching The Visual Haystacks (VHs) Benchmark: the first "visual-centric" Needle-In-A-Haystack (NIAH) benchmark to assess LMMs' capability in long-context visual retrieval and reasoning. Check it out! tsunghanwu/visual_haystacks • https://visual-haystacks.github.io/ • https://arxiv.org/abs/2407.13766 • https://github.com/visual-haystacks/vhs_benchmark
Rethinking Patch Dependence for Masked Autoencoders Paper • 2401.14391 • Published Jan 25, 2024 • 26
Task Oriented Dialogue as a Catalyst for Self-Supervised Automatic Speech Recognition Paper • 2401.02417 • Published Jan 4, 2024 • 1