Bugrahan's picture

31 24

Bugrahan

nuwandaa

·

nuwandda

AI & ML interests

None yet

Recent Activity

upvoted a paper 7 days ago

Canvas-to-Image: Compositional Image Generation with Multimodal Controls

upvoted a paper about 1 month ago

VFXMaster: Unlocking Dynamic Visual Effect Generation via In-Context Learning

upvoted a paper about 2 months ago

Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation

View all activity

Organizations

upvoted a paper 7 days ago

Canvas-to-Image: Compositional Image Generation with Multimodal Controls

Paper • 2511.21691 • Published 11 days ago • 32

upvoted a paper about 1 month ago

VFXMaster: Unlocking Dynamic Visual Effect Generation via In-Context Learning

Paper • 2510.25772 • Published Oct 29 • 32

upvoted a paper about 2 months ago

Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation

Paper • 2510.08673 • Published Oct 9 • 125

upvoted a paper 2 months ago

EditVerse: Unifying Image and Video Editing and Generation with In-Context Learning

Paper • 2509.20360 • Published Sep 24 • 17

upvoted 3 papers 5 months ago

Ultra3D: Efficient and High-Fidelity 3D Generation with Part Attention

Paper • 2507.17745 • Published Jul 23 • 35

Pixels, Patterns, but No Poetry: To See The World like Humans

Paper • 2507.16863 • Published Jul 21 • 68

Yume: An Interactive World Generation Model

Paper • 2507.17744 • Published Jul 23 • 87

upvoted 5 papers 6 months ago

TaskCraft: Automated Generation of Agentic Tasks

Paper • 2506.10055 • Published Jun 11 • 32

Native-Resolution Image Synthesis

Paper • 2506.03131 • Published Jun 3 • 18

EasyText: Controllable Diffusion Transformer for Multilingual Text Rendering

Paper • 2505.24417 • Published May 30 • 13

ARM: Adaptive Reasoning Model

Paper • 2505.20258 • Published May 26 • 45

Alchemist: Turning Public Text-to-Image Data into Generative Gold

Paper • 2505.19297 • Published May 25 • 84

upvoted 2 papers 7 months ago

TEMPURA: Temporal Event Masked Prediction and Understanding for Reasoning in Action

Paper • 2505.01583 • Published May 2 • 8

YoChameleon: Personalized Vision and Language Generation

Paper • 2504.20998 • Published Apr 29 • 12

upvoted 6 papers 8 months ago

DreamO: A Unified Framework for Image Customization

Paper • 2504.16915 • Published Apr 23 • 24

VLM-R1: A Stable and Generalizable R1-style Large Vision-Language Model

Paper • 2504.07615 • Published Apr 10 • 35

Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model

Paper • 2504.08685 • Published Apr 11 • 130

DDT: Decoupled Diffusion Transformer

Paper • 2504.05741 • Published Apr 8 • 77

Less-to-More Generalization: Unlocking More Controllability by In-Context Generation

Paper • 2504.02160 • Published Apr 2 • 37

One-Minute Video Generation with Test-Time Training

Paper • 2504.05298 • Published Apr 7 • 110