RynnVLA-002: A Unified Vision-Language-Action and World Model Paper β’ 2511.17502 β’ Published 15 days ago β’ 24
Parrot: Persuasion and Agreement Robustness Rating of Output Truth -- A Sycophancy Robustness Benchmark for LLMs Paper β’ 2511.17220 β’ Published 15 days ago β’ 16
Benchmarking Diversity in Image Generation via Attribute-Conditional Human Evaluation Paper β’ 2511.10547 β’ Published 23 days ago β’ 4
UniVA: Universal Video Agent towards Open-Source Next-Generation Video Generalist Paper β’ 2511.08521 β’ Published 25 days ago β’ 37
One Small Step in Latent, One Giant Leap for Pixels: Fast Latent Upscale Adapter for Your Diffusion Models Paper β’ 2511.10629 β’ Published 23 days ago β’ 122
Depth Anything 3: Recovering the Visual Space from Any Views Paper β’ 2511.10647 β’ Published 23 days ago β’ 92
Kimi Linear: An Expressive, Efficient Attention Architecture Paper β’ 2510.26692 β’ Published Oct 30 β’ 114
WithAnyone: Towards Controllable and ID Consistent Image Generation Paper β’ 2510.14975 β’ Published Oct 16 β’ 84
Durian: Dual Reference-guided Portrait Animation with Attribute Transfer Paper β’ 2509.04434 β’ Published Sep 4 β’ 10
OpenAI-GPT 20B, 37B ,120B: Neo, reg, uncensored, ablit. Collection OpenAi's model in various sizes and formats, including NEO Imatrix, DI, Tri Matrix, Uncensored, Albiterated, and Brainstorm 20x (37B). β’ 9 items β’ Updated 19 days ago β’ 9
200+ Roleplay, Creative Writing, Uncensored, NSFW models. Collection Oldest models listed first, with Newest models at bottom of the page. Most repos have full examples, instructions, best settings and so on. β’ 335 items β’ Updated 1 day ago β’ 378
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency Paper β’ 2508.18265 β’ Published Aug 25 β’ 208
T-LoRA: Single Image Diffusion Model Customization Without Overfitting Paper β’ 2507.05964 β’ Published Jul 8 β’ 119
Thinking with Images for Multimodal Reasoning: Foundations, Methods, and Future Frontiers Paper β’ 2506.23918 β’ Published Jun 30 β’ 89
WebSailor: Navigating Super-human Reasoning for Web Agent Paper β’ 2507.02592 β’ Published Jul 3 β’ 123
Fine-Grained Preference Optimization Improves Spatial Reasoning in VLMs Paper β’ 2506.21656 β’ Published Jun 26 β’ 15