1 52 122

gerald hewes

gerald29

AI & ML interests

None yet

Recent Activity

liked a model about 1 month ago

internlm/JanusCoderV-8B

liked a model about 1 month ago

MiniMaxAI/MiniMax-M2

liked a model about 2 months ago

katanemo/Arch-Router-1.5B

View all activity

Organizations

upvoted 2 papers about 2 months ago

StreamingVLM: Real-Time Understanding for Infinite Video Streams

Paper • 2510.09608 • Published Oct 10 • 50

CommonForms: A Large, Diverse Dataset for Form Field Detection

Paper • 2509.16506 • Published Sep 20 • 19

upvoted a paper 3 months ago

Kosmos-2.5: A Multimodal Literate Model

Paper • 2309.11419 • Published Sep 20, 2023 • 55

upvoted an article 6 months ago

Article

(LoRA) Fine-Tuning FLUX.1-dev on Consumer Hardware

Jun 19

•

upvoted 2 papers 6 months ago

V-JEPA 2: Self-Supervised Video Models Enable Understanding, Prediction and Planning

Paper • 2506.09985 • Published Jun 11 • 29

ComfyUI-Copilot: An Intelligent Assistant for Automated Workflow Development

Paper • 2506.05010 • Published Jun 5 • 79

upvoted a collection 7 months ago

D-FINE

Collection

State-of-the-art real-time object detection model with Apache 2.0 licence • 15 items • Updated May 5 • 56

upvoted a collection 8 months ago

Perception Encoder

Collection

17 items • Updated Jul 11 • 71

upvoted a paper 8 months ago

Qwen2.5-Omni Technical Report

Paper • 2503.20215 • Published Mar 26 • 166

upvoted an article 9 months ago

Article

Introducing smolagents: simple agents that write actions in code.

Dec 31, 2024

•

1.15k

upvoted a paper 9 months ago

Executable Code Actions Elicit Better LLM Agents

Paper • 2402.01030 • Published Feb 1, 2024 • 182

upvoted an article 9 months ago

Article

🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It?

Mar 17

•

344

upvoted a paper 9 months ago

SynCity: Training-Free Generation of 3D Worlds

Paper • 2503.16420 • Published Mar 20 • 27

upvoted 7 papers 10 months ago

LLM-based User Profile Management for Recommender System

Paper • 2502.14541 • Published Feb 20 • 6

From RAG to Memory: Non-Parametric Continual Learning for Large Language Models

Paper • 2502.14802 • Published Feb 20 • 13

Enhancing Cognition and Explainability of Multimodal Foundation Models with Self-Synthesized Data

Paper • 2502.14044 • Published Feb 19 • 8

RelaCtrl: Relevance-Guided Efficient Control for Diffusion Transformers

Paper • 2502.14377 • Published Feb 20 • 12

Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation

Paper • 2502.14846 • Published Feb 20 • 14

NAVIG: Natural Language-guided Analysis with Vision Language Models for Image Geo-localization

Paper • 2502.14638 • Published Feb 20 • 11

S^2R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning

Paper • 2502.12853 • Published Feb 18 • 29

gerald hewes

AI & ML interests

Recent Activity

Organizations

gerald29's activity

(LoRA) Fine-Tuning FLUX.1-dev on Consumer Hardware

Introducing smolagents: simple agents that write actions in code.

🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It?