Charles DUVAL's picture

In a Training Loop 🔄

7 14

Charles DUVAL

Chumafly

·

AI & ML interests

None yet

Organizations

None yet

upvoted a collection 3 months ago

Qwen3-Omni

6 items • Updated Oct 9 • 166

upvoted 3 papers 6 months ago

Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding

Paper • 2505.22618 • Published May 28 • 44

Taming LLMs by Scaling Learning Rates with Gradient Grouping

Paper • 2506.01049 • Published Jun 1 • 38

More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models

Paper • 2505.21523 • Published May 23 • 13

upvoted an article 6 months ago

Article

No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL

+4

Jun 3

•

96

upvoted a paper 6 months ago

AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time

Paper • 2505.24863 • Published May 30 • 97

upvoted an article about 1 year ago

Article

On Learning JAX – A Framework for High Performance Machine Learning

Dec 3, 2023

•

3