Anwar

abdoali5672

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

PretrainZero: Reinforcement Active Pretraining

upvoted a paper 3 days ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

upvoted a paper 5 days ago

DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning

Organizations

None yet

commented on 2 papers about 2 months ago

Predicting LLM Reasoning Performance with Small Proxy Model

Paper • 2509.21013 • Published Sep 25 • 1

Direct Multi-Token Decoding

Paper • 2510.11958 • Published Oct 13 • 5

commented on 3 papers 3 months ago

Reasoning over Boundaries: Enhancing Specification Alignment via Test-time Delibration

Paper • 2509.14760 • Published Sep 18 • 53

AU-Harness: An Open-Source Toolkit for Holistic Evaluation of Audio LLMs

Paper • 2509.08031 • Published Sep 9 • 21

VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model

Paper • 2509.09372 • Published Sep 11 • 239

commented on 6 papers 6 months ago

SuperBPE: Space Travel for Language Models

Paper • 2503.13423 • Published Mar 17 • 13

Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning

Paper • 2505.24726 • Published May 30 • 276

BPE Gets Picky: Efficient Vocabulary Refinement During Tokenizer Training

Paper • 2409.04599 • Published Sep 6, 2024 • 2

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 286

Pychop: Emulating Low-Precision Arithmetic in Numerical Methods and Neural Networks

Paper • 2504.07835 • Published Apr 10

Low-Precision Training of Large Language Models: Methods, Challenges, and Opportunities

Paper • 2505.01043 • Published May 2 • 10

commented on a paper 7 months ago

A Survey on Post-training of Large Language Models

Paper • 2503.06072 • Published Mar 8 • 10