12 415 2

Anwar

abdoali5672

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

PretrainZero: Reinforcement Active Pretraining

upvoted a paper 3 days ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

upvoted a paper 5 days ago

DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning

View all activity

Organizations

None yet

upvoted a paper 2 days ago

PretrainZero: Reinforcement Active Pretraining

Paper • 2512.03442 • Published 4 days ago • 39

upvoted a paper 3 days ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published 5 days ago • 166

upvoted a paper 5 days ago

DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning

Paper • 2511.22570 • Published 9 days ago • 63

upvoted 2 papers 8 days ago

NorMuon: Making Muon more efficient and scalable

Paper • 2510.05491 • Published Oct 7 • 8

ROOT: Robust Orthogonalized Optimizer for Neural Network Training

Paper • 2511.20626 • Published 11 days ago • 169

upvoted 2 papers 10 days ago

SSA: Sparse Sparse Attention by Aligning Full and Sparse Attention Outputs in Feature Space

Paper • 2511.20102 • Published 12 days ago • 26

Soft Adaptive Policy Optimization

Paper • 2511.20347 • Published 11 days ago • 33

upvoted a paper 11 days ago

General Agentic Memory Via Deep Research

Paper • 2511.18423 • Published 14 days ago • 155

upvoted 2 papers 12 days ago

OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

Paper • 2511.16334 • Published 16 days ago • 91

Unveiling Intrinsic Dimension of Texts: from Academic Abstract to Creative Story

Paper • 2511.15210 • Published 18 days ago • 86

upvoted 2 papers 14 days ago

Step-Audio-R1 Technical Report

Paper • 2511.15848 • Published 17 days ago • 51

Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning

Paper • 2511.16043 • Published 17 days ago • 104

upvoted 2 papers 17 days ago

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

Paper • 2511.11793 • Published 22 days ago • 158

AraLingBench A Human-Annotated Benchmark for Evaluating Arabic Linguistic Capabilities of Large Language Models

Paper • 2511.14295 • Published 19 days ago • 71

upvoted a paper 19 days ago

DoPE: Denoising Rotary Position Embedding

Paper • 2511.09146 • Published 25 days ago • 92

upvoted a paper 20 days ago

Virtual Width Networks

Paper • 2511.11238 • Published 22 days ago • 35

upvoted 2 papers 22 days ago

Mid-Training of Large Language Models: A Survey

Paper • 2510.06826 • Published Oct 8 • 1

Black-Box On-Policy Distillation of Large Language Models

Paper • 2511.10643 • Published 23 days ago • 46

upvoted a paper 24 days ago

The Era of Agentic Organization: Learning to Organize with Language Models

Paper • 2510.26658 • Published Oct 30 • 26

upvoted a paper 25 days ago

QueST: Incentivizing LLMs to Generate Difficult Problems

Paper • 2510.17715 • Published Oct 20 • 33

Anwar

AI & ML interests

Recent Activity

Organizations

abdoali5672's activity