Shrimai Prabhumoye's picture

1 11 3

Shrimai Prabhumoye

shrimai19

·

https://shrimai.github.io/

AI & ML interests

None yet

Organizations

upvoted 3 papers 2 months ago

Front-Loading Reasoning: The Synergy between Pretraining and Post-Training Data

Paper • 2510.03264 • Published Sep 26 • 23

RLAD: Training LLMs to Discover Abstractions for Solving Reasoning Problems

Paper • 2510.02263 • Published Oct 2 • 8

RLP: Reinforcement as a Pretraining Objective

Paper • 2510.01265 • Published Sep 26 • 40

upvoted a paper 3 months ago

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

Paper • 2509.01055 • Published Sep 1 • 75

upvoted a paper 4 months ago

NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model

Paper • 2508.14444 • Published Aug 20 • 38

upvoted 6 papers 7 months ago

Think Only When You Need with Large Hybrid-Reasoning Models

Paper • 2505.14631 • Published May 20 • 20

Optimizing Anytime Reasoning via Budget Relative Policy Optimization

Paper • 2505.13438 • Published May 19 • 36

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14 • 317

Model Merging in Pre-training of Large Language Models

Paper • 2505.12082 • Published May 17 • 40

Thinkless: LLM Learns When to Think

Paper • 2505.13379 • Published May 19 • 50

Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures

Paper • 2505.09343 • Published May 14 • 73