view article Article OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models Jul 18 • 50
Llama Nemotron Feedback-Edit Inference-Time Scaling Collection Novel ITS approach for open-ended tasks - No. 1 on Arena Hard on 18 Mar 2025 • 5 items • Updated 3 days ago • 6
SSMs Collection A collection of Mamba-2-based research models with 8B parameters trained on 3.5T tokens for comparison with Transformers. • 5 items • Updated 3 days ago • 29
NV-Embed Collection NV-Embed is a generalist embedding model encompassing retrieval, reranking, classification, clustering, STS tasks. • 3 items • Updated 3 days ago • 16
RLHF Collection A collection of models trained with Reinforcement Learning from Human Feedback (RLHF). • 4 items • Updated 3 days ago • 6
OpenMath Collection A collection of models and datasets introduced in "OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset" • 15 items • Updated 3 days ago • 45
InstructRetro Collection InstructRetro is an autoregressive decoder-only language model (LM) with retrieval-augmented pretraining and instruction tuning. • 4 items • Updated 3 days ago • 10
Canary Collection A collection of multilingual and multitask speech to text models from NVIDIA NeMo 🐤 • 5 items • Updated 3 days ago • 29
Parakeet Collection NeMo Parakeet ASR Models attain strong speech recognition accuracy while being efficient for inference. Available in CTC and RNN-Transducer variants. • 12 items • Updated 3 days ago • 47
SteerLM Collection A collection of models and datasets relating to SteerLM and HelpSteer. • 7 items • Updated 3 days ago • 15
Nemotron 3 8B Collection The Nemotron 3 8B Family of models is optimized for building production-ready generative AI applications for the enterprise. • 5 items • Updated 3 days ago • 52