Lansechen/Qwen2.5-3B-Distill-om220k-fem32768-batch32-epoch3-8192 Text Generation • 3B • Updated Mar 22 • 3
Lansechen/Qwen2.5-3B-Distill-om220k-fhm32768-batch32-epoch3-8192 Text Generation • 3B • Updated Mar 24 • 3
chenggong1995/Qwen-2.5-Base-3B-gen8-scale-MATH-lighteval-olympiads_aime-unique-ghpo-beta0-epoch3 Text Generation • 3B • Updated Apr 11 • 4
chenggong1995/Qwen-2.5-Base-3B-gen8-scale-math_selected-grpo-beta0-epoch3 Text Generation • 3B • Updated Apr 10 • 7
chenggong1995/openr1-Qwen-2.5-Base-3B-gen8-scale-NuminaMath-TIR-100-grpo-beta0-epoch2 Text Generation • 3B • Updated Apr 11 • 8
Lansechen/Qwen2.5-3B-Open-R1-GRPO-math-selected-cosine-noRW Text Generation • 3B • Updated Apr 11 • 8
Lansechen/Qwen2.5-3B-Open-R1-GRPO-math-selected-cosine-noRW-RP Text Generation • 3B • Updated Apr 15 • 6
Lansechen/Qwen2.5-3B-Open-R1-GRPO-math-selected-cosine-noRW-RP-v2 Text Generation • 3B • Updated Apr 16 • 4