Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
ganglii
's Collections
DRPO
DisCO
DisCO
updated
Oct 8
Discriminative Constrained Optimization for Reinforcing Large Reasoning Models
Upvote
1
ganglii/DisCO-1.5B-logL
2B
•
Updated
May 26
•
12
ganglii/DisCO-1.5B-Lratio
2B
•
Updated
May 26
•
13
ganglii/DisCO-7B-logL
8B
•
Updated
May 26
•
9
ganglii/DisCO-7B-Lratio
8B
•
Updated
May 26
•
10
Upvote
1
Share collection
View history
Collection guide
Browse collections