huangyundu

yundu

AI & ML interests

None yet

Organizations

None yet

Recent Activity

liked a model about 1 month ago

moonshotai/Kimi-K2-Thinking

Text Generation • Updated Nov 8 • 381k • 1.52k

upvoted a collection about 1 month ago

post-train

1 item • Updated Oct 31 • 1

upvoted 4 papers about 1 month ago

Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning

Paper • 2510.25992 • Published Oct 29 • 45

Reasoning with Sampling: Your Base Model is Smarter Than You Think

Paper • 2510.14901 • Published Oct 16 • 47

Tongyi DeepResearch Technical Report

Paper • 2510.24701 • Published Oct 28 • 97

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9 • 266