tzjz89's picture

7 4

tzjz89

tzjz89

·

AI & ML interests

NLP

Recent Activity

upvoted a paper 6 days ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

upvoted a paper 5 months ago

Group Sequence Policy Optimization

upvoted a paper 6 months ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

View all activity

Organizations

models 0

None public yet

datasets 0

None public yet