Junkang Wu

junkang0909

https://junkangwu.github.io/

AI & ML interests

LLM alignment

Recent Activity

upvoted a paper 5 days ago

Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning

upvoted a paper 5 days ago

Rubric-based On-policy Distillation

upvoted a paper about 2 months ago

On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation

Organizations

None yet

commented on a paper 8 months ago

Quantile Advantage Estimation for Entropy-Safe Reasoning

Paper • 2509.22611 • Published Sep 26, 2025 • 119

commented on a paper about 1 year ago

RePO: ReLU-based Preference Optimization

Paper • 2503.07426 • Published Mar 10, 2025 • 2