arxiv:2602.10693
floyed shen
floyed
AI & ML interests
None yet
Recent Activity
upvoted a paper about 20 hours ago
Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning commented on
a paper
7 days ago
VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training upvoted a paper 11 days ago
dLLM: Simple Diffusion Language Modeling