Trained ExpRL checkpoints. Paper link: https://arxiv.org/abs/2606.17024
Violet Xiang PRO
violetxi
Recent Activity
upvoted a paper about 2 hours ago
ExpRL: Exploratory RL for LLM Mid-Training
updated a collection about 2 hours ago
ExpRL
submitted a paper about 2 hours ago
ExpRL: Exploratory RL for LLM Mid-Training