arxiv:2604.07776
Xing Han Lù
xhluca
AI & ML interests
None yet
Recent Activity
upvoted a paper 9 days ago
CUA-Gym: Scaling Verifiable Training Environments and Tasks for Computer-Use Agents
upvoted a paper 13 days ago
Forecasting Downstream Performance of LLMs With Proxy Metrics
updated a Space 18 days ago
McGill-NLP/agent-reward-bench-leaderboard