What will happen if we train a Q function for digital agents?
HAO BAI
JackBAI
AI & ML interests
Representation learning, language models.
Recent Activity
upvoted a paper about 6 hours ago
OpenWebRL: Demystifying Online Multi-turn Reinforcement Learning for Visual Web Agents upvoted a paper 18 days ago
Orchard: An Open-Source Agentic Modeling Framework upvoted a paper about 1 month ago
Heterogeneous Scientific Foundation Model Collaboration