WangShouli
WangSl2004
AI & ML interests
None yet
Recent Activity
upvoted a paper about 3 hours ago
Reproducing, Analyzing, and Detecting Reward Hacking in Rubric-Based Reinforcement Learning liked a dataset 6 months ago
SWE-bench/SWE-bench upvoted a paper 7 months ago
MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and
Training Recipe