ThinMQM (automated translation evaluation, MQM) model and data collection.
Runzhe Zhan
rzzhan
AI & ML interests
None yet
Recent Activity
upvoted a paper about 19 hours ago
π-Bench: Evaluating Proactive Personal Assistant Agents in Long-Horizon Workflows
updated a model 2 days ago
Simplified-Reasoning/SU-01
liked a model 3 days ago
Simplified-Reasoning/SU-01