Runzhe Zhan's picture

Runzhe Zhan

rzzhan

·

https://runzhe.me/

Ririkoo

AI & ML interests

None yet

Recent Activity

upvoted a paper about 19 hours ago

π-Bench: Evaluating Proactive Personal Assistant Agents in Long-Horizon Workflows

updated a model 2 days ago

Simplified-Reasoning/SU-01

liked a model 3 days ago

Simplified-Reasoning/SU-01

View all activity

Organizations

Collections 2

models 10

rzzhan/ThinMQM-8B

Text Generation • 8B • Updated Oct 28, 2025 • 4

rzzhan/ExGRPO-Llama3.1-8B-Instruct

Text Generation • 8B • Updated Oct 24, 2025 • 3

rzzhan/ExGRPO-Llama3.1-8B-Zero

Text Generation • 8B • Updated Oct 24, 2025

rzzhan/ExGRPO-Qwen2.5-Math-1.5B-Zero

Text Generation • 2B • Updated Oct 24, 2025 • 3

rzzhan/ExGRPO-Qwen2.5-7B-Instruct

Text Generation • 8B • Updated Oct 24, 2025 • 3

rzzhan/ExGRPO-LUFFY-7B-Continual

Text Generation • 8B • Updated Oct 24, 2025 • 3 • 1

rzzhan/ExGRPO-Qwen2.5-Math-7B-Zero

Text Generation • 8B • Updated Oct 24, 2025 • 3

rzzhan/ThinMQM-7B

8B • Updated Oct 24, 2025 • 2

rzzhan/ThinMQM-32B

33B • Updated Oct 24, 2025 • 2

rzzhan/tiny-llama-stories-42m

Updated Sep 17, 2024 • 2 • 1

datasets 1

rzzhan/ThinMQM-12k

Viewer • Updated Oct 24, 2025 • 23.9k • 24