Hanning Zhang

HanningZhang

AI & ML interests

None yet

Recent Activity

upvoted a paper 14 days ago

Agents' Last Exam

upvoted a paper about 1 month ago

Code as Agent Harness

liked a dataset 3 months ago

nvidia/Nemotron-Agentic-v1

Papers 2

arxiv:2505.02391

arxiv:2502.19613

Models 282

HanningZhang/deepseek_only_conjecture_claude_deepseek_train_data_max1_5e-7_bs32_decay1e-6_2ep_ep1

Text Generation • 7B • Updated Jan 12 • 4

HanningZhang/PhysProver-7B

Text Generation • 7B • Updated Jan 4 • 6 • 1

HanningZhang/physlean_ds_prover_noapply_grpo_1e-6_bs256_step35

Text Generation • 7B • Updated Jan 2 • 3

HanningZhang/physlean_ds_prover_grpo_1e-6_bs256_step90

Text Generation • 7B • Updated Dec 31, 2025 • 1

HanningZhang/physicslean_kimina_train_gen_from_claude_and_grok_deepseek_one_sample_ep1

Text Generation • 8B • Updated Dec 14, 2025 • 1

HanningZhang/physicslean_kimina_train_gen_from_claude_and_grok_deepseek_one_sample_ep2

Text Generation • 8B • Updated Dec 14, 2025 • 2

HanningZhang/physicslean_kimina_train_gen_from_claude_and_grok_deepseek_all_ep2

Text Generation • 8B • Updated Dec 14, 2025 • 2

HanningZhang/physicslean_kimina_train_gen_from_claude_and_grok_deepseek_all_ep1

Text Generation • 8B • Updated Dec 14, 2025 • 2

HanningZhang/physicslean_kimina_train_gen_from_grok_deepseek_one_sample_ep2

Text Generation • 8B • Updated Dec 14, 2025 • 3

HanningZhang/physicslean_kimina_train_gen_from_grok_deepseek_one_sample_ep1

Text Generation • 8B • Updated Dec 14, 2025 • 3

Datasets 236

HanningZhang/SAGE-all-correct-top_p10_selfcorr

Viewer • Updated Mar 6 • 2.84k • 7

HanningZhang/SAGE-all-correct-top_p10

Viewer • Updated Mar 6 • 2.84k • 4

HanningZhang/SAGE-all-correct

Viewer • Updated Mar 4 • 2.99k • 4

HanningZhang/OpenGenAlign-v2

Viewer • Updated Sep 30, 2025 • 43.5k • 14

HanningZhang/RAG-Reward-Modeling-v2

Viewer • Updated Sep 30, 2025 • 43.5k • 26

HanningZhang/scalebio_distill_qwen_math

Viewer • Updated Sep 23, 2025 • 2k • 9

HanningZhang/test-self-rewarding

Viewer • Updated Sep 4, 2025 • 40k • 6

HanningZhang/test-no-self-rewarding

Viewer • Updated Sep 4, 2025 • 40k • 7

HanningZhang/MLE-Policy-Trajectory

Viewer • Updated Jul 8, 2025 • 1.22k • 5

HanningZhang/MLE-Reward-Rating

Viewer • Updated Jul 8, 2025 • 1.86k • 10