·
AI & ML interests
None yet
Organizations
None yet
models 37
aidando73/simplerl-v8-checkpoints
Updated
aidando73/simplerl-Qwen2.5-Math-7B-v5-checkpoint40
8B • Updated • 2
aidando73/simplerl-v5-checkpoints
Updated
aidando73/simplerl-v6-checkpoints
Updated
aidando73/simplerl-v4-checkpoints
Updated
aidando73/simplerl-single-grpo-v1-checkpoints
Updated
aidando73/Qwen-2.5-7B-Simple-RL-v9
Text Generation
• 8B • Updated • 4
aidando73/Qwen-2.5-7B-Simple-RL-v8
Text Generation
• 8B • Updated • 3
aidando73/Qwen-2.5-7B-Simple-RL-v7
Text Generation
• 8B • Updated • 2
aidando73/Qwen-2.5-7B-Simple-RL-v6
Text Generation
• 8B • Updated • 3
datasets 11
aidando73/grpo-gsm8k-experiments
Preview
• Updated • 119
aidando73/math_level3to5_data
Viewer
• Updated • 17k • 22
aidando73/Qwen2-0.5B-GRPO-checkpoints
Updated • 56
aidando73/grpo-summarization-evals
Preview
• Updated • 4
Viewer
• Updated • 488 • 11
Updated • 114
aidando73/llama-coding-agent-evals
Updated • 2.08k
aidando73/swe-bench-fine-tune
Preview
• Updated • 203
aidando73/llama-codes-swe-bench-evals
Viewer
• Updated • 149k • 375
aidando73/open-hands-swe-bench-evals
Preview
• Updated • 106