Aidan Do's picture

Aidan Do

aidando73

·

AI & ML interests

None yet

Organizations

None yet

models 37

aidando73/simplerl-v8-checkpoints

Updated Apr 6, 2025

aidando73/simplerl-Qwen2.5-Math-7B-v5-checkpoint40

8B • Updated Apr 4, 2025 • 1

aidando73/simplerl-v5-checkpoints

Updated Apr 4, 2025

aidando73/simplerl-v6-checkpoints

Updated Apr 4, 2025

aidando73/simplerl-v4-checkpoints

Updated Apr 2, 2025

aidando73/simplerl-single-grpo-v1-checkpoints

Updated Mar 29, 2025

aidando73/Qwen-2.5-7B-Simple-RL-v9

Text Generation • 8B • Updated Mar 24, 2025 • 4

aidando73/Qwen-2.5-7B-Simple-RL-v8

Text Generation • 8B • Updated Mar 24, 2025 • 3

aidando73/Qwen-2.5-7B-Simple-RL-v7

Text Generation • 8B • Updated Mar 23, 2025 • 2

aidando73/Qwen-2.5-7B-Simple-RL-v6

Text Generation • 8B • Updated Mar 23, 2025 • 3

datasets 11

aidando73/grpo-gsm8k-experiments

Preview • Updated Apr 4, 2025 • 167

aidando73/math_level3to5_data

Viewer • Updated Mar 24, 2025 • 17k • 20

aidando73/Qwen2-0.5B-GRPO-checkpoints

Updated Mar 18, 2025 • 70

aidando73/grpo-summarization-evals

Preview • Updated Mar 18, 2025 • 6

aidando73/wiki-events

Viewer • Updated Mar 13, 2025 • 488 • 14

aidando73/coding-agent-2

Updated Mar 9, 2025 • 116

aidando73/llama-coding-agent-evals

Updated Jan 22, 2025 • 2.38k

aidando73/swe-bench-fine-tune

Preview • Updated Jan 17, 2025 • 208

aidando73/llama-codes-swe-bench-evals

Viewer • Updated Jan 12, 2025 • 149k • 392

aidando73/open-hands-swe-bench-evals

Preview • Updated Dec 26, 2024 • 125

View 11 datasets