lllqaq/R2EGym-7B-Agent-Coder-Instruct-merged_bucketab_4sources_20260228_101548_32768_3gpu_oomfix Text Generation • 333k • Updated about 12 hours ago • 25
lllqaq/R2EGym-14B-Agent-Coder-Instruct1-traj_reward1_loose_4sources_shuf42_ckpt2400 841k • Updated 3 days ago • 11
lllqaq/R2EGym-14B-Agent-Coder-Instruct1-merged_bucketab_4sources_20260228_101548_32768_4gpu_oomfix Text Generation • 841k • Updated 5 days ago • 14
lllqaq/R2EGym-14B-Agent-Coder-Instruct-traj_bucketAB_multi_3sources_bucketAB_sft_shuf42 Text Generation • 841k • Updated 5 days ago • 12
lllqaq/R2EGym-32B-Agent-Coder-Instruct-fimMidPostV2-r2egym-32k-ckpt808 1.12M • Updated 9 days ago • 10
lllqaq/R2EGym-14B-Agent-Coder-Instruct-trajmix-gpt5miniAB-claude45AB-r2egymSFT-shuf42-32k-8gpu-oomfix Text Generation • 841k • Updated 10 days ago • 11
lllqaq/R2EGym-32B-Agent-Coder-Instruct-fim_midtrain_data_0108_212k_posttrain_r2egym_32768_8gpu Text Generation • 1.12M • Updated 11 days ago • 14
lllqaq/R2EGym-32B-Agent-Coder-Instruct-fim_midtrain_data_0108_212k_32768_8gpu Text Generation • 1.12M • Updated 14 days ago • 10
lllqaq/R2EGym-14B-Agent-Coder-Instruct1-traj-gpt5mini-full-bucketAB_32768_oomfix Text Generation • 841k • Updated 17 days ago • 13
lllqaq/R2EGym-32B-Agent-Coder-Instruct-r2egym_32768_8gpu Text Generation • 1.12M • Updated 17 days ago • 16
lllqaq/R2EGym-7B-Agent-Coder-Instruct-merged_bucketAB_32768_8gpu_oomfix Text Generation • 333k • Updated 21 days ago • 8
lllqaq/R2EGym-14B-Agent-Coder-Instruct-merged_bucketAB_32768_8gpu_oomfix Text Generation • 841k • Updated 22 days ago • 10
lllqaq/R2EGym-30B-Agent-Coder-Instruct-r2egym_32768_8gpu Text Generation • 211k • Updated 22 days ago • 12
lllqaq/R2EGym-7B-Agent-Coder-Instruct1-traj-gpt5mini-ab-sample400 Text Generation • 333k • Updated Jan 28
lllqaq/R2EGym-7B-Agent-Coder-Instruct1-r2egym-official-first400 Text Generation • 333k • Updated Jan 28 • 1
lllqaq/R2EGym-7B-Agent-Coder-Instruct1-gpt5plusr2egym-shuffle42-ropeyarn Text Generation • 333k • Updated Jan 27 • 1
lllqaq/R2EGym-14B-Agent-Coder-Instruct1-gpt5plusr2egym-shuffle42 Text Generation • 841k • Updated Jan 27 • 1
lllqaq/R2EGym-7B-Agent-Coder-Instruct1-gpt5plusr2egym-shuffle42 Text Generation • 333k • Updated Jan 27 • 1
lllqaq/R2EGym-7B-Agent-Coder-Instruct1-gpt5-traj-run1-filtered1 Text Generation • 333k • Updated Jan 26 • 3
lllqaq/R2EGym-7B-Agent-Coder-Instruct1-gpt5-traj-run1-full Text Generation • 333k • Updated Jan 26 • 1