infinitylogesh/qwen3_1_7b_base_grpo_math_12k_fullfinetuning_baseline_50 2B • Updated 6 days ago • 13
infinitylogesh/qwen3_1_7b_base_grpo_math_12k_fullfinetuning_baseline_100 2B • Updated 6 days ago • 9
infinitylogesh/qwen3_1_7b_base_grpo_math_12k_fullfinetuning_baseline 2B • Updated 6 days ago • 10
infinitylogesh/book_dataset_no_mem_token_gte_largev1_5_M512_C1024_1B Viewer • Updated 22 days ago • 606k • 85