Raghav-Singhal/tulu3sft-normal-smollm-1p7b-500B-30n-2048sl-960gbsz Text Generation • 2B • Updated about 4 hours ago
Raghav-Singhal/normal-smollm-1p7b-500B-30n-2048sl-960gbsz Text Generation • 2B • Updated about 6 hours ago
Raghav-Singhal/dpo-tulu3-lr1e-6-beta0.05-tulu3sft-100B-normal-fixed-off-policy-if 2B • Updated 9 days ago • 22
Raghav-Singhal/dpo-tulu3-lr1e-6-beta0.1-tulu3sft-100B-normal-fixed-off-policy-if 2B • Updated 9 days ago • 93
Raghav-Singhal/dpo-tulu3-lr5e-7-tulu3sft-100B-no-bad-data-off-policy-if Text Generation • 2B • Updated 9 days ago • 413
Raghav-Singhal/dpo-tulu3-lr5e-7-tulu3sft-100B-normal-fixed-off-policy-if Text Generation • 2B • Updated 9 days ago • 427
Raghav-Singhal/tulu3sft-normal-smollm-1p7b-100B-20n-2048sl-960gbsz-no-bad-data 2B • Updated 10 days ago • 77
Raghav-Singhal/pretrain-normal-smollm-1p7b-100B-20n-2048sl-960gbsz-no-bad-data 2B • Updated 10 days ago • 42
Raghav-Singhal/tulu3-normal-fixed-smollm-1p7b-100B-20n-2048sl-960gbsz-4n-gbs128 2B • Updated 11 days ago • 352
Raghav-Singhal/pretrain-normal-smollm-1p7b-100B-20n-2048sl-960gbsz-sft-tulu3sft Text Generation • 2B • Updated 12 days ago • 184
Raghav-Singhal/pretrain-normal-smollm-1p7b-100B-20n-2048sl-960gbsz Text Generation • 2B • Updated 12 days ago • 38