Jackrong/Qwen3.5-0.8B-Claude-4.6-Opus-Reasoning-Distilled Text Generation • 0.9B • Updated 4 days ago • 358 • 3
Jackrong/Qwen3.5-2B-Claude-4.6-Opus-Reasoning-Distilled Text Generation • 2B • Updated 3 days ago • 716 • 7
Jackrong/Qwen3.5-9B-Claude-4.6-Opus-Reasoning-Distilled Text Generation • 10B • Updated 3 days ago • 1.46k • 19
Jackrong/Qwen3.5-4B-Claude-4.6-Opus-Reasoning-Distilled Text Generation • 5B • Updated 3 days ago • 829 • 6
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled Text Generation • 28B • Updated 2 days ago • 23k • 334
🤏 Smol-Data Collection Tried and tested mixes for strong pretraining. Inspired by https://huggingface.co/blog/codelion/optimal-dataset-mixing • 14 items • Updated 8 days ago • 12