LoRA adapters trained for 5 progressively shorter chain-of-thought styles on GSM8K, plus the eval artifacts behind the Pareto curve.
Frolov Anatolii
ssurface
·
AI & ML interests
None yet
Recent Activity
updated a dataset about 8 hours ago
s-nlp/toolace-rus-gemma4-31b published a dataset about 8 hours ago
s-nlp/toolace-rus-gemma4-31b updated a model 15 days ago
s-nlp/tool-calling-hallucination-modernbert-base-glaive-100pct