-
ATLAS: Adaptive Transfer Scaling Laws for Multilingual Pretraining, Finetuning, and Decoding the Curse of Multilinguality
Paper • 2510.22037 • Published • 19 -
Less is More: Recursive Reasoning with Tiny Networks
Paper • 2510.04871 • Published • 505 -
The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain
Paper • 2509.26507 • Published • 544 -
Scaling Language-Centric Omnimodal Representation Learning
Paper • 2510.11693 • Published • 101
Clément Castellon
Clemspace
AI & ML interests
Reinforcement learning, Neural Architecture Search, Transformers
Recent Activity
liked
a model
about 4 hours ago
LiquidAI/LFM2.5-1.2B-Thinking
updated
a collection
3 months ago
Bangers 2025
updated
a collection
3 months ago
Bangers 2025