fieryTransition's Collections
papers
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
Paper • 2403.05530 • Published • 65
Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order
Paper • 2404.00399 • Published • 42
Rho-1: Not All Tokens Are What You Need
Paper • 2404.07965 • Published • 93
Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing
Paper • 2406.08464 • Published • 71
Building Math Agents with Multi-Turn Iterative Preference Learning
Paper • 2409.02392 • Published • 16
RADLADS: Rapid Attention Distillation to Linear Attention Decoders at Scale
Paper • 2505.03005 • Published • 36
MeshCoder: LLM-Powered Structured Mesh Code Generation from Point Clouds
Paper • 2508.14879 • Published • 69
Stronger Normalization-Free Transformers
Paper • 2512.10938 • Published • 20
DyLoRA: Parameter Efficient Tuning of Pre-trained Models using Dynamic Search-Free Low-Rank Adaptation
Paper • 2210.07558 • Published • 1
Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning
Paper • 2512.20605 • Published • 61