Garrosh Icecream
GarroshIcecream
AI & ML interests
From tiny SLMs to massive LLMs. I'm all about text-to-text fun.
Organizations
None yet
READ ON TOILET
- Speed Always Wins: A Survey on Efficient Architectures for Large Language Models
  Paper • 2508.09834 • Published • 53
- The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
  Paper • 2509.02547 • Published • 238
- DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search
  Paper • 2509.25454 • Published • 147
- DeMo: Decoupled Momentum Optimization
  Paper • 2411.19870 • Published • 6
P(DOOM) = 1.0
- FutureX: An Advanced Live Benchmark for LLM Agents in Future Prediction
  Paper • 2508.11987 • Published • 73
- InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
  Paper • 2508.18265 • Published • 222
- Less is More: Recursive Reasoning with Tiny Networks
  Paper • 2510.04871 • Published • 514
models 0
None public yet
datasets 0
None public yet