DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation Paper • 2601.09688 • Published 1 day ago • 90
Controlled Self-Evolution for Algorithmic Code Optimization Paper • 2601.07348 • Published 4 days ago • 94
Fast-ThinkAct: Efficient Vision-Language-Action Reasoning via Verbalizable Latent Planning Paper • 2601.09708 • Published 1 day ago • 36
A^3-Bench: Benchmarking Memory-Driven Scientific Reasoning via Anchor and Attractor Activation Paper • 2601.09274 • Published 1 day ago • 74
ExpSeek: Self-Triggered Experience Seeking for Web Agents Paper • 2601.08605 • Published 2 days ago • 15
EvoFSM: Controllable Self-Evolution for Deep Research with Finite State Machines Paper • 2601.09465 • Published 1 day ago • 11
Aligning Text, Code, and Vision: A Multi-Objective Reinforcement Learning Framework for Text-to-Visualization Paper • 2601.04582 • Published 8 days ago • 5
EpiCaR: Knowing What You Don't Know Matters for Better Reasoning in LLMs Paper • 2601.06786 • Published 5 days ago • 6
VideoLoom: A Video Large Language Model for Joint Spatial-Temporal Understanding Paper • 2601.07290 • Published 4 days ago • 7
UM-Text: A Unified Multimodal Model for Image Understanding Paper • 2601.08321 • Published 3 days ago • 5
ArenaRL: Scaling RL for Open-Ended Agents via Tournament-based Relative Ranking Paper • 2601.06487 • Published 6 days ago • 41
MemoBrain: Executive Memory as an Agentic Brain for Reasoning Paper • 2601.08079 • Published 3 days ago • 34
MemGovern: Enhancing Code Agents through Learning from Governed Human Experiences Paper • 2601.06789 • Published 5 days ago • 73
Watching, Reasoning, and Searching: A Video Deep Research Benchmark on Open Web for Agentic Video Reasoning Paper • 2601.06943 • Published 4 days ago • 198