DARYL LaMar MOORE
darylmooreNC
·
AI & ML interests
Agents, training, reasoning
Recent Activity
updated a collection 1 day ago
LLM Architectures updated a collection 1 day ago
Agentic AI Training and Tuning updated a collection 4 days ago
LLM Reasoning Organizations
None yet
Researcg
Multi-Agent Infrastructure
-
Latent Collaboration in Multi-Agent Systems
Paper • 2511.20639 • Published • 128 -
Learning When to Act or Refuse: Guarding Agentic Reasoning Models for Safe Multi-Step Tool Use
Paper • 2603.03205 • Published • 13 -
Hyperagents
Paper • 2603.19461 • Published • 50 -
Code-as-Room: Generating 3D Rooms from Top-Down View Images via Agentic Code Synthesis
Paper • 2605.18451 • Published • 40
LLM Architectures
-
Kimi Linear: An Expressive, Efficient Attention Architecture
Paper • 2510.26692 • Published • 133 -
GLM-5: from Vibe Coding to Agentic Engineering
Paper • 2602.15763 • Published • 150 -
Believe Your Model: Distribution-Guided Confidence Calibration
Paper • 2603.03872 • Published • 40 -
OpenWorldLib: A Unified Codebase and Definition of Advanced World Models
Paper • 2604.04707 • Published • 203
Reinforcement Learning
-
Demystifying Reinforcement Learning in Agentic Reasoning
Paper • 2510.11701 • Published • 33 -
LoongRL:Reinforcement Learning for Advanced Reasoning over Long Contexts
Paper • 2510.19363 • Published • 63 -
Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning
Paper • 2510.25992 • Published • 48 -
Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence
Paper • 2511.07384 • Published • 19
Sports Predictive Modeling
Research AI
-
OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis
Paper • 2603.20278 • Published • 98 -
Rethinking Token-Level Policy Optimization for Multimodal Chain-of-Thought
Paper • 2603.22847 • Published • 26 -
Omni-SimpleMem: Autoresearch-Guided Discovery of Lifelong Multimodal Agent Memory
Paper • 2604.01007 • Published • 31
LLM Reasoning
-
Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections
Paper • 2603.12180 • Published • 65 -
In-Context Reinforcement Learning for Tool Use in Large Language Models
Paper • 2603.08068 • Published • 43 -
Generative Recursive Reasoning
Paper • 2605.19376 • Published • 27
LLM Training Methodologies
-
Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence
Paper • 2511.07384 • Published • 19 -
Believe Your Model: Distribution-Guided Confidence Calibration
Paper • 2603.03872 • Published • 40 -
WiT: Waypoint Diffusion Transformers via Trajectory Conflict Navigation
Paper • 2603.15132 • Published • 35 -
Long Context Pre-Training with Lighthouse Attention
Paper • 2605.06554 • Published • 29
Agentic AI Training and Tuning
-
Tongyi DeepResearch Technical Report
Paper • 2510.24701 • Published • 103 -
Kimi Linear: An Expressive, Efficient Attention Architecture
Paper • 2510.26692 • Published • 133 -
Natural-Language Agent Harnesses
Paper • 2603.25723 • Published • 25 -
CORAL: Towards Autonomous Multi-Agent Evolution for Open-Ended Discovery
Paper • 2604.01658 • Published • 55
Agentic AI
-
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
Paper • 2509.02547 • Published • 238 -
Demystifying Reinforcement Learning in Agentic Reasoning
Paper • 2510.11701 • Published • 33 -
DeepAnalyze: Agentic Large Language Models for Autonomous Data Science
Paper • 2510.16872 • Published • 113
Large Language Models
-
Universal Deep Research: Bring Your Own Model and Strategy
Paper • 2509.00244 • Published • 14 -
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
Paper • 2509.02547 • Published • 238 -
Efficient Multi-modal Large Language Models via Progressive Consistency Distillation
Paper • 2510.00515 • Published • 41 -
DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search
Paper • 2509.25454 • Published • 147
Visual Reasoning Models
Research AI
-
OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis
Paper • 2603.20278 • Published • 98 -
Rethinking Token-Level Policy Optimization for Multimodal Chain-of-Thought
Paper • 2603.22847 • Published • 26 -
Omni-SimpleMem: Autoresearch-Guided Discovery of Lifelong Multimodal Agent Memory
Paper • 2604.01007 • Published • 31
Researcg
LLM Reasoning
-
Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections
Paper • 2603.12180 • Published • 65 -
In-Context Reinforcement Learning for Tool Use in Large Language Models
Paper • 2603.08068 • Published • 43 -
Generative Recursive Reasoning
Paper • 2605.19376 • Published • 27
Multi-Agent Infrastructure
-
Latent Collaboration in Multi-Agent Systems
Paper • 2511.20639 • Published • 128 -
Learning When to Act or Refuse: Guarding Agentic Reasoning Models for Safe Multi-Step Tool Use
Paper • 2603.03205 • Published • 13 -
Hyperagents
Paper • 2603.19461 • Published • 50 -
Code-as-Room: Generating 3D Rooms from Top-Down View Images via Agentic Code Synthesis
Paper • 2605.18451 • Published • 40
LLM Training Methodologies
-
Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence
Paper • 2511.07384 • Published • 19 -
Believe Your Model: Distribution-Guided Confidence Calibration
Paper • 2603.03872 • Published • 40 -
WiT: Waypoint Diffusion Transformers via Trajectory Conflict Navigation
Paper • 2603.15132 • Published • 35 -
Long Context Pre-Training with Lighthouse Attention
Paper • 2605.06554 • Published • 29
LLM Architectures
-
Kimi Linear: An Expressive, Efficient Attention Architecture
Paper • 2510.26692 • Published • 133 -
GLM-5: from Vibe Coding to Agentic Engineering
Paper • 2602.15763 • Published • 150 -
Believe Your Model: Distribution-Guided Confidence Calibration
Paper • 2603.03872 • Published • 40 -
OpenWorldLib: A Unified Codebase and Definition of Advanced World Models
Paper • 2604.04707 • Published • 203
Agentic AI Training and Tuning
-
Tongyi DeepResearch Technical Report
Paper • 2510.24701 • Published • 103 -
Kimi Linear: An Expressive, Efficient Attention Architecture
Paper • 2510.26692 • Published • 133 -
Natural-Language Agent Harnesses
Paper • 2603.25723 • Published • 25 -
CORAL: Towards Autonomous Multi-Agent Evolution for Open-Ended Discovery
Paper • 2604.01658 • Published • 55
Reinforcement Learning
-
Demystifying Reinforcement Learning in Agentic Reasoning
Paper • 2510.11701 • Published • 33 -
LoongRL:Reinforcement Learning for Advanced Reasoning over Long Contexts
Paper • 2510.19363 • Published • 63 -
Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning
Paper • 2510.25992 • Published • 48 -
Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence
Paper • 2511.07384 • Published • 19
Agentic AI
-
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
Paper • 2509.02547 • Published • 238 -
Demystifying Reinforcement Learning in Agentic Reasoning
Paper • 2510.11701 • Published • 33 -
DeepAnalyze: Agentic Large Language Models for Autonomous Data Science
Paper • 2510.16872 • Published • 113
Sports Predictive Modeling
Large Language Models
-
Universal Deep Research: Bring Your Own Model and Strategy
Paper • 2509.00244 • Published • 14 -
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
Paper • 2509.02547 • Published • 238 -
Efficient Multi-modal Large Language Models via Progressive Consistency Distillation
Paper • 2510.00515 • Published • 41 -
DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search
Paper • 2509.25454 • Published • 147