OpenReasoning

Activity Feed

AI & ML interests

None defined yet.

lincharliesun

authored 4 papers 2 months ago

FABLE: Forest-Based Adaptive Bi-Path LLM-Enhanced Retrieval for Multi-Document Reasoning

Paper • 2601.18116 • Published Jan 26 • 13

submitted 2 papers to Daily Papers 2 months ago

FABLE: Forest-Based Adaptive Bi-Path LLM-Enhanced Retrieval for Multi-Document Reasoning

Paper • 2601.18116 • Published Jan 26 • 13

TriPlay-RL: Tri-Role Self-Play Reinforcement Learning for LLM Safety Alignment

Paper • 2601.18292 • Published Jan 26 • 12

mx1024

authored a paper 4 months ago

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

Paper • 2511.11793 • Published Nov 14, 2025 • 194

lincharliesun

authored a paper 7 months ago

Efficient Switchable Safety Control in LLMs via Magic-Token-Guided Co-Training

Paper • 2508.14904 • Published Aug 12, 2025 • 2

JjjjjZzz

authored 2 papers 10 months ago

TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation

Paper • 2503.04872 • Published Mar 6, 2025 • 15

Evaluation is All You Need: Strategic Overclaiming of LLM Reasoning Capabilities Through Evaluation Design

Paper • 2506.04734 • Published Jun 5, 2025 • 21

mx1024

authored 2 papers 10 months ago

Stress Testing Generalization: How Minor Modifications Undermine Large Language Model Performance

Paper • 2502.12459 • Published Feb 18, 2025 • 3

Evaluation is All You Need: Strategic Overclaiming of LLM Reasoning Capabilities Through Evaluation Design

Paper • 2506.04734 • Published Jun 5, 2025 • 21

lincharliesun

authored a paper 10 months ago

Evaluation is All You Need: Strategic Overclaiming of LLM Reasoning Capabilities Through Evaluation Design

Paper • 2506.04734 • Published Jun 5, 2025 • 21

zhaoguangxiang

authored 3 papers about 1 year ago

Stress Testing Generalization: How Minor Modifications Undermine Large Language Model Performance

Paper • 2502.12459 • Published Feb 18, 2025 • 3

LongAttn: Selecting Long-context Training Data via Token-level Attention

Paper • 2502.16860 • Published Feb 24, 2025 • 1

Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision

Paper • 2502.20790 • Published Feb 28, 2025

Husserl233

authored a paper about 1 year ago

TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation

Paper • 2503.04872 • Published Mar 6, 2025 • 15

yuhanwuuu

authored a paper about 1 year ago

TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation

Paper • 2503.04872 • Published Mar 6, 2025 • 15

zhaoguangxiang

authored a paper about 1 year ago

TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation

Paper • 2503.04872 • Published Mar 6, 2025 • 15

lincharliesun

authored a paper about 1 year ago

Stress Testing Generalization: How Minor Modifications Undermine Large Language Model Performance

Paper • 2502.12459 • Published Feb 18, 2025 • 3

Team members: 6