8 4

David Limpus

TheRealPilot638

AI & ML interests

HW/SW Co-design for efficient AI inference & training | RL

Recent Activity

upvoted a collection about 2 months ago

TraDo Series

upvoted a paper 2 months ago

Recursive Language Models

upvoted a paper 2 months ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

View all activity

Organizations

upvoted a collection about 2 months ago

TraDo Series

Collection

SOTA Diffusion Large Language Models • 5 items • Updated Sep 11, 2025 • 13

upvoted 2 papers 2 months ago

Recursive Language Models

Paper • 2512.24601 • Published Dec 31, 2025 • 91

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published Jan 8 • 229

upvoted an article 3 months ago

Article

⚡ nano-vLLM: Lightweight, Low-Latency LLM Inference from Scratch

Jun 28, 2025

•

updated a dataset 8 months ago

TheRealPilot638/Llama-3.1-8B-PRM-Skywork-llama-o1-Math500

Updated Jul 20, 2025 • 6

published a dataset 8 months ago

TheRealPilot638/Llama-3.1-8B-PRM-Skywork-llama-o1-Math500

Updated Jul 20, 2025 • 6

updated a dataset 8 months ago

TheRealPilot638/Qwen3-8B-report-recreate-Math500

Updated Jul 14, 2025 • 8

published a dataset 8 months ago

TheRealPilot638/Qwen3-8B-report-recreate-Math500

Updated Jul 14, 2025 • 8

updated a dataset 8 months ago

TheRealPilot638/Qwen3-8B-Math500-baseline

Viewer • Updated Jul 12, 2025 • 500 • 7

published a dataset 8 months ago

TheRealPilot638/Qwen3-8B-Math500-baseline

Viewer • Updated Jul 12, 2025 • 500 • 7

updated a dataset 8 months ago

TheRealPilot638/DeepSeek-R1-0528-Qwen3-8B-Math500-baseline

Viewer • Updated Jul 11, 2025 • 500 • 6

upvoted 2 articles 8 months ago

Article

All LLMs Will Be Sparse BitNet Hybrids

May 14, 2025

•

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Sep 18, 2024

•

276

published a dataset 9 months ago

TheRealPilot638/DeepSeek-R1-0528-Qwen3-8B-Math500-baseline

Viewer • Updated Jul 11, 2025 • 500 • 6

updated 2 datasets 9 months ago

TheRealPilot638/Qwen3-8B-BS16-RLHF-PRM-GPQA

Viewer • Updated Jun 26, 2025 • 198 • 96

TheRealPilot638/DeepSeek-R1-Distill-Llama-8B-Reasoning-longToken-GPQA

Viewer • Updated Jun 26, 2025 • 198 • 88

published a dataset 9 months ago

TheRealPilot638/Qwen3-8B-BS16-RLHF-PRM-GPQA

Viewer • Updated Jun 26, 2025 • 198 • 96

updated a dataset 9 months ago

TheRealPilot638/DeepSeek-R1-Distill-Qwen3-8B-Reasoning-longToken-GPQA

Viewer • Updated Jun 25, 2025 • 198 • 103

published 2 datasets 9 months ago

TheRealPilot638/DeepSeek-R1-Distill-Llama-8B-Reasoning-longToken-GPQA

Viewer • Updated Jun 26, 2025 • 198 • 88

TheRealPilot638/DeepSeek-R1-Distill-Qwen3-8B-Reasoning-longToken-GPQA

Viewer • Updated Jun 25, 2025 • 198 • 103

David Limpus

AI & ML interests

Recent Activity

Organizations

TheRealPilot638's activity

⚡ nano-vLLM: Lightweight, Low-Latency LLM Inference from Scratch

All LLMs Will Be Sparse BitNet Hybrids

Fine-tuning LLMs to 1.58bit: extreme quantization made easy