Block Diffusion for Flash Speculative Decoding
AI & ML interests
Efficient AI
Recent Activity
Papers
DFlash: Block Diffusion for Flash Speculative Decoding
ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference
Pairwise Rotation Quantization for Efficient Reasoning LLM Inference
-
ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference
Paper • 2511.10645 • Published • 10 -
z-lab/Qwen3.6-27B-PARO
Image-Text-to-Text • 6B • Updated • 3.31k • 13 -
z-lab/gemma-4-31B-it-PARO
Image-Text-to-Text • 6B • Updated • 16.1k • 19 -
z-lab/gemma-4-E2B-it-PARO
Image-Text-to-Text • 3B • Updated • 1.16k • 6
Block Diffusion for Flash Speculative Decoding
Pairwise Rotation Quantization for Efficient Reasoning LLM Inference
-
ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference
Paper • 2511.10645 • Published • 10 -
z-lab/Qwen3.6-27B-PARO
Image-Text-to-Text • 6B • Updated • 3.31k • 13 -
z-lab/gemma-4-31B-it-PARO
Image-Text-to-Text • 6B • Updated • 16.1k • 19 -
z-lab/gemma-4-E2B-it-PARO
Image-Text-to-Text • 3B • Updated • 1.16k • 6
models 42
z-lab/MiniMax-M2.7-DFlash
0.6B • Updated • 10 • 6
z-lab/gemma-4-26B-A4B-it-DFlash
Text Generation • 0.4B • Updated • 8.16k • 35
z-lab/gemma-4-31B-it-DFlash
Text Generation • 2B • Updated • 5.19k • 71
z-lab/Qwen3.6-27B-PARO
Image-Text-to-Text • 6B • Updated • 3.31k • 13
z-lab/Qwen3.5-35B-A3B-PARO
Image-Text-to-Text • 6B • Updated • 959 • 11
z-lab/Qwen3.5-27B-PARO
Image-Text-to-Text • 6B • Updated • 4.31k • 20
z-lab/Qwen3.5-9B-PARO
Image-Text-to-Text • 3B • Updated • 28.1k • 47
z-lab/Qwen3.5-4B-PARO
Image-Text-to-Text • 1B • Updated • 2.12k • 17
z-lab/Qwen3.5-2B-PARO
Image-Text-to-Text • 1B • Updated • 262 • 3
z-lab/Qwen3.5-0.8B-PARO
Image-Text-to-Text • 0.4B • Updated • 382 • 4
datasets 7
z-lab/kimi-k26-regen
Viewer • Updated • 1M • 18 • 3
z-lab/humaneval-long
Viewer • Updated • 1k • 41
z-lab/gsm8k-filtered
Viewer • Updated • 1.31k • 30
z-lab/mt-bench-filtered
Viewer • Updated • 79 • 16
z-lab/mbpp-sanitized-filtered
Viewer • Updated • 256 • 70
z-lab/humaneval-filtered
Viewer • Updated • 137 • 15
z-lab/qwen3-4b-instruct-100k
Viewer • Updated • 100k • 19