Inference Optimization
community
AI & ML interests
None defined yet.
Recent Activity
View all activity
models 167
inference-optimization/gpt-oss-120b-ckpt4-speculator.eagle3
0.9B • Updated
• 28
inference-optimization/gpt-oss-120b-ckpt3-speculator.eagle3
0.9B • Updated
• 45
inference-optimization/Qwen3-Coder-Next.w4a16
Text Generation • 12B • Updated
• 1.62k
inference-optimization/Qwen3-32B-Thinking-speculator.eagle3
Text Generation • 2B • Updated
• 27
inference-optimization/DeepSeek-R1-NVFP4-FP8-BLOCK
397B • Updated
• 50
inference-optimization/Llama-3.2-3B-Instruct_7_bits_mode_heuristic
3B • Updated
• 15
inference-optimization/Llama-3.2-3B-Instruct_7_bits_mode_noise
3B • Updated
• 13
inference-optimization/Llama-3.2-3B-Instruct_7_bits_mode_hybrid
3B • Updated
• 14
inference-optimization/Llama-3.2-3B-Instruct_6.5_bits_mode_heuristic
3B • Updated
• 15
inference-optimization/Llama-3.2-3B-Instruct_6.5_bits_mode_noise
3B • Updated
• 15