Cerebras REAP
Collection
Sparse MoE models compressed using REAP (Router-weighted Expert Activation Pruning) method • 30 items • Updated
• 131
./llama-server -m /models/mistralai_Mistral-Small-3.2-24B-Instruct-2506-Q4_K_M.gguf --jinja --chat-template-file /models/Mistral-Small-3.2-24B-Instruct-2506.jinja