Speculative Decoding Draft Models
Collection
A collection of efficient, OpenVINO-optimized draft models for speculative decoding.
5 items
Deep Learning Inference, Deep Learning Model Optimization