zentorch TorchAO Quantized Models - PyTorch 2.10 Collection TorchAO quantized models for AMD EPYC CPU inference. The inference stack includes vLLM (0.15.0 to 0.18.0), PyTorch 2.10, and zentorch 5.2.1. • 4 items • Updated 3 days ago
DL-QAT: Weight-Decomposed Low-Rank Quantization-Aware Training for Large Language Models Paper • 2504.09223 • Published Apr 12, 2025
AMD-Hummingbird: Towards an Efficient Text-to-Video Model Paper • 2503.18559 • Published Mar 24, 2025 • 5
Agent Laboratory: Using LLM Agents as Research Assistants Paper • 2501.04227 • Published Jan 8, 2025 • 96