Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
CompressedGemma
/
HPC-Quantize
like
0
License:
mit
Model card
Files
Files and versions
xet
Community
1
Copy to bucket
new
main
HPC-Quantize
Commit History
Delete generate_imatrix.py
00ba2db
verified
CompressedGemma
commited on
18 days ago
Upload 5 files
7803d72
verified
CompressedGemma
commited on
18 days ago
Upload 3 files
766f12c
verified
CompressedGemma
commited on
25 days ago
Upload hpc_forward_merged.c
e9294cc
verified
CompressedGemma
commited on
25 days ago
Upload 2 files
414e1de
verified
CompressedGemma
commited on
25 days ago
Upload 2 files
7d55b19
verified
CompressedGemma
commited on
27 days ago
Qwen changes
6bf97ec
verified
CompressedGemma
commited on
27 days ago
Upload hpc_forward_merged.c
1581489
verified
CompressedGemma
commited on
28 days ago
Qwen attention tensors
44e6b86
verified
CompressedGemma
commited on
29 days ago
Update README.md
099fd3c
verified
CompressedGemma
commited on
about 1 month ago
Update README.md
5a67f67
verified
CompressedGemma
commited on
May 7
Fix OOM
c9097e7
verified
CompressedGemma
commited on
May 7
Fix os import
e81a80a
verified
CompressedGemma
commited on
May 7
Auto-load tokenizer for merge rules
0a9e7db
verified
CompressedGemma
commited on
May 7
Heavily experimental
20bea07
verified
CompressedGemma
commited on
May 7
This should do it
a5c5f6c
verified
CompressedGemma
commited on
May 7
Some tensors are transposed lmao
f67ea3a
verified
CompressedGemma
commited on
May 7
Wow Qwen......
965a465
verified
CompressedGemma
commited on
May 7
Qwen......
fca1031
verified
CompressedGemma
commited on
May 7
Qwen patches
8a88d87
verified
CompressedGemma
commited on
May 7
Experimental imatrix
9b8ff1c
verified
CompressedGemma
commited on
May 7
Upload generate_imatrix.py
60295b3
verified
CompressedGemma
commited on
May 7
Update README.md
2a6ab91
verified
CompressedGemma
commited on
May 7
Calibration data
b33a755
verified
CompressedGemma
commited on
May 7
Qwen fix
3883e8d
verified
CompressedGemma
commited on
May 7
Experimental
9303d37
verified
CompressedGemma
commited on
May 7
Update code comments
262cc7b
verified
CompressedGemma
commited on
May 7
Tensor tweak
dc3b370
verified
CompressedGemma
commited on
May 7
Experimental support for other LLMs
5c1c396
verified
CompressedGemma
commited on
May 7
Experimental support for other LLMs
ae8c38d
verified
CompressedGemma
commited on
May 7
Update README.md
dd6d6ba
verified
CompressedGemma
commited on
May 6
Update README.md
96fce02
verified
CompressedGemma
commited on
May 6
It's only calibrated for Gemma, atm.
07b428c
verified
CompressedGemma
commited on
May 6
initial commit
819eddd
verified
CompressedGemma
commited on
May 6