jackD
/

DimVQ

vector-quantization

image-tokenizer

codebook-regularization

Model card Files Files and versions

DimVQ / README.md

jackD's picture

Upload README.md with huggingface_hub

43bdfe8 verified 16 days ago

|

history blame contribute delete

1.82 kB

	---
	license: mit
	tags:
	- vector-quantization
	- image-tokenizer
	- codebook-regularization
	- icml2026
	datasets:
	- imagenet-1k
	---

	# DimVQ: Unveiling And Addressing Dimensional Collapse In Vector Quantization Models Via Codebook Regularization

	Official pre-trained checkpoints for the ICML 2026 paper.

	## Model Description

	DimVQ identifies dimensional collapse in vector quantization models and proposes a simple codebook regularization to restore suppressed low-variance components. This regularization bridges the spectral gap between discrete codebook spaces and continuous representations.

	## Available Checkpoints

	\| File \| Model \| Resolution \| Codebook Size (K) \| Embedding Dim (D) \|
	\|------\|-------\|-----------\|-------------------\|-------------------\|
	\| `simvq_K65536/65536.ckpt` \| SimVQ + Codebook Reg. \| 128x128 \| 65,536 \| 128 \|
	\| `simvq_K65536/65536.yaml` \| Config for above \| - \| - \| - \|
	\| `simvq_K262144/262144.ckpt` \| SimVQ + Codebook Reg. \| 128x128 \| 262,144 \| 128 \|
	\| `simvq_K262144/262144.yaml` \| Config for above \| - \| - \| - \|

	## Usage

	```python
	# Load checkpoint
	import torch
	checkpoint = torch.load("262144.ckpt", map_location="cpu")
	model.load_state_dict(checkpoint["state_dict"])
	```

	## TODO

	- [ ] IBQ checkpoints (K=16384, K=262144, 256x256)
	- [ ] Downstream autoregressive generation models (IBQ-B, IBQ-L, IBQ-XXL)

	## Citation

	```bibtex
	@inproceedings{zhang2026dimvq,
	title={Unveiling And Addressing Dimensional Collapse In Vector Quantization Models Via Codebook Regularization},
	author={Zhang, Fang and Zhu, Yongxin and Liu, Yihao and Fu, Bin and Xu, Linli},
	booktitle={International Conference on Machine Learning (ICML)},
	year={2026}
	}
	```

	## Links

	- [Paper (arXiv)](https://arxiv.org/abs/TODO)
	- [Code (GitHub)](https://github.com/ksblk2116/dimvq)