
	---
	license: apache-2.0
	base_model: Wan-AI/Wan2.2-T2V-A14B-Diffusers
	pipeline_tag: text-to-video
	library_name: mlx-gen
	tags:
	- mlx
	- mlx-gen
	- mflux
	- apple-silicon
	- bf16
	- wan
	- wan2.2
	- video-generation
	- text-to-video
	- wan-a14b
	---
	# wan2.2-t2v-a14b-diffusers-bf16

	This repository contains BF16 MLX-Gen saved weights for
	[`Wan-AI/Wan2.2-T2V-A14B-Diffusers`](https://huggingface.co/Wan-AI/Wan2.2-T2V-A14B-Diffusers).
	It is designed for local Apple Silicon inference with
	[`mlx-gen`](https://github.com/lpalbou/mlx-gen).

	It uses the mflux/MLX saved-weight layout, so it is not a checkpoint that
	Diffusers or Transformers can load with `from_pretrained()`.

	## Source Model

	Original model: [`Wan-AI/Wan2.2-T2V-A14B-Diffusers`](https://huggingface.co/Wan-AI/Wan2.2-T2V-A14B-Diffusers).

	This prepared derivative follows the Apache 2.0 license of the source model.

	## Precision

	This package stores the Wan A14B T2V transformer and VAE weights in BF16 for use with the MLX-Gen runtime. The prepared folder also includes the UMT5 text encoder, scheduler metadata, tokenizer files, and model index.

	## Validation

	Measured on 2026-06-04 with `mlx-gen 0.18.9` on Apple Silicon. The upstream Diffusers source snapshot measured about 118 GiB in the local Hugging Face cache before these packages were prepared. The table below reports one full generation run per prepared package, from model initialization through MP4 save and post-save video-health validation.

	Validation profile: `384x224`, 33 frames, 12 denoising steps, guidance `4`, guidance-2 `3`, 8 fps, seed `4242`, `--low-ram`.

	| Package | Disk | Full-Process Physical Peak | Max RSS | MLX Peak | Total Time | Video Health |
	|---|---:|---:|---:|---:|---:|---|
	| This BF16 package | 64.3 GiB | 33.0 GiB | 31.8 GiB | 27.7 GiB | 152.7 s | 33/33 frames, 384x224, 8 fps, temporal delta 1.3 |
	| Mixed q8/BF16 package | 39.7 GiB | 20.7 GiB | 19.5 GiB | 15.5 GiB | 154.8 s | 33/33 frames, 384x224, 8 fps, temporal delta 1.4 |

	"Full-Process Physical Peak" is the Darwin `ri_phys_footprint` value, sampled for the full process. The validation run is intentionally small and repeatable; it does not imply that a full-size `1280x720`, 81-frame, 40-step job has the same memory or timing profile.
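
To give a rough sense of how far the validation profile sits from a full-size job, the sketch below compares the two with a naive work proxy: pixels per frame times frames times denoising steps. The proxy is an assumption made for illustration, not an mlx-gen measurement. Attention cost grows faster than linearly with token count and the VAE compresses the latent grid, so the ratio is only a loose indication. `Profile`, `duration_s`, and `work_proxy` are hypothetical helper names.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Profile:
    """Generation settings taken from this model card (hypothetical helper)."""

    width: int
    height: int
    frames: int
    steps: int
    fps: int = 8

    def duration_s(self) -> float:
        # Clip length in seconds at the chosen frame rate.
        return self.frames / self.fps

    def work_proxy(self) -> int:
        # Naive proxy: pixels per frame x frames x denoising steps.
        # Assumption for illustration only; real cost is not linear in this.
        return self.width * self.height * self.frames * self.steps


# Validation profile from the Validation section.
validation = Profile(width=384, height=224, frames=33, steps=12)

# Full-size settings named in the text. The text gives no fps for this job,
# so the default of 8 is a placeholder and is not used in the ratio.
full_size = Profile(width=1280, height=720, frames=81, steps=40)

ratio = full_size.work_proxy() / validation.work_proxy()
print(f"validation clip length: {validation.duration_s():.3f} s")
print(f"full-size / validation work proxy: {ratio:.1f}x")
```

Under this proxy the full-size job is well over an order of magnitude heavier than the validation run, which is why the table should not be extrapolated directly.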

	## Usage

	```bash
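	# Install or upgrade mlx-gen
	python -m pip install -U mlx-gen

	# Download the prepared BF16 weights
	mlxgen download --model AbstractFramework/wan2.2-t2v-a14b-diffusers-bf16

	# Generate a clip using the validation profile above
	mlxgen generate \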
	--model AbstractFramework/wan2.2-t2v-a14b-diffusers-bf16 \
	--task text-to-video \
	--prompt "A cinematic scene of a scientist working on agentic AI through the night, monitors glowing, papers shifting in a slow dolly shot." \
	--width 384 \
	--height 224 \
	--frames 33 \
	--steps 12 \
	--guidance 4 \
	--guidance-2 3 \
	--fps 8 \
	--seed 4242 \
	--low-ram \
	--metadata \
	--output video.mp4
	```

	## Compatibility

	Requires `mlx-gen >= 0.18.9`.

	Generated with `mlx-gen 0.18.9`.
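
A minimal sketch of checking the version requirement from Python before running a job. It assumes the installed distribution is named `mlx-gen` (the name used in the `pip install` command) and that its version is a plain dotted number such as `0.18.9`. The helper names `parse_version`, `meets_minimum`, and `installed_mlx_gen_version` are hypothetical and not part of mlx-gen.

```python
from __future__ import annotations

from importlib import metadata

MINIMUM = "0.18.9"  # minimum version stated in the Compatibility section


def parse_version(text: str) -> tuple[int, ...]:
    # Simplified parser: handles plain dotted integers such as "0.18.9".
    # Pre-release or local suffixes (for example "0.19.0rc1") raise ValueError.
    return tuple(int(part) for part in text.split("."))


def meets_minimum(installed: str, minimum: str = MINIMUM) -> bool:
    # Tuples compare element by element, so (0, 18, 10) >= (0, 18, 9).
    return parse_version(installed) >= parse_version(minimum)


def installed_mlx_gen_version() -> str | None:
    # Returns None when the "mlx-gen" distribution is not installed.
    try:
        return metadata.version("mlx-gen")
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_mlx_gen_version()
    if found is None:
        print("mlx-gen is not installed")
    else:
        print(f"mlx-gen {found}: meets minimum = {meets_minimum(found)}")
```

Comparing integer tuples rather than raw strings avoids the pitfall where `"0.18.10"` sorts before `"0.18.9"` lexicographically.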

	Use the `mlxgen` command and Python import path for new MLX-Gen projects.

	## Attribution

	MLX-Gen is based on [mflux](https://github.com/filipstrand/mflux) by Filip Strand and the original mflux contributors.

	Prepared and contributed by [@lpalbou](https://huggingface.co/lpalbou).