---
library_name: kernels
license: mit
---

This is the repository card of kernels-community/flash-mla, pushed to the Hub. It was built to be used with the [`kernels` library](https://github.com/huggingface/kernels). This card was automatically generated.
## How to use

```python
# make sure `kernels` is installed: `pip install -U kernels`
from kernels import get_kernel

kernel_module = get_kernel("kernels-community/flash-mla")

# `__version__` is the kernel's version string (an attribute, not a function)
print(kernel_module.__version__)
```
## Available functions

- `__version__`
- `FlashMLASchedMeta`
- `get_mla_metadata`
- `flash_mla_with_kvcache`
- `flash_attn_varlen_func`
- `flash_attn_varlen_qkvpacked_func`
- `flash_attn_varlen_kvpacked_func`
- `flash_mla_sparse_fwd`
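The names above match the upstream DeepSeek FlashMLA API. Below is a minimal sketch of the decode-time call pattern, assuming the upstream calling convention; the argument order, tensor shapes, and the `mla_decode` helper are assumptions for illustration, not documented by this card.

```python
# Hedged sketch: follows the upstream DeepSeek FlashMLA reference usage.
# Argument order and tensor shapes are assumptions, not taken from this card.

def mla_decode(kernel_module, q, blocked_k, block_table, cache_seqlens, dv):
    """Run MLA attention over a paged (block) KV cache.

    q:             [batch, s_q, h_q, d]               query tensor
    blocked_k:     [num_blocks, block_size, h_kv, d]  paged KV cache
    block_table:   [batch, max_blocks]                block indices per sequence
    cache_seqlens: [batch]                            current KV length per sequence
    dv:            head dimension of the value/output
    """
    s_q, h_q = q.shape[1], q.shape[2]
    h_kv = blocked_k.shape[2]
    # Compute tile-scheduler metadata once per decoding step; it tells the
    # kernel how to split work across SMs for these sequence lengths.
    tile_scheduler_metadata, num_splits = kernel_module.get_mla_metadata(
        cache_seqlens, s_q * h_q // h_kv, h_kv
    )
    out, lse = kernel_module.flash_mla_with_kvcache(
        q, blocked_k, block_table, cache_seqlens, dv,
        tile_scheduler_metadata, num_splits, causal=True,
    )
    return out, lse
```

The sketch itself is pure plumbing; actually executing the kernel requires a supported NVIDIA GPU and real `torch` tensors on that device.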
## Benchmarks

A benchmarking script is available for this kernel. Run `kernels benchmark kernels-community/flash-mla`.