---
library_name: kernels
license: mit
---
This is the repository card for `kernels-community/flash-mla`, a kernel published on the Hugging Face Hub. It is built to be used with the [`kernels` library](https://github.com/huggingface/kernels). This card was automatically generated.
## How to use
```python
# make sure `kernels` is installed: `pip install -U kernels`
from kernels import get_kernel
kernel_module = get_kernel("kernels-community/flash-mla")
# `__version__` is presumably a version string rather than a function, so read it instead of calling it
print(kernel_module.__version__)
```
## Available functions
- `__version__`
- `FlashMLASchedMeta`
- `get_mla_metadata`
- `flash_mla_with_kvcache`
- `flash_attn_varlen_func`
- `flash_attn_varlen_qkvpacked_func`
- `flash_attn_varlen_kvpacked_func`
- `flash_mla_sparse_fwd`
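The names above mix at least one non-function attribute (`__version__`) with functions and what looks like a class. A minimal sketch for checking what a loaded module actually exposes follows. The helper `describe_exports` and the stand-in `fake_module` are hypothetical, not part of the `kernels` library or of this kernel; the stand-in only lets the sketch run without a GPU or network access.

```python
from types import SimpleNamespace

# Names copied from the "Available functions" list of this card.
EXPECTED_NAMES = [
    "__version__",
    "FlashMLASchedMeta",
    "get_mla_metadata",
    "flash_mla_with_kvcache",
    "flash_attn_varlen_func",
    "flash_attn_varlen_qkvpacked_func",
    "flash_attn_varlen_kvpacked_func",
    "flash_mla_sparse_fwd",
]


def describe_exports(module, names=EXPECTED_NAMES):
    """Map each name to 'callable', 'attribute', or 'missing' (hypothetical helper)."""
    report = {}
    for name in names:
        if not hasattr(module, name):
            report[name] = "missing"
        elif callable(getattr(module, name)):
            report[name] = "callable"
        else:
            report[name] = "attribute"
    return report


# Stand-in object with made-up values, used only so this sketch is runnable anywhere.
fake_module = SimpleNamespace(
    __version__="0.0.0",
    get_mla_metadata=lambda *args, **kwargs: None,
)
report = describe_exports(fake_module)
for name, kind in report.items():
    print(f"{name}: {kind}")
```

With the real kernel, pass the object returned by `get_kernel("kernels-community/flash-mla")` in place of `fake_module`.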
## Benchmarks
A benchmarking script is available for this kernel. Run it with `kernels benchmark kernels-community/flash-mla`.