Commit f1fcfe34 authored by Yu Cheng's avatar Yu Cheng Committed by GitHub
Browse files

Update README.md with new example links for Flash MLA Decoding and Native Sparse Attention (#122)

parent 3cbf8cbc
......@@ -26,6 +26,8 @@ Although tile-lang aims to be portable across a range of Devices, it has been sp
- [Dequantization GEMM](./examples/dequantize_gemm/)
- [Flash Attention](./examples/flash_attention/)
- [Flash Linear Attention](./examples/linear_attention/)
- [Flash MLA Decoding](./examples/flash_decoding/example_mla_decode.py)
- [Native Sparse Attention](./examples/native_sparse_attention/)
Within the `examples` directory, you will also find additional complex kernels—such as convolutions, forward/backward passes for FlashAttention, more operators will continuously be added.
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment