"src/include/threadwise_direct_convolution.hpp" did not exist on "b7d052459d1f67cd3c1fdcb331027da18a479e63"
Commit f1fcfe34 authored by Yu Cheng's avatar Yu Cheng Committed by GitHub
Browse files

Update README.md with new example links for Flash MLA Decoding and Native Sparse Attention (#122)

parent 3cbf8cbc
...@@ -26,6 +26,8 @@ Although tile-lang aims to be portable across a range of Devices, it has been sp ...@@ -26,6 +26,8 @@ Although tile-lang aims to be portable across a range of Devices, it has been sp
- [Dequantization GEMM](./examples/dequantize_gemm/) - [Dequantization GEMM](./examples/dequantize_gemm/)
- [Flash Attention](./examples/flash_attention/) - [Flash Attention](./examples/flash_attention/)
- [Flash Linear Attention](./examples/linear_attention/) - [Flash Linear Attention](./examples/linear_attention/)
- [Flash MLA Decoding](./examples/flash_decoding/example_mla_decode.py)
- [Native Sparse Attention](./examples/native_sparse_attention/)
Within the `examples` directory, you will also find additional complex kernels—such as convolutions, forward/backward passes for FlashAttention, more operators will continuously be added. Within the `examples` directory, you will also find additional complex kernels—such as convolutions, forward/backward passes for FlashAttention, more operators will continuously be added.
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment