"docs/git@developer.sourcefind.cn:zhaoyu6/sglang.git" did not exist on "60e37f8028e726016ecaf952767876a3d41a1898"
- 19 Oct, 2025 1 commit
Tong WU authored
[Refactor][Example] Update linear attention examples and add tests

* Refactored the backward and forward linear attention kernels to use shared memory and atomic additions for improved performance.
* Introduced L2 normalization in the main functions of both examples.
* Added a new test suite for the linear attention examples to ensure correctness and performance.
* Updated argument parsing in the main functions for better usability.
* Updated the docstring for the TMA atomic add.
* Fixed lint.
* Added the flash-linear-attention dependency to requirements.txt.
* Renamed the main functions to chunk_linear_attn_bwd and chunk_linear_attn_fwd.
* Miscellaneous chores.

Co-authored-by: LeiWang1999 <leiwang1999@outlook.com>
Co-authored-by: Lei Wang <34334180+LeiWang1999@users.noreply.github.com>
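For readers unfamiliar with the algorithm these examples implement, below is a minimal PyTorch reference for the chunked linear attention forward pass, including the L2 normalization of queries and keys mentioned above. The function name `chunk_linear_attn_fwd` and the normalization step come from the commit message; the intra/inter-chunk decomposition shown is the standard chunked formulation and is an illustrative assumption, not the TileLang kernel itself.

```python
import torch
import torch.nn.functional as F

def chunk_linear_attn_fwd(q, k, v, chunk_size=64):
    """Reference chunked linear attention (no softmax). q, k, v: [B, H, S, D];
    S must be divisible by chunk_size. Illustrative sketch only."""
    B, H, S, D = q.shape
    # L2-normalize queries and keys along the feature dimension.
    q = F.normalize(q, p=2, dim=-1)
    k = F.normalize(k, p=2, dim=-1)
    o = torch.empty_like(v)
    # Running inter-chunk state: sum of k_s^T v_s over all previous chunks.
    state = q.new_zeros(B, H, D, v.shape[-1])
    causal = torch.tril(torch.ones(chunk_size, chunk_size,
                                   dtype=torch.bool, device=q.device))
    for start in range(0, S, chunk_size):
        end = start + chunk_size
        qc = q[..., start:end, :]
        kc = k[..., start:end, :]
        vc = v[..., start:end, :]
        # Intra-chunk term: causally masked attention within the chunk.
        intra = (qc @ kc.transpose(-1, -2)).masked_fill(~causal, 0.0) @ vc
        # Inter-chunk term: all earlier chunks, folded into the running state.
        inter = qc @ state
        o[..., start:end, :] = intra + inter
        state = state + kc.transpose(-1, -2) @ vc
    return o
```

In a GPU kernel this decomposition is parallelized across thread blocks, which is presumably where the shared-memory staging and atomic additions in the first bullet come in: partial products computed by different blocks are staged in shared memory and accumulated atomically.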
- 09 Jul, 2025 1 commit
Tong WU authored
* Tuned the linear attention examples on H100.
* Added a RetNet forward kernel.
* Fixed lint.
- 25 Jun, 2025 1 commit
Cunxiao Ni authored
[Example] Update kernel compilation in examples to use @tilelang.jit

* Refactored multiple examples to eliminate the use of `tilelang.compile` for kernel creation, invoking the decorated functions directly instead.
* Added `@tilelang.jit` decorators with appropriate output indices to improve performance and maintainability.
* Improved code clarity by simplifying kernel invocation across the examples, ensuring consistency in how kernels are defined and executed.
* Formatted the code.
* Updated example_tilelang_sparse_gqa_decode_varlen_indice.py.
* Updated example_dequant_gemm_fine_grained.py.
* Updated example_gemm_autotune.py.

Co-authored-by: Lei Wang <34334180+LeiWang1999@users.noreply.github.com>
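As a sketch of what this migration looks like, the snippet below follows the pattern of TileLang's public matmul quickstart: the kernel factory carries `@tilelang.jit(out_idx=[-1])`, so calling it yields a compiled kernel directly, rather than passing a prim_func through a separate `tilelang.compile(...)` call. The kernel body and block sizes are illustrative assumptions, not code from the updated examples.

```python
import tilelang
import tilelang.language as T

# out_idx=[-1] marks the last argument (C) as the output tensor that the
# runtime allocates and returns; calling matmul(...) returns a compiled
# kernel directly, with no separate tilelang.compile step.
@tilelang.jit(out_idx=[-1])
def matmul(M, N, K, block_M=128, block_N=128, block_K=32, dtype="float16"):
    @T.prim_func
    def main(
        A: T.Tensor((M, K), dtype),
        B: T.Tensor((K, N), dtype),
        C: T.Tensor((M, N), dtype),
    ):
        with T.Kernel(T.ceildiv(N, block_N), T.ceildiv(M, block_M),
                      threads=128) as (bx, by):
            A_shared = T.alloc_shared((block_M, block_K), dtype)
            B_shared = T.alloc_shared((block_K, block_N), dtype)
            C_local = T.alloc_fragment((block_M, block_N), "float")
            T.clear(C_local)
            # Software-pipelined loop over K tiles, staged through shared memory.
            for ko in T.Pipelined(T.ceildiv(K, block_K), num_stages=3):
                T.copy(A[by * block_M, ko * block_K], A_shared)
                T.copy(B[ko * block_K, bx * block_N], B_shared)
                T.gemm(A_shared, B_shared, C_local)
            T.copy(C_local, C[by * block_M, bx * block_N])
    return main

# kernel = matmul(1024, 1024, 1024)   # compiled on first use
# c = kernel(a, b)                    # a, b: CUDA float16 tensors
```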
- 04 Jun, 2025 1 commit
Tong WU authored
* Added linear attention examples.
* Added license headers.
* Removed comments.
* Ran yapf and ruff.