  1. 19 Oct, 2025 1 commit
    • [Refactor][Example] Update linear attention examples and add tests (#1010) · ae9a6f0a
      Tong WU authored
      
      
      * [Refactor][Example] Update linear attention examples and add tests
      
      - Refactored the backward and forward linear attention kernels to use shared memory and atomic additions for improved performance.
      - Introduced L2 normalization in the main functions of both examples.
      - Added a new test suite for the linear attention examples to ensure correctness and performance.
      - Updated argument parsing in the main functions for better usability.
      
      * Update docstring for TMA atomic add
      
      * lint
      
      * Add flash-linear-attention dependency to requirements.txt
      
      * Rename main function to chunk_linear_attn_bwd
      
      * Rename main function to chunk_linear_attn_fwd
      
      * chore
      
      ---------
      Co-authored-by: LeiWang1999 <leiwang1999@outlook.com>
      Co-authored-by: Lei Wang <34334180+LeiWang1999@users.noreply.github.com>
  2. 09 Jul, 2025 1 commit
  3. 25 Jun, 2025 1 commit
    • [Example] Update examples to use @tilelang.jit (#597) · 3db18726
      Cunxiao Ni authored
      
      
      * [Example] Update kernel compilation in examples to use @tilelang.jit
      
      - Refactored multiple examples to eliminate the use of `tilelang.compile` for kernel creation, directly invoking the functions instead.
      - Added `@tilelang.jit` decorators with appropriate output indices to enhance performance and maintainability.
      - Improved code clarity by simplifying the kernel invocation process across various examples, ensuring consistency in how kernels are defined and executed.
      
      * format
      
      * Update example_tilelang_sparse_gqa_decode_varlen_indice.py
      
      * Update example_dequant_gemm_fine_grained.py
      
      * Update example_gemm_autotune.py
      
      ---------
      Co-authored-by: Lei Wang <34334180+LeiWang1999@users.noreply.github.com>
  4. 04 Jun, 2025 1 commit