1. 27 Oct, 2025 9 commits
  2. 25 Oct, 2025 1 commit
  3. 24 Oct, 2025 1 commit
  4. 23 Oct, 2025 4 commits
    • Wenhao Xie's avatar
      [Feature] Support None type as input for `T.ptr` and `T.Tensor` (#1114) · 50e789dd
      Wenhao Xie authored
      * [Feature] Support None type as input for T.ptr and T.Tensor
      
      * lint
      
      * lint
      
      * lint
      
      * lint fix
      50e789dd
    • Tong WU's avatar
      [Feature] Enhance vectorized conversion support in CUDA codegen (#1095) · a148d62a
      Tong WU authored
      * [Feature] Add vectorized float16 and float32 conversion support in CUDA codegen
      
      * Implemented handling for conversions between float16 and float32 types, specifically for vectorized operations using __half22float2 and __float22half2_rn.
      * Enhanced the existing code to support both directions of conversion based on the lane count.
      * Improved overall type handling in the VisitExpr_ method for better compatibility with TileLang.
      
      * [Feature] Add float32 to float8 conversion support in CUDA codegen
      
      * Implemented handling for conversion from float32 to float8 (E4M3/E5M2) in the VisitExpr_ method.
      * Added vectorized conversion support using __nv_cvt_float2_to_fp8x2 for float2 to fp8x2 transformations.
      * Enhanced type handling for better compatibility with TileLang, particularly for float8 types.
      
      * lint
      
      * fix a bug
      
      * [Enhancement] Support lanes=4 cases and add unit test for vectorized cast
      
      * lint
      
      * [Feature] Refactor bf16 convertion operations and remove legacy compile flags
      
      * lint
      a148d62a
    • Lei Wang's avatar
      [Refactor] Improve scalar handling in CopyNode and update loop partition dtype logi (#1111) · 86c8bb46
      Lei Wang authored
      * [Refactor] Improve scalar handling in CopyNode and update loop partition dtype logic
      
      * Refactored CopyNode::MakeSIMTLoop to handle scalar cases more efficiently by moving the scalar check to the end of the function.
      * Updated loop_partition.cc to set a default DataType for thread and vector extents, ensuring compatibility when loop_vars_ is empty.
      
      * lint fix
      
      * remove debug print
      86c8bb46
    • Yichen Yan's avatar
      [Lint] Enable pyupgrade linter in ruff (#963) · f14fb111
      Yichen Yan authored
      * update rules
      
      * ruff check
      
      * other fixes
      
      * fmt
      
      * do not touch examples
      
      * fmt
      f14fb111
  5. 22 Oct, 2025 7 commits
  6. 21 Oct, 2025 9 commits
  7. 20 Oct, 2025 9 commits