"src/include/threadwise_tensor_slice_copy.hpp" did not exist on "05e046654c9a226444091806a418a77fe0e4a4c2"
  • Lei Wang's avatar
    [Language] Introduce `StridedTensor` to support non contigious torch inputs (#722) · 1b308baf
    Lei Wang authored
    
    
    * Update submodule 'tvm' to commit e11521e6936a827efa334588d29571fbb4620107
    
    * Support strided tensors
    
    * Refactor target attribute helper functions for improved clarity
    
    * No code changes made in proxy.py and setup.py
    
    * lint fix
    
    * lint fix via gemini
    
    * lint fix
    
    * test fix
    
    * test fix
    
    * lint fix
    
    * Update wrapper.py
    
    * test fix
    
    * Enhance test for InjectSoftwarePipeline by adding LowerOpaqueBlock transformation and updating expected function signature to use match_buffer for better clarity.
    
    * lint fix
    
    ---------
    Co-authored-by: default avatarChenggang Zhao <chenggangz@deepseek.com>
    1b308baf
codegen_cuda.cc 76.1 KB