"...resnet50_tensorflow.git" did not exist on "a5a5102b28815a55983d2d346bcd0a836d4ba38b"
  • Lei Wang's avatar
    [Refactor] Separate tilelang Pass Thread Sync (with Hopper support) from tvm (#85) · ec84188f
    Lei Wang authored
    * bump version into v0.1.0
    
    * [Enhancement] Add custom develop command for editable installs and update .gitignore
    
    * [Documentation] Update README to include system dependencies installation instructions
    
    * [Build] Update setup.py to support library file copying for both release and develop modes
    
    * [Build] Refactor library file copying logic in setup.py
    
    * [Documentation] Remove unnecessary install section header in Installation.md
    
    * [Build] Add tox configuration and local distribution script for multi-Python version support
    
    * [Build] Improve git submodule update function with better error handling
    
    * [Build] Update LLVM configuration path in ROCm installation script
    
    * [Build] Add .tox/ to .gitignore for tox testing environment
    
    * [Build] Add support for TVM prebuild path configuration in CMakeLists.txt
    
    * [Cleanup] Remove unused TVM runtime error codes header
    
    * [Cleanup] Fix TVM grid constant type reference in CUDA module
    
    * [Cleanup] Remove unused customized_code function from IR module
    
    * [Feature] Add TileLang thread synchronization and storage access analysis passes
    
    * [Build] Reorder DLL search path directories for more flexible library loading
    
    * [Refactor] Improve thread synchronization and library path handling
    
    - Rename ThreadSync and TileLangThreadSync functions in C++ code
    - Update Python docstring for ThreadSync with more detailed description
    - Reorder library path detection in tilelang environment setup
    - Minor comment and code cleanup in CUDA and warp specialization modules
    
    * [Refactor] Improve thread synchronization code style and formatting
    
    - Standardize pointer type spacing in storage_access.h and storage_access.cc
    - Update whitespace and indentation in thread_storage_sync.cc
    - Reorder include statements in thread_partial_sync.cc
    - Minor code formatting improvements across thread synchronization files
    
    * [Refactor] Fix global function registration for ThreadSync
    
    - Correct global function registration to use ThreadSync instead of TileLangThreadSync
    - Update TVM global registration to match recent refactoring efforts
    
    * [Refactor] Simplify ThreadSync global function registration
    
    - Remove unnecessary whitespace in global function registration
    - Compact the TVM global registration line for ThreadSync
    ec84188f
thread_partial_sync.cc 11.7 KB