1. 22 Feb, 2025 1 commit
    • [Wheel] Provide bare Docker scripts to help build wheels for manylinux (#105) · b4bd2a56
      Lei Wang authored
      * [Build] Improve build configuration and package distribution support
      
      - Add `build` to requirements-build.txt for package building
      - Update MANIFEST.in to include Cython wrapper source file
      - Enhance setup.py to improve Cython file copying logic
      - Update build scripts to support multi-Python version distribution
      
      * [Build] Improve Cython file handling in setup.py and MANIFEST.in
      
      - Remove Cython wrapper from MANIFEST.in
      - Enhance setup.py to create target directory if it doesn't exist when copying Cython files
      - Improve file copying logic for Cython source files during build process
      
      * [Build] Remove Cython file copying logic in setup.py
      
      - Comment out Cython file copying code in TileLangBuilPydCommand
      - Simplify setup.py build process by removing redundant Cython file handling
      
      * [Build] Enhance Docker distribution scripts for multi-Python version support
      
      - Refactor local and PyPI distribution Docker scripts
      - Replace hardcoded Python installation with Miniconda-based multi-version Python environment
      - Improve Docker image setup with dynamic Python version creation
      - Simplify build process by using Miniconda for Python environment management
      
      * [Build] Separate lint requirements into a dedicated file
      
      - Create new requirements-lint.txt for formatting and linting tools
      - Update format.sh to use requirements-lint.txt instead of requirements-dev.txt
      - Update requirements-dev.txt and requirements-test.txt to reference requirements-lint.txt
      - Improve dependency management by isolating lint-specific requirements
      
      * [Build] Restore Cython file copying logic in setup.py
      
      - Re-add Cython file copying mechanism in TileLangBuilPydCommand
      - Implement robust file search across multiple potential directories
      - Add warning for cases where Cython source files cannot be found
      - Improve build process reliability for Cython source files
      
      * [Build] Refactor Cython file copying logic in setup.py
      
      - Simplify Cython file copying mechanism in TileLangBuilPydCommand
      - Improve directory creation and file copying for Cython source files
      - Relocate potential directories list to a more logical position
      - Enhance robustness of file and directory handling during build process
      
      * [Build] Refine Cython file copying logic in setup.py
      
      - Improve file existence check when copying Cython source files
      - Use os.path.join to construct full path for more robust file checking
      - Enhance file copying mechanism in TileLangBuilPydCommand
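
The copy behaviour described in the messages above (search several candidate directories, build each path with `os.path.join`, create the target directory if it is missing, warn when nothing is found) could look roughly like the sketch below. It is only an illustration under assumptions: `copy_first_match`, its parameters, and the file name in the usage note are hypothetical and are not taken from the project's setup.py.

```python
import os
import shutil
import warnings


def copy_first_match(filename, candidate_dirs, target_dir):
    """Copy `filename` from the first candidate directory that contains it.

    Hypothetical helper: returns the destination path, or None (after
    emitting a warning) when no candidate directory has the file.
    """
    for directory in candidate_dirs:
        # Build the full path before checking, so the check is on the file itself.
        source = os.path.join(directory, filename)
        if os.path.isfile(source):
            # Create the target directory only when there is something to copy.
            os.makedirs(target_dir, exist_ok=True)
            return shutil.copy2(source, os.path.join(target_dir, filename))
    warnings.warn(f"{filename} not found in any of: {list(candidate_dirs)}")
    return None
```

A call such as `copy_first_match("wrapper.pyx", ["src", "build/src"], "build/lib/pkg")` would copy from the first directory that holds the file. Warning instead of raising keeps a build going when the source file is optional, which is a design choice assumed here rather than one confirmed by the commit.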
  2. 13 Feb, 2025 1 commit
    • [WHL] Support whl building for different python versions via tox (#83) · f55defac
      Lei Wang authored
      * Bump version to v0.1.0
      
      * [Enhancement] Add custom develop command for editable installs and update .gitignore
      
      * [Documentation] Update README to include system dependencies installation instructions
      
      * [Build] Update setup.py to support library file copying for both release and develop modes
      
      * [Build] Refactor library file copying logic in setup.py
      
      * [Documentation] Remove unnecessary install section header in Installation.md
      
      * [Build] Add tox configuration and local distribution script for multi-Python version support
      
      * [Build] Improve git submodule update function with better error handling
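
A submodule update function with explicit error handling, as mentioned in the last message above, might be sketched as follows. This is an assumption-laden illustration, not the project's code: `run_checked` and `update_submodules` are hypothetical names, and the choice to surface failures as `RuntimeError` is made here only for the example.

```python
import shutil
import subprocess


def run_checked(command, cwd="."):
    """Run a command; raise RuntimeError with exit code and stderr if it fails.

    Hypothetical helper used only for this illustration.
    """
    if shutil.which(command[0]) is None:
        raise RuntimeError(f"{command[0]} executable not found on PATH")
    try:
        subprocess.run(command, cwd=cwd, check=True, capture_output=True, text=True)
    except subprocess.CalledProcessError as exc:
        raise RuntimeError(
            f"{' '.join(command)} failed (exit code {exc.returncode}): "
            f"{exc.stderr.strip()}"
        ) from exc


def update_submodules(repo_dir="."):
    """Initialise and update all git submodules of the repository in `repo_dir`."""
    run_checked(["git", "submodule", "update", "--init", "--recursive"], cwd=repo_dir)
```

Calling `update_submodules()` from a build script then either succeeds silently or stops with a message that carries git's own stderr output, which is usually easier to act on than a bare non-zero exit code.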
  3. 10 Feb, 2025 1 commit
    • [Dev] Remove unnecessary python dependencies (#69) · 2411fa28
      Lei Wang authored
      * [Enhancement] Add VectorizeLoop function and update imports for compatibility
      
      * [CI][Test] Improve test cases for vectorization and fix typos in parser comments
      
      * lint fix
      
      * Fix incorrect module reference for VectorizeLoop transformation
      
      * Refactor vectorize_loop transformation by removing unused extent mutation logic
      
      * [Enhancement] Add support for FP8 data types and global barriers in CUDA codegen
      
      * Fix formatting in CUDA FP8 header file for consistency
      
      * Refactor CI workflow to use 'tilelang_ci' virtual environment and update CUDA type printing for better clarity
      
      * Update submodule 'tvm' to latest commit for improved functionality
      
      * Refactor execution backend references from 'dl_pack' to 'dlpack' for consistency and clarity; add apply_simplify function to simplify PrimFunc or IRModule.
      
      * Refactor CUDA code for improved readability; clean up formatting and remove unnecessary whitespace in multiple files.
      
      * Refactor import statement in test_tilelang_kernel_dequantize_gemm.py to use 'tilelang.language' for consistency
      
      * Add CUDA requirements to FP8 test cases and update references for clarity
      
      * Add a blank line for improved readability in test_tilelang_kernel_fp8_gemm_mma.py
      
      * Fix data type in reference result calculation for consistency in test_tilelang_kernel_gemm_mma_intrinsic.py
      
      * Add CUDA requirements and FP8 test cases for matmul and gemv simulations
      
      * Remove debug print statements and use tilelang's testing assertion for result validation in test_tilelang_kernel_gemm_mma_intrinsic.py
      
      * Remove outdated comment regarding FP8 tests in test_tilelang_kernel_gemv_simt.py
      
      * Add BF16 support to matrix multiplication and introduce corresponding test cases
      
      * Add a blank line for improved readability in BF16 GEMM test
      
      * Update acknowledgements in README to include supervision by Zhi Yang at Peking University
      
      * enhance acknowledgement
      
      * Replace tutorial on memory layout optimization with new tutorial on writing high-performance kernels with thread primitives
      
      * Update subproject commit for TVM dependency
      
      * Update subproject commit for TVM dependency
      
      * Add int4_t type and functions for packing char values in CUDA common header
      
      * Add plot_layout example and implement GetForwardVars method in layout classes
      
      * Refactor code for improved readability by adjusting line breaks and formatting in layout and test files
      
      * Fix formatting by removing unnecessary line break in layout.h
      
      * Refactor make_int4 function for improved readability by adjusting parameter formatting
      
      * Add legend to plot_layout for improved clarity of thread and local IDs
      
      * Remove unnecessary dependencies from requirements files for cleaner setup
      
      * Remove flash_mha.py and add .gitkeep to deepseek_mla directory
      
      * Add build requirements and update installation scripts for improved setup