• Lei Wang's avatar
    [CI] Refactor import paths in dequantization examples to use dequantize_utils (#914) · 9d382973
    Lei Wang authored
    * Update requirements and refactor benchmark script for deepseek_nsa example
    
    - Updated the requirements.txt to specify a fixed commit for the flash-linear-attention repository.
    - Refactored import paths in benchmark_nsa_fwd.py for better organization.
    - Added a new function to generate configurations for autotuning.
    - Modified the tilelang_sparse_attention function to accept parameters for block size, number of stages, and threads, enhancing flexibility.
    - Changed allocation of shared memory for accumulators to optimize performance.
    
    * Refactor import paths in dequantization examples to use dequantize_utils
    
    - Updated import statements in multiple dequantization example scripts to replace references to the removed utils.py file with the new dequantize_utils module.
    - Ensured consistency across example scripts for better organization and maintainability.
    9d382973
dequantize_utils.py 6.36 KB