• Lei Wang's avatar
    [CostModel] Introduce cuda driver api to get precise shared memory capacity (#317) · 3b660b67
    Lei Wang authored
    
    
    * [Enhancement] Introduce CUDA driver module and refactor CUDA device handling
    
    - Added a new `cuda_driver` module to encapsulate CUDA device properties and functionalities.
    - Updated `CUDA` class in `cuda.py` to utilize the new driver for fetching device name and shared memory capabilities.
    - Introduced `get_device_name` and `get_shared_memory_per_block` functions in the `cuda_driver` for improved device property management.
    - This refactor enhances code organization and maintainability while improving the handling of CUDA device attributes.
    
    * [Refactor] Clean up whitespace in CUDA-related files
    
    - Removed unnecessary blank lines in `cuda.py`, `__init__.py`, and `cuda_driver.py` to improve code readability and maintainability.
    - This change enhances the overall organization of the codebase without altering functionality.
    
    ---------
    Co-authored-by: default avatarLeiWang1999 <wyatuestc@gmail.com>
    3b660b67
__init__.py 250 Bytes