[CostModel] Introduce CUDA driver API to get precise shared memory capacity (#317)
* [Enhancement] Introduce CUDA driver module and refactor CUDA device handling
- Added a new `cuda_driver` module to encapsulate CUDA device properties and functionalities.
- Updated `CUDA` class in `cuda.py` to utilize the new driver for fetching device name and shared memory capabilities.
- Introduced `get_device_name` and `get_shared_memory_per_block` functions in `cuda_driver` to query device properties.
- This refactor keeps device-property queries in one module, which improves code organization and maintainability.
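
A minimal sketch of how the two helpers named above might query the CUDA driver API through `ctypes`. The function names come from this commit; the function bodies, the `_load_driver` and `_get_device` helpers, and the choice to return `None` when no driver or device is available are assumptions for illustration, not the code in `cuda_driver.py`.

```python
import ctypes
import ctypes.util
from typing import Optional

CUDA_SUCCESS = 0
# Value of this entry in the CUdevice_attribute enum (cuda.h).
CU_DEVICE_ATTRIBUTE_MAX_SHARED_MEMORY_PER_BLOCK = 8


def _load_driver() -> Optional[ctypes.CDLL]:
    """Hypothetical helper: load and initialize the CUDA driver, or return None."""
    names = [ctypes.util.find_library("cuda"), "libcuda.so.1", "libcuda.so", "nvcuda.dll"]
    for name in names:
        if not name:
            continue
        try:
            lib = ctypes.CDLL(name)
            if lib.cuInit(0) == CUDA_SUCCESS:
                return lib
        except (OSError, AttributeError):
            # Library missing, or present without the driver entry points.
            continue
    return None


def _get_device(lib: ctypes.CDLL, device_id: int) -> Optional[ctypes.c_int]:
    """Hypothetical helper: resolve a device ordinal to a CUdevice handle."""
    device = ctypes.c_int()  # CUdevice is a plain int handle
    if lib.cuDeviceGet(ctypes.byref(device), device_id) != CUDA_SUCCESS:
        return None
    return device


def get_device_name(device_id: int = 0) -> Optional[str]:
    """Return the device name reported by the driver, or None if unavailable."""
    lib = _load_driver()
    device = _get_device(lib, device_id) if lib else None
    if device is None:
        return None
    buf = ctypes.create_string_buffer(256)
    if lib.cuDeviceGetName(buf, len(buf), device) != CUDA_SUCCESS:
        return None
    return buf.value.decode()


def get_shared_memory_per_block(device_id: int = 0) -> Optional[int]:
    """Return the maximum shared memory per block in bytes, or None if unavailable."""
    lib = _load_driver()
    device = _get_device(lib, device_id) if lib else None
    if device is None:
        return None
    value = ctypes.c_int()
    status = lib.cuDeviceGetAttribute(
        ctypes.byref(value), CU_DEVICE_ATTRIBUTE_MAX_SHARED_MEMORY_PER_BLOCK, device)
    return value.value if status == CUDA_SUCCESS else None
```

On a machine without an NVIDIA driver both functions return `None` in this sketch, so a caller such as a cost model would need to fall back to a default capacity.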
* [Refactor] Clean up whitespace in CUDA-related files
- Removed unnecessary blank lines in `cuda.py`, `__init__.py`, and `cuda_driver.py` to improve readability.
- This change does not alter functionality.
---------
Co-authored-by: LeiWang1999 <wyatuestc@gmail.com>