[Refactor] Improve documentation and add detailed docstrings across multiple modules (#298)
* [Enhancement] Update AtomicAdd functions for BFLOAT16 in common.h - Added conditional compilation for BFLOAT16 atomic operations to ensure compatibility with CUDA architectures greater than 7.5. - Improved code clarity by organizing the AtomicAdd functions and adding relevant comments for better understanding. * [Enhancement] Improve documentation and add detailed docstrings across multiple modules - Updated the `__init__.py` file to enhance module documentation, providing clarity on auto-tuning functionalities. - Added comprehensive docstrings to the `JITContext`, `AutotuneResult`, and `AutoTuner` classes, detailing their attributes and methods. - Enhanced memory allocation utilities in `allocate.py` with detailed descriptions for each allocation function. - Improved documentation for various intrinsic operations in `builtin.py`, `copy.py`, `customize.py`, `frame.py`, `gemm.py`, `memscope.py`, and `reduce.py`, ensuring clear explanations of parameters and return values. - Refactored the `KernelCache` class to improve clarity and maintainability, including detailed comments and docstrings for methods. - Overall, these changes aim to enhance code readability and provide better guidance for future developers and users of the Tile-AI framework.
Showing
Please register or sign in to comment