• Rostyslav Geyyer's avatar
    Add a gpu gemm reference kernel (#1528) · aa932445
    Rostyslav Geyyer authored
    
    
    * Add a gpu gemm reference kernel
    
    * Switch to gpu reference in gemm examples
    
    * Remove redundant arguments
    
    * Update all related examples
    
    * Update more examples
    
    * Try less threads per block
    
    * Try even less threads per block
    
    * Add support for all matrix layouts
    
    * Increase block size
    
    * Clean up
    
    * Remove hardcoded strides
    
    * Clean up
    
    * Try a column-major case
    
    * Revert back to row-major
    
    * Run both CPU and GPU veriffication
    
    ---------
    Co-authored-by: default avatarPo Yen Chen <PoYen.Chen@amd.com>
    aa932445
common.hpp 8.6 KB