"vscode:/vscode.git/clone" did not exist on "46cea4f632292f791dac6748470d3ab1ecea5e73"
  • Bartlomiej Wroblewski's avatar
    Redesign the DPP8 GEMM kernel to use warp-wise component (#863) · 37a8c1f7
    Bartlomiej Wroblewski authored
    * Redesign the DPP8 GEMM kernel to use warp-wise component
    
    * Review: Improve error messages
    
    * Review: Remove unnecessary empty lines
    
    * Review: Fix M, N per thread names
    
    * Review: Rename mfma_input_type to dpp_input_type
    
    * Review: Fix tensor adaptor; remove unnecessary element
    
    * Review: Remove calls to dpp_gemm's MakeCDescriptor
    
    * Review: Add blockwise doc, change function names to include dimension names
    
    * Review: Remove duplicated code; Move Block2CtileMap alias to the top of the file
    
    * Review: Add __restrict__ keywords
    
    * Review: Use MatrixPadder for padding A, B, C matrices
    
    * Review: Remove hardcoded datatypes
    
    * Review: Change names from FloatX to XDataType
    
    * Review: Introduce AK0 and BK0 instead of a single K0
    
    * Review: Remove construction of dpp_datatypes object
    
    * Review: Rename DppInstrRunner to DppLanegroupGemm
    37a8c1f7
gemm_dpp_fp16.cpp 3.19 KB