    Gemm_c_shuffle (4 layouts) X (fp32 bf16 int8) (#131) · 485ea46a
    rocking5566 authored
    
    
    * [What] Separate fixed-point gemm from the gemm example
    [Why] Let the gemm_int8 example be a pure gemm.
    [What]
    1. Add gemm_requant_relu_requant.
    2. Let CDataType be int32 in pure gemm, because no one uses int8 CDataType. It is also part of gemm_requant_relu_requant.
    
    * Fix path
    
    * Revise cmakelist due to merge develop
    
    * Add gemm fp16 test
    
    * Extract PrepareGemmTensor
    
    * Extract TestGemm
    
    * Add test for different layout
    
    * Add 4 layouts of shuffle version of fp32
    
    * Add 4 layouts of shuffle version of int8
    
    * Add 4 layouts of shuffle version of bf16
    
    * Replace all DeviceGemmPtr_ with DeviceGemmNoOpPtr to fit the naming convention
    
    * Add test for non-shuffle version of gemm
    
    * Fix typo
    
    * Print kernel information
    
    * Add rest of the fp32 kernel to the test
    
    * 1. Add rest of the fp16 device op.
    2. Mark the invalid device operation
    Co-authored-by: rocking <chunylai@amd.com>
test_util.hpp 3.85 KB