• Adam Osewski's avatar
    Gemm reduce examples int4/int8/fp32/bf16 (#368) · d00e6115
    Adam Osewski authored
    
    
    * GEMM + Reduce max fp16+fp32
    
    * GEmm + Max bf16 + int8
    
    * Refactor common definitions.
    
    * Refactor common func of mean meansquare example.
    
    * More examples for mean meansquare.
    
    * Update int8 examples and skip them cause of random errors.
    
    * Int4 examples.
    
    * Fix examples for max int4/8
    
    * Tensor conversion for int4 input data for mean meansquare example.
    
    * Remove int4 mean_meansquare example
    
    * Fix int8 mean_meansquare example.
    
    -All ReductionAccData and R<N>DataType have to be F32. The INT32 data
    type is giving wrong results.
    
    * Guard int4 with ifdef
    
    * Change int8 example to add_addsquare due to div rounding err.
    
    * Clang format
    
    * Change the return type of common function.
    
    * Get back int8 example with division.
    
    * Remove int8 mean meansquare.
    
    * Use proper cast for BF16 data type.
    
    * Use ck::literals.
    
    * Use proper data type for host tensors & reference.
    
    - Use ReduceAccDataType for reference gemm output data type.
    - Cast host reference output tensor to EDataType
    - Fix ifdefs for int4.
    Co-authored-by: default avatarAdam Osewski <aosewski@amd.com>
    d00e6115
reduction_operator.hpp 9.61 KB