• Shucai Xiao's avatar
    Int8 gemm support (#811) · 39bc6161
    Shucai Xiao authored
    
    
    * add a flag to indicate int8x4 input format
    
    * clang format
    
    * code backup
    
    * clang format
    
    * code backup
    
    * clang format
    
    * code backup
    
    * clang format
    
    * code backup
    
    * clang format
    
    * code backup
    
    * clang format
    
    * remove log info
    
    * remove unnecessary changes
    
    * fix cppcheck error
    
    * add unit tests to have more code coverage
    
    * clang format
    
    * add debug info
    
    * remove log info
    
    * fix cppcheck error
    
    * clang format
    
    * clang format
    
    * add one more unit tests for more scenarios
    
    * fix cppcheck error
    
    * clang format
    
    * fix review comments
    
    * clang format
    
    * rename p to m
    
    * fix review comments
    
    * refine unit tests
    
    * clang format
    
    * refine unit tests and fixed a bug
    
    * clang format
    
    * fix build error related to rocm4.2
    
    * fix a bug related to alpha and beta
    
    * refine two unit tests related to int8_gemm
    
    * fix cppcheck error
    
    * refine unit test to pass on mi100
    
    * add unit test for packing int8 args
    
    * clang format
    
    * change unit tests back
    
    * disable some unit tests for gpu
    
    * clang format
    
    * refine unit tests to run on mi100
    
    * clang format
    
    * refine unit tests
    
    * refine unit tests
    
    * clang format
    
    * change back a unit test
    Co-authored-by: default avatarmvermeulen <5479696+mvermeulen@users.noreply.github.com>
    39bc6161
batch_quant_dot_4.cpp 924 Bytes