• Po Yen Chen's avatar
    Modularize ckProfiler operations (#514) · 8784a72e
    Po Yen Chen authored
    
    
    * Re-structure ckProfiler source files
    
    * Rename profiler.cpp to main.cpp
    
    * Modularize ckProfiler operations
    
    * Add description for profiler operations
    
    * Use longer name to avoid name collision
    
    * Use macro to delay expansion
    
    * Use std::move() to avoid object copying
    
    * Prohibit users from calling dtor
    
    * Use macro to eliminate redundant code
    
    * Make friend function hidden
    
    * Add missing include directive <iostream>
    
    * Fix wrong include directives
    
    * Remove int8 from batchnorm-forward instances since it is not needed for forward training and could fail test
    Co-authored-by: default avatarQianfeng Zhang <Qianfeng.Zhang@amd.com>
    8784a72e
grouped_gemm_fp16.cpp 2.04 KB