• rocking5566's avatar
    gemm + layernorm (#261) · d32a67a9
    rocking5566 authored
    * Implement reduction meand and reduction square mean
    
    * Refine file name
    
    * Add reduce mean and square mean
    
    * Fix parameter name
    
    * Add normalize device op (not implement invoker::run())
    
    * Remove epislon
    
    * Refine deviceop
    
    * Add 5ary elementwise for normalization
    
    * Add layernorm example
    
    * layerNorm verication
    
    * Fix compiler error due to merge from develop
    
    * Fix typo
    
    * Fix compile error
    
    * Refine naming
    
    * [What] Suport non pointer for invoker and argument
    [Why] Snyc coding style with gemm
    
    * Refine folder name
    
    * Refine class name
    
    * Evaluate perf of the kernel
    
    * Fix compile error
    
    * [What] Refine perf evaluation in example of gemm + reduction
    [Why] evaluation of gemm + reduction may cause verification fail. Because evaluation will not initial global memory
    
    * clang-format
    d32a67a9
CMakeLists.txt 196 Bytes