• ruanjm's avatar
    [CK_TILE] Add Various Fusion Functions to RMSNorm (#1802) · 04dd3148
    ruanjm authored
    
    
    * Add shortcut to RMSNorm
    
    * Modify test for adding shortcut for RMSNorm
    
    * Add fused parameter into tests
    
    * 1. Add YDataType. 2. rmsnorm2d_fwd_traits_ from rmsnorm2d_fwd.hpp to rmsnorm2d_fwd_api.cpp and rmsnorm2d_fwd_instance_common.hpp
    
    * 1. Supports various stride and percisions.
    
    * Add support of Epilogue
    
    * Add fuse and epilogue support to rmsnorm ref
    
    * Modify rmsnorm example
    
    * Refactor tests/examples
    
    * Bug fix for newly added tests/examples
    
    * Bug fix for new tests 2
    
    * Modify smoke test scripts
    
    remove dbg code
    
    * Supports non-smooth dyanmic quant
    
    * Update Rmsnorm2dFwd::GetName()
    
    * rename xscale and prec_sx to smoothscale and prec_sm
    
    Bug fix after rename
    
    Remove files
    
    * change example_rmsnorm2d_fwd.cpp
    
    * update performance calculator
    
    * Fix issue in two-pass when fuse add is enabled
    
    * Remove comment of beta
    
    ---------
    Co-authored-by: default avatarrocking <ChunYu.Lai@amd.com>
    04dd3148
smoothquant.cpp 8.27 KB