• Paul Fultz II's avatar
    Fuse skip layernorm (#683) · 1bfb147d
    Paul Fultz II authored
    
    
    * Unify the vectorized and non-vectorized path
    
    * Formatting
    
    * Make fusion easily extendable
    
    * Add skip layernorm fusion
    
    * Formatting
    
    * Call correct layernorm function
    
    * Fix compile errors
    
    * Add DCE
    
    * Add test for skip layernorm
    
    * Formatting
    
    * Remove unused typedef
    
    * Formatting
    
    * Fix tidy issues
    
    * Formatting
    Co-authored-by: default avatarShucai Xiao <shucai.xiao@amd.com>
    1bfb147d
test_layernorm.cpp 3.31 KB