"docs/source/zh-Hans/concepts/colossalai_overview.md" did not exist on "85b2303b5506f9cef57bed571eedb186015a4b8c"
  • littsk's avatar
    [hotfix] Add layer norm gradients all-reduce for sequence parallel (#4926) · 1a3315e3
    littsk authored
    
    
    * [hotfix] Add layer norm gradients all-reduce for sequence parallel. (#4915)
    
    * Add layer norm gradients all-reduce for sequence parallel.
    
    * skip pipeline inference test
    
    * [hotfix] fixing polices of sequence parallel (#4922)
    
    * Add layer norm gradients all-reduce for sequence parallel.
    
    * fix parameter passing when calling get_autopolicy
    
    ---------
    Co-authored-by: default avatarlittsk <1214689160@qq.com>
    
    * Hotfix/add grad all reduce for sequence parallel (#4927)
    
    * Add layer norm gradients all-reduce for sequence parallel.
    
    
    * fix parameter passing when calling get_autopolicy
    
    * fix bug using wrong variables
    
    ---------
    Co-authored-by: default avatarlittsk <1214689160@qq.com>
    
    * fix policy initialization
    
    * fix bloom and chatglm policices
    
    * polish code of handling layernorm
    
    * fix moe module
    
    * polish code of class initializing
    
    ---------
    Co-authored-by: default avatarZhongkai Zhao <kanezz620@gmail.com>
    1a3315e3
test_pipeline_infer.py 2.98 KB