• FoolPlayer's avatar
    [shardformer]: Feature/shardformer, add some docstring and readme (#3816) · 8cc11235
    FoolPlayer authored
    * init shardformer code structure
    
    * add implement of sharder (inject and replace)
    
    * add implement of replace layer to colossal layer
    
    * separate different layer policy, add some notion
    
    * implement 1d and 2d slicer, can tell col or row
    
    * fix bug when slicing and inject model
    
    * fix some bug; add inference test example
    
    * add share weight and train example
    
    * add train
    
    * add docstring and readme
    
    * add docstring for other files
    
    * pre-commit
    8cc11235
shardmodel.py 1.88 KB