• Xuechen Li's avatar
    support when num_heads is not divisible by world_size; resolves #459 (#461) · bb4cded1
    Xuechen Li authored
    * uneql rank.
    
    * trim.
    
    * enable passing in number of heads for each rank.
    
    * simplify.
    
    * simplify.
    
    * cleanup.
    
    * fix col parallel.
    
    * fix bug with row parallel.
    
    * fit out proj.
    
    * refac.
    
    * fix sharding logic.
    
    * refac sharding.
    
    * refac.
    
    * support multiple of.
    
    * make fn reuseable.
    
    * fix bug in dimensions.
    
    * scaffold.
    
    * test uneven heads.
    
    * fix test by adding barrier.
    
    * refac.
    
    * reuse code.
    
    * clean up.
    bb4cded1
distributed.py 5.86 KB