1. 18 Aug, 2023 1 commit
    • support when num_heads is not divisible by world_size; resolves #459 (#461) · bb4cded1
      Xuechen Li authored
      * unequal rank.
      
      * trim.
      
      * enable passing in number of heads for each rank.
      
      * simplify.
      
      * simplify.
      
      * cleanup.
      
      * fix col parallel.
      
      * fix bug with row parallel.
      
      * fit out proj.
      
      * refac.
      
      * fix sharding logic.
      
      * refac sharding.
      
      * refac.
      
      * support multiple of.
      
      * make fn reusable.
      
      * fix bug in dimensions.
      
      * scaffold.
      
      * test uneven heads.
      
      * fix test by adding barrier.
      
      * refac.
      
      * reuse code.
      
      * clean up.
2. 07 Jan, 2023 1 commit
3. 27 Dec, 2022 2 commits
4. 24 Dec, 2022 1 commit
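
The 18 Aug, 2023 commit (bb4cded1) concerns sharding attention heads when num_heads is not divisible by world_size. As a rough illustration of that idea only, the sketch below gives the first `num_heads % world_size` ranks one extra head and derives each rank's slice of a column-parallel projection from that. The function names `heads_per_rank` and `local_column_range` and the `head_dim` parameter are hypothetical and are not taken from the repository; the commit's actual sharding logic may differ.

```python
from typing import List, Tuple


def heads_per_rank(num_heads: int, world_size: int) -> List[int]:
    """Hypothetical helper: split num_heads across ranks as evenly as possible.

    The first (num_heads % world_size) ranks receive one extra head.
    """
    if num_heads < 0 or world_size <= 0:
        raise ValueError("num_heads must be >= 0 and world_size must be > 0")
    base, remainder = divmod(num_heads, world_size)
    return [base + (1 if rank < remainder else 0) for rank in range(world_size)]


def local_column_range(
    rank: int, num_heads: int, world_size: int, head_dim: int
) -> Tuple[int, int]:
    """Hypothetical helper: [start, end) columns of a projection owned by `rank`.

    Assumes heads are laid out contiguously, head_dim columns per head.
    """
    counts = heads_per_rank(num_heads, world_size)
    start = sum(counts[:rank]) * head_dim
    end = start + counts[rank] * head_dim
    return start, end


if __name__ == "__main__":
    # 10 heads over 4 ranks: ranks 0 and 1 take 3 heads, ranks 2 and 3 take 2.
    print(heads_per_rank(10, 4))  # [3, 3, 2, 2]
    print(local_column_range(2, 10, 4, head_dim=64))  # (384, 512)
```

With an uneven split like this, ranks hold differently sized shards, so any code that assumes a fixed `num_heads // world_size` per rank would need the per-rank counts instead.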