1. 27 Mar, 2024 1 commit
    • v1 · 051f58f1
      liangjing authored
  2. 02 Apr, 2023 1 commit
  3. 24 Sep, 2022 1 commit
  4. 23 Sep, 2022 1 commit
    • Clean up licensing. · fabad461
      Jared Casper authored
      The LICENSE file says everything is 3-clause BSD, which is what we want,
      but at some point the Apache license was added to the top of some files
      and that proliferated. This commit removes the Apache license from any
      files that we own the copyright to.
      
      Also updates the copyright year and removes the unnecessary coding=utf-8
      line.
  5. 21 Jul, 2022 1 commit
  6. 18 May, 2022 1 commit
  7. 28 Apr, 2022 1 commit
  8. 30 Jul, 2021 1 commit
    • Support for pipeline parallelism in T5 model · 46c74b4c
      Deepak Narayanan authored
      - Accumulate encoder hidden state gradient to handle skip connection
      - Correctly compute the number of layers in encoder / decoder for T5 model
      - Ensure weights are initialized the same way in embeddings
      - Synchronize embedding gradients across encoder and decoder for T5 model
      - Support for checkpoint loading and saving
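      The first bullet can be illustrated with a minimal sketch. This is not
      code from the commit. It assumes, for illustration only, that the
      encoder hidden state is consumed by more than one downstream stage (the
      skip connection), so each consumer contributes its own gradient and the
      contributions have to be summed. The function name
      `accumulate_encoder_grad` and the toy weights are hypothetical.

```python
def accumulate_encoder_grad(stage_grads):
    """Sum per-stage gradients taken w.r.t. one shared encoder hidden state."""
    total = [0.0] * len(stage_grads[0])
    for grad in stage_grads:
        for i, g in enumerate(grad):
            total[i] += g
    return total


# Two hypothetical consumers read the same encoder output h.
# With loss = sum(w1 * h) + sum(w2 * h), the gradient d(loss)/dh is w1 + w2,
# so each consumer's gradient is added rather than overwriting the other.
w1 = [0.5, -1.0, 2.0]   # gradient contributed by the first consumer
w2 = [1.5, 0.25, -2.0]  # gradient contributed by the second consumer
grad_h = accumulate_encoder_grad([w1, w2])
print(grad_h)
```

      Keeping only the last gradient received instead of the sum would
      silently drop the other consumer's contribution.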
  9. 16 Apr, 2021 1 commit