1. 20 Apr, 2022 1 commit
  2. 19 Apr, 2022 1 commit
  3. 14 Apr, 2022 1 commit
  4. 13 Apr, 2022 1 commit
  5. 08 Apr, 2022 3 commits
  6. 07 Apr, 2022 2 commits
      Deprecation warning: `pyprof` & `reparameterization` (#1348) · 727a6452
      Masaki Kozuki authored
      * add warning to pyprof
      
      * add warning to reparameterization
      
      note: `reparameterization` already fails to import, as shown below:
      
      ```
      (base) root@c4bb3f161482:/vscode/apex# python -c 'import torch; import apex; from apex import reparameterization'
      /vscode/apex/apex/pyprof/__init__.py:5: FutureWarning: pyprof will be removed by the end of June, 2022
        warnings.warn("pyprof will be removed by the end of June, 2022", FutureWarning)
      /vscode/apex/apex/reparameterization/__init__.py:2: FutureWarning: reparameterization will be removed by the end of June, 2022
        warnings.warn("reparameterization will be removed by the end of June, 2022", FutureWarning)
      Traceback (most recent call last):
        File "<string>", line 1, in <module>
        File "/vscode/apex/apex/reparameterization/__init__.py", line 4, in <module>
          from .weight_norm import WeightNorm
        File "/vscode/apex/apex/reparameterization/weight_norm.py", line 3, in <module>
          from ..fp16_utils import Fused_Weight_Norm
      ImportError: cannot import name 'Fused_Weight_Norm' from 'apex.fp16_utils' (/vscode/apex/apex/fp16_utils/__init__.py)
      ```
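
The log above shows each deprecated package calling `warnings.warn(..., FutureWarning)` from its `__init__.py`. A minimal sketch of that pattern follows, using only the standard library. The helper names `warn_deprecated` and `collect_warnings` are hypothetical and are not part of apex; the message format is copied from the log.

```python
import warnings


def warn_deprecated(module_name: str, deadline: str) -> str:
    """Emit a FutureWarning for a module slated for removal (hypothetical helper)."""
    message = f"{module_name} will be removed by {deadline}"
    # stacklevel=2 attributes the warning to the caller rather than this helper.
    warnings.warn(message, FutureWarning, stacklevel=2)
    return message


def collect_warnings(module_name: str, deadline: str) -> list:
    """Capture the warning instead of printing it, e.g. for use in a test."""
    with warnings.catch_warnings(record=True) as caught:
        # Make sure the warning is recorded even if it was already shown once.
        warnings.simplefilter("always")
        warn_deprecated(module_name, deadline)
    return [(w.category.__name__, str(w.message)) for w in caught]


if __name__ == "__main__":
    print(collect_warnings("pyprof", "the end of June, 2022"))
```

`FutureWarning` is shown to end users by default, whereas `DeprecationWarning` is hidden outside `__main__` unless filters are changed, which is presumably why it was chosen here.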
      [transformer] add microbatches test (#1349) · 7d903878
      Masaki Kozuki authored
      * add test
      
      * destroy model parallel was missing
  7. 05 Apr, 2022 2 commits
  8. 03 Apr, 2022 1 commit
  9. 02 Apr, 2022 4 commits
  10. 01 Apr, 2022 3 commits
  11. 31 Mar, 2022 3 commits
  12. 30 Mar, 2022 2 commits
  13. 29 Mar, 2022 2 commits
  14. 28 Mar, 2022 1 commit
  15. 25 Mar, 2022 7 commits
      update fmha (#1344) · 3c88451a
      yjk21 authored
      Forgot · cd8db094
      Thor Johnsen authored
      Optional inplace halo exchange · b41c68b3
      Thor Johnsen authored
      [transformer] Format & Test Refactoring (#1325) · a0ed4151
      Masaki Kozuki authored
      * try PyTorch custom TestCase class
      
      * revert
      
      * initial working example
      
      * update
      
      * data utils
      
      * fix imports
      
      * hardcode backend to nccl
      
      * fix signature
      
      * fix typo
      
      * mapping
      
      * set device
      
      * init
      
      * refactor x entropy
      
      * remove unused import & destroy model parallel
      
      * refactor random
      
      * fix test
      
      * remove migrated tests
      
      * refactor
      
      * init
      
      * separate affine weight init
      
      * init model parallel
      
      * split more
      
      * weight init fix part 1
      
      * use cpu init for consistency btwn native and tensor parallel
      
      * black
      
      * add col parallel
      
      * use a 3D tensor of square matrix for column parallel linear
      
      * skip the failing cases
      
      * migrate layers test
      
      * pipeline parallel forward/backward
      
      * fix typo
      
      * fix typo
      
      * fix
      
      * fix pipeline world size
      
      * black
      
      * rm `run_pipeline_parallel_test` in favor of test_pipeline_parallel_fwd_bwd.py
      
      * stop logging
      
      * set log level
      
      * black
      
      * license and format
      
      * fix
      
      * skip tf32 as matrices are small
      
      * remove potentially inappropriate license
      
      * Apply suggestions from code review
      
      * remove `TODO` comment
      
      * `torch.testing.assert_allclose` -> `torch.testing.assert_close`
      
      * remove comment-outs
      
      * remove unused import
      
      * minor fix
      [transformer] `parallel_state`: Position Embedding (#1343) · f10b4b89
      Masaki Kozuki authored
      * update
      
      * Add comment to `destroy_model_parallel`
      Halo exchangers · 778808eb
      Thor Johnsen authored
      Add bottleneck block · 3ade5b26
      Thor Johnsen authored
  16. 24 Mar, 2022 4 commits
  17. 23 Mar, 2022 2 commits