Commits · f00d0a3fc1fb59b05669ca35ddecb6d422403bfd · OpenDAS / Megatron-LM

11 Feb, 2022 4 commits
- Merge branch 'readme-no-versions' into 'main' · f00d0a3f
  Mohammad Shoeybi authored Feb 10, 2022
```
Remove specific versions of pytorch, etc. from README so it doesn't go out of date.

See merge request ADLR/megatron-lm!392
```
  f00d0a3f
- Remove specific versions of pytorch, etc. from README so it doesn't go out of date. · d50e89f1
  Jared Casper authored Feb 10, 2022
  
  d50e89f1
- Merge branch 'stop_tokens' into 'main' · d5fe59fe
  Mohammad Shoeybi authored Feb 10, 2022
```
Adding several things to the text_generation_server that were necessary for the demos

See merge request ADLR/megatron-lm!350
```
  d5fe59fe
- Addressing comments · 1d391bba
  rprenger authored Feb 10, 2022
  
  1d391bba
08 Feb, 2022 2 commits
- Merge branch 'lmcafee/embed-standalone' into 'main' · 10c6ad06
  Jared Casper authored Feb 08, 2022
```
Standalone embedding stage

See merge request ADLR/megatron-lm!385
```
  10c6ad06
- fixed t5 'get_num_layers()' · 2fadaa50
  Lawrence McAfee authored Feb 08, 2022
  
  2fadaa50
04 Feb, 2022 3 commits
- renamed argument; 'embed' -> 'embedding' · c04c4977
  Lawrence McAfee authored Feb 04, 2022
  
  c04c4977
- Adding message to ValueError · b0c824d9
  rprenger authored Feb 04, 2022
  
  b0c824d9
- Adding the web interface · 42982fc3
  rprenger authored Feb 03, 2022
  
  42982fc3
01 Feb, 2022 2 commits
- comments, cleanup. · b93bef00
  Lawrence McAfee authored Feb 01, 2022
  
  b93bef00
- found root source of t5 issue (fast layer norm) · bea16fa3
  Lawrence McAfee authored Feb 01, 2022
  
  bea16fa3
31 Jan, 2022 2 commits
- working for t5 [ encoder embedding only ] · 3af6725d
  Lawrence McAfee authored Jan 31, 2022
  
  3af6725d
- added 'no-op' layer, to replace transformer layer when num_layers == 0. · 1fa6990c
  Lawrence McAfee authored Jan 31, 2022
  
  1fa6990c
29 Jan, 2022 5 commits
- Merge branch 'vision-merge' into 'main' · e724785f
  Jared Casper authored Jan 28, 2022
```
second phase of vision code merge

See merge request ADLR/megatron-lm!381
```
  e724785f
- narrowed issue to pipeline rank 0, virtual pipeline rank >= 1 · 5bc9f889
  Lawrence McAfee authored Jan 28, 2022
  
  5bc9f889
- typo fix · 2b628f96
  Vijay Korthikanti authored Jan 28, 2022
  
  2b628f96
- Merge branch 'github-pr' into 'main' · e156d2fe
  Jared Casper authored Jan 28, 2022
```
Combination of several github PRs

See merge request ADLR/megatron-lm!383
```
  e156d2fe
- Revert incorrect fix. · cd499559
  Jared Casper authored Jan 28, 2022
  
  cd499559
28 Jan, 2022 9 commits
- Merge branch 'patch-1' of https://github.com/vycezhong/Megatron-LM into github-pr · 2a34e0ec
  Jared Casper authored Jan 28, 2022
  
  2a34e0ec
- Merge branch 'main' of https://github.com/satpalsr/Megatron-LM into github-pr · 34f55429
  Jared Casper authored Jan 28, 2022
  
  34f55429
- Merge branch 'patch-1' of https://github.com/jamesr66a/Megatron-LM into github-pr · adebe364
  Jared Casper authored Jan 28, 2022
  
  adebe364
- Merge branch 'patch-2' of https://github.com/kvtoraman/Megatron-LM into github-pr · 20f6169f
  Jared Casper authored Jan 28, 2022
  
  20f6169f
- Merge branch 'patch-1' of https://github.com/rajeshkppt/Megatron-LM into github-pr · 0747e8e5
  Jared Casper authored Jan 28, 2022
  
  0747e8e5
- Merge branch 'fix' of https://github.com/singleheart/Megatron-LM into github-pr · 9882fb3f
  Jared Casper authored Jan 28, 2022
  
  9882fb3f
- Merge branch 'patch-1' of https://github.com/stas00/Megatron-LM into github-pr · 4a62d582
  Jared Casper authored Jan 28, 2022
  
  4a62d582
- more naming cleanup · 641408f5
  Vijay Korthikanti authored Jan 28, 2022
  
  641408f5
- changing class name AnnealingLR to OptimizerParamScheduler · 04ecc834
  Vijay Korthikanti authored Jan 28, 2022
  
  04ecc834
27 Jan, 2022 6 commits
- address review comments · 53931b8b
  Vijay Korthikanti authored Jan 27, 2022
  
  53931b8b
- Merge branch 'main' into lmcafee/embed-standalone · f17a3933
  Lawrence McAfee authored Jan 27, 2022
  
  f17a3933
- Merge branch 'lmcafee/distrib-chkpt-fix-v2' into 'main' · fd5469aa
  Jared Casper authored Jan 26, 2022
```
Distributed checkpointing memory fix

See merge request ADLR/megatron-lm!379
```
  fd5469aa
- add clarification about the model parallel size column · 2d767f48
  Stas Bekman authored Jan 26, 2022
  
  2d767f48
- [README] specify explicitly which gpu and node size was used · 145d2eb9
  Stas Bekman authored Jan 26, 2022
```
The paper has this info, so proposing to copy it next to the table. 

Otherwise it's hard to guess whether you used 40GB A100s or 80GB ones (and secondary, n_gpus per node).

Thank you!
```
  145d2eb9
- Adding the option to not log · 31fd62d6
  rprenger authored Jan 26, 2022
  
  31fd62d6
26 Jan, 2022 2 commits
- further clarified viewless tensor comment in transformer.py · d16e2a24
  Lawrence McAfee authored Jan 26, 2022
  
  d16e2a24
- address review comments · 8acbbe25
  Vijay Korthikanti authored Jan 26, 2022
  
  8acbbe25
25 Jan, 2022 3 commits
- Merge branch 'lmcafee/t5-free-pipe-output-true-fix' into 'main' · fc5d4c2b
  Jared Casper authored Jan 25, 2022
```
T5 fix for deallocating pipeline output tensors

See merge request ADLR/megatron-lm!380
```
  fc5d4c2b
- limit 'make_viewless_tensor()' to case of micro_batch_size == 1; added comment · 24369dd6
  Lawrence McAfee authored Jan 25, 2022
  
  24369dd6
- working with interleaving · 804ed2e6
  Lawrence McAfee authored Jan 24, 2022
  
  804ed2e6
24 Jan, 2022 2 commits
- added args.transformer_pipeline_model_parallel_size · a06af061
  Lawrence McAfee authored Jan 24, 2022
  
  a06af061
- fixed args.virtual_pipeline_model_parallel_size · c2b7d0b3
  Lawrence McAfee authored Jan 24, 2022
  
  c2b7d0b3