Commits · c6f25b309913655b1f43a553c6f2d5e4a620f527 · OpenDAS / Megatron-LM

08 Sep, 2021 1 commit
- This file doesn't work withthe new inference code, so it should be remove · c6f25b30
  rprenger authored Sep 08, 2021
  
  c6f25b30
07 Sep, 2021 2 commits
- Merge branch 'slym/allreduce-overlap' into 'main' · b462d254
  Jared Casper authored Sep 07, 2021
```
use low-priority stream for nccl overlapping all-reduce and gemm

See merge request ADLR/megatron-lm!319
```
  b462d254
- remove increasing nccl stream for overlapping allreduce and gemm · f3e57f6f
  Sangkug Lym authored Sep 06, 2021
  
  f3e57f6f
03 Sep, 2021 2 commits
- Merge branch 'slym/allreduce_custom' into 'main' · 6e1bde1e
  Jared Casper authored Sep 02, 2021
```
allreduce overlap with wgrad gemm

See merge request ADLR/megatron-lm!316
```
  6e1bde1e
- reflect feedback · 4df8b7a2
  slym authored Sep 02, 2021
  
  4df8b7a2
02 Sep, 2021 4 commits
- reflect feedback · 3f652469
  slym authored Sep 02, 2021
  
  3f652469
- minor changes · 16c90445
  slym authored Sep 02, 2021
  
  16c90445
- t # This is a combination of 2 commits. · cf7efd4f
  Sangkug Lym authored Aug 30, 2021
```
allreduce overlap with wgrad gemm

change custom delay to dummy add
```
  cf7efd4f
- Merge branch 'add_all_probs' into 'main' · b7ae685f
  Jared Casper authored Sep 01, 2021
```
Letting server return the log-probabilities of the context and generated text

See merge request ADLR/megatron-lm!317
```
  b7ae685f
01 Sep, 2021 2 commits
- Fixing a bug caused by merge · d1b155c9
  rprenger authored Sep 01, 2021
  
  d1b155c9
- Fixing merge conflicts with main · 6b5ae488
  rprenger authored Sep 01, 2021
  
  6b5ae488
31 Aug, 2021 2 commits
- Fixing merge conflict · 3d718bfc
  rprenger authored Aug 31, 2021
  
  3d718bfc
- Fixing small merge conflict · 9939fb58
  rprenger authored Aug 31, 2021
  
  9939fb58
27 Aug, 2021 7 commits
- Merge branch 'slym/pyt21.08_nvfuser' into 'main' · 3860e995
  Jared Casper authored Aug 27, 2021
```
Use nvfuser at pytorch >= 1.10

See merge request ADLR/megatron-lm!314
```
  3860e995
- Use nvfuser at pytorch >= 1.10 · dba2506d
  Sangkug Lym authored Aug 27, 2021
  
  dba2506d
- Merge branch 'server' into 'main' · 89e8d27e
  Jared Casper authored Aug 27, 2021
```
Adding API server

See merge request ADLR/megatron-lm!294
```
  89e8d27e
- Adding API server · 3fe6821a
  Ryan Prenger authored Aug 27, 2021
  
  3fe6821a
- Added generate_samples_eval function · b6b7ba4d
  rprenger authored Aug 27, 2021
  
  b6b7ba4d
- Removing NVIDIA specific code and fixing some whitespace · e718810e
  rprenger authored Aug 27, 2021
  
  e718810e
- Merge branch 'dist_chkpt_act' into 'main' · 136d63cb
  Jared Casper authored Aug 27, 2021
```
Revisited distributing checkpointed activations along the tensor parallel ranks

See merge request ADLR/megatron-lm!311
```
  136d63cb
26 Aug, 2021 4 commits
- Fixing bug caused by merge · 448cb299
  rprenger authored Aug 25, 2021
  
  448cb299
- Merging with main · feea48cd
  rprenger authored Aug 25, 2021
  
  feea48cd
- Found a bug. If you don't make this change and you ask for 1 token you get 2 etc. · 8694c7b0
  rprenger authored Aug 25, 2021
  
  8694c7b0
- Addressing comments · 055a673e
  rprenger authored Aug 25, 2021
  
  055a673e
24 Aug, 2021 1 commit
- Merge branch 'github_fused_softmax' into 'main' · 0be40526
  Jared Casper authored Aug 23, 2021
```
Fused softmax checks and additions from Github (#133)

See merge request ADLR/megatron-lm!312
```
  0be40526
23 Aug, 2021 7 commits
- removed unused variable · bc7b3539
  mshoeybi authored Aug 22, 2021
  
  bc7b3539
- Modify comments · b0b017ff
  hyunwoongko authored Aug 18, 2021
  
  b0b017ff
- chagne PR by reviews · c6b77639
  hyunwoongko authored Aug 18, 2021
  
  c6b77639
- fix bugs in fused softmax · 0d350c8d
  hyunwoongko authored Aug 13, 2021
  
  0d350c8d
- Remove useless include · 116820a5
  hyunwoongko authored Aug 13, 2021
  
  116820a5
- Improve and fix bugs about fused softmax layer · 40becfc9
  hyunwoongko authored Aug 13, 2021
  
  40becfc9
- tested · cb5e611d
  mshoeybi authored Aug 22, 2021
  
  cb5e611d
21 Aug, 2021 5 commits
- some cleanup · c61dc22f
  mshoeybi authored Aug 21, 2021
  
  c61dc22f
- added for pp · b8940b96
  mshoeybi authored Aug 21, 2021
  
  b8940b96
- resolved conflicts · 7f2cc3a4
  mshoeybi authored Aug 20, 2021
  
  7f2cc3a4
- resolved conflicts · 30b92cf5
  mshoeybi authored Aug 20, 2021
  
  30b92cf5
- resolved conflicts · 8cb389b8
  mshoeybi authored Aug 20, 2021
  
  8cb389b8
19 Aug, 2021 3 commits
- Merge branch 'slym/partial_chpt' into 'main' · 23266c57
  Jared Casper authored Aug 19, 2021
```
Checkpoint a set number of individual Transformer layers

See merge request ADLR/megatron-lm!301
```
  23266c57
- Merge branch 'mem_opt_for_contig_buffer' into 'main' · 3715b0bc
  Jared Casper authored Aug 19, 2021
```
Memory optimizations for contiguous buffers

See merge request ADLR/megatron-lm!310
```
  3715b0bc
- update readme and arguement definition · 99f47676
  slym authored Aug 19, 2021
  
  99f47676