Commits · ff2f0a05ea2c03f87f98af428c59bbbd5bfc8f41 · OpenDAS / Megatron-LM

30 Sep, 2021 1 commit
- further refactoring · ff2f0a05
  mshoeybi authored Sep 29, 2021
  
  ff2f0a05
29 Sep, 2021 3 commits
- added multi-batch inference · 390ddef8
  mshoeybi authored Sep 29, 2021
  
  390ddef8
- working · b6555b71
  mshoeybi authored Sep 28, 2021
  
  b6555b71
- working · 6c40f892
  mshoeybi authored Sep 28, 2021
  
  6c40f892
27 Sep, 2021 1 commit
- refactoring, tested and working · 25f9c3f0
  mshoeybi authored Sep 26, 2021
  
  25f9c3f0
24 Sep, 2021 1 commit
- tested and woking · e722c4a9
  mshoeybi authored Sep 23, 2021
  
  e722c4a9
23 Sep, 2021 2 commits
- working · 107c29e8
  mshoeybi authored Sep 22, 2021
  
  107c29e8
- added batching · 2f08c0c3
  mshoeybi authored Sep 22, 2021
  
  2f08c0c3
22 Sep, 2021 3 commits
- sampling · 554d1cc0
  mshoeybi authored Sep 22, 2021
  
  554d1cc0
- sampling · 018c270a
  mshoeybi authored Sep 21, 2021
  
  018c270a
- sampling tested · f1555799
  mshoeybi authored Sep 21, 2021
  
  f1555799
21 Sep, 2021 1 commit
- added sampling · 297a5f33
  mshoeybi authored Sep 21, 2021
  
  297a5f33
20 Sep, 2021 2 commits
- Merge branch 'inference_context_optimization' into 'main' · 87023abd
  Jared Casper authored Sep 20, 2021
```
Inference context optimization

See merge request ADLR/megatron-lm!321
```
  87023abd
- Inference context optimization · 8b9fe87b
  Mohammad Shoeybi authored Sep 20, 2021
  
  8b9fe87b
17 Sep, 2021 3 commits

Merge branch 'add-temperature-parameter-to-server-api' into 'main' · f47aa770
Jared Casper authored Sep 17, 2021
```
Add temperature to the server API

See merge request ADLR/megatron-lm!325
```
f47aa770

Add temperature to the server API · 527e07c0

Robert Clark authored Sep 10, 2021



A temperature value between 0.0 and 100.0 can now be specified via the API
while running the text generation server. The value passed to
--temperature while running the text generation server is kept as the
default value for all API calls that don't include temperature, even if
different values were manually specified previously.
Signed-Off-By: Robert Clark <roclark@nvidia.com>

527e07c0

Merge branch 'fix_initial_broadcasting' into 'main' · a97d676b
Jared Casper authored Sep 17, 2021
```
Fixes a bug in broadcasting that was causing hanging

See merge request ADLR/megatron-lm!327
```
a97d676b

14 Sep, 2021 5 commits
- Merge branch 'debug_harness' into 'main' · 230633f8
  Ryan Prenger authored Sep 14, 2021
```
Changing API to fix LM Harness Evaluation and make API less confusing/terrible

See merge request ADLR/megatron-lm!326
```
  230633f8
- Fixes a bug in broadcasting that was causing hanging · b46482e8
  rprenger authored Sep 14, 2021
  
  b46482e8
- Addressing comments · 593b47b4
  rprenger authored Sep 14, 2021
  
  593b47b4
- Fixing Merge conflict · 2c500111
  rprenger authored Sep 14, 2021
  
  2c500111
- Changing api to tokens_to_generate, making it so we always generate at least tokens_to_generate · 77979e3b
  rprenger authored Sep 14, 2021
  
  77979e3b
13 Sep, 2021 1 commit
- Changing the interface to the lm eval harness and fixing bugs caused by... · 42e83ee0
  rprenger authored Sep 13, 2021
```
Changing the interface to the lm eval harness and fixing bugs caused by misunderstanding out_seq_length
```
  42e83ee0
10 Sep, 2021 1 commit
- Merge branch 'memory_save' into 'main' · c81bf127
  Jared Casper authored Sep 10, 2021
```
Reducing memory usage during inference

See merge request ADLR/megatron-lm!320
```
  c81bf127
08 Sep, 2021 2 commits
- Merge branch 'remove_generate_samples_gpt' into 'main' · 8fe6f9bf
  Jared Casper authored Sep 08, 2021
```
This file doesn't work withthe new inference code, so it should be remove

See merge request ADLR/megatron-lm!322
```
  8fe6f9bf
- This file doesn't work withthe new inference code, so it should be remove · c6f25b30
  rprenger authored Sep 08, 2021
  
  c6f25b30
07 Sep, 2021 2 commits
- Merge branch 'slym/allreduce-overlap' into 'main' · b462d254
  Jared Casper authored Sep 07, 2021
```
use low-priority stream for nccl overlapping all-reduce and gemm

See merge request ADLR/megatron-lm!319
```
  b462d254
- remove increasing nccl stream for overlapping allreduce and gemm · f3e57f6f
  Sangkug Lym authored Sep 06, 2021
  
  f3e57f6f
03 Sep, 2021 3 commits
- Reducing memory usage during inference · faf58b77
  rprenger authored Sep 03, 2021
  
  faf58b77
- Merge branch 'slym/allreduce_custom' into 'main' · 6e1bde1e
  Jared Casper authored Sep 02, 2021
```
allreduce overlap with wgrad gemm

See merge request ADLR/megatron-lm!316
```
  6e1bde1e
- reflect feedback · 4df8b7a2
  slym authored Sep 02, 2021
  
  4df8b7a2
02 Sep, 2021 4 commits
- reflect feedback · 3f652469
  slym authored Sep 02, 2021
  
  3f652469
- minor changes · 16c90445
  slym authored Sep 02, 2021
  
  16c90445
- t # This is a combination of 2 commits. · cf7efd4f
  Sangkug Lym authored Aug 30, 2021
```
allreduce overlap with wgrad gemm

change custom delay to dummy add
```
  cf7efd4f
- Merge branch 'add_all_probs' into 'main' · b7ae685f
  Jared Casper authored Sep 01, 2021
```
Letting server return the log-probabilities of the context and generated text

See merge request ADLR/megatron-lm!317
```
  b7ae685f
01 Sep, 2021 2 commits
- Fixing a bug caused by merge · d1b155c9
  rprenger authored Sep 01, 2021
  
  d1b155c9
- Fixing merge conflicts with main · 6b5ae488
  rprenger authored Sep 01, 2021
  
  6b5ae488
31 Aug, 2021 2 commits
- Fixing merge conflict · 3d718bfc
  rprenger authored Aug 31, 2021
  
  3d718bfc
- Fixing small merge conflict · 9939fb58
  rprenger authored Aug 31, 2021
  
  9939fb58
27 Aug, 2021 1 commit
- Merge branch 'slym/pyt21.08_nvfuser' into 'main' · 3860e995
  Jared Casper authored Aug 27, 2021
```
Use nvfuser at pytorch >= 1.10

See merge request ADLR/megatron-lm!314
```
  3860e995