- 19 May, 2022 2 commits
-
-
Jared Casper authored
-
Jared Casper authored
-
- 16 May, 2022 1 commit
-
-
Jared Casper authored
Fourth phase vision merge: classification and segmentation tasks. See merge request ADLR/megatron-lm!400
-
- 13 May, 2022 1 commit
-
-
Vijay Korthikanti authored
-
- 10 May, 2022 1 commit
-
-
Jared Casper authored
jit function warmups to (1) match fprop and recompute results and (2) remove grad_enable. See merge request ADLR/megatron-lm!404
-
- 09 May, 2022 1 commit
-
-
Sangkug Lym authored
change dummy_handler to nullcontext
-
- 25 Mar, 2022 1 commit
-
-
Sangkug Lym authored
-
- 15 Mar, 2022 3 commits
-
-
Jared Casper authored
Bugfix: Pipeline output deallocation w/ persistent layer norm. See merge request ADLR/megatron-lm!402
-
Lawrence McAfee authored
-
Lawrence McAfee authored
-
- 01 Mar, 2022 3 commits
-
-
Vijay Korthikanti authored
-
Vijay Korthikanti authored
-
Vijay Korthikanti authored
-
- 24 Feb, 2022 2 commits
-
-
Lawrence McAfee authored
Switch Transformer. See merge request ADLR/megatron-lm!373
-
rprenger authored
-
- 23 Feb, 2022 1 commit
-
-
Jared Casper authored
Gradient accumulation fusion. See merge request ADLR/megatron-lm!394
-
- 18 Feb, 2022 1 commit
-
-
Sangkug Lym authored
-
- 17 Feb, 2022 6 commits
-
-
Jared Casper authored
changed all_gather to _all_gather_base in distributed checkpointing. See merge request ADLR/megatron-lm!395
-
rprenger authored
-
mshoeybi authored
-
mshoeybi authored
-
Lawrence McAfee authored
-
Lawrence McAfee authored
-
- 16 Feb, 2022 1 commit
-
-
Sangkug Lym authored
remove redundant linear layer class definition; add fuse_gradient_accumulation attribute to weights for simple targeting; reflect feedback and clean up the code; arg change
-
- 15 Feb, 2022 5 commits
-
-
Vijay Korthikanti authored
-
Vijay Korthikanti authored
-
Vijay Korthikanti authored
-
Vijay Korthikanti authored
-
Jared Casper authored
vision third phase merge: pretraining methods + mit, swin backbones. See merge request ADLR/megatron-lm!384
-
- 11 Feb, 2022 5 commits
-
-
Vijay Korthikanti authored
-
Mohammad Shoeybi authored
Remove specific versions of pytorch, etc. from README so it doesn't go out of date. See merge request ADLR/megatron-lm!392
-
Jared Casper authored
-
Mohammad Shoeybi authored
Adding several things to the text_generation_server that were necessary for the demos. See merge request ADLR/megatron-lm!350
-
rprenger authored
-
- 08 Feb, 2022 2 commits
-
-
Jared Casper authored
Standalone embedding stage. See merge request ADLR/megatron-lm!385
-
Lawrence McAfee authored
-
- 04 Feb, 2022 3 commits
-
-
Lawrence McAfee authored
-
rprenger authored
-
rprenger authored
-
- 03 Feb, 2022 1 commit
-
-
rprenger authored
-