Merge branch 'transformer_refactoring_from_pretrain_refactoring' into 'master'
Major refactoring, combining gpt2 and bert See merge request ADLR/megatron-lm!8
Showing
This diff is collapsed.
Please register or sign in to comment
Major refactoring, combining gpt2 and bert See merge request ADLR/megatron-lm!8