Commit 69a546be authored by Deepak Narayanan

Small bugfix in bert_model.py: make sure word_embeddings is initialized before instantiating lm_head
parent 1979c242
@@ -149,6 +149,7 @@ class BertModelBase(PipelinedMegatronModule):
             init_method=init_method,
             scaled_init_method=scaled_init_method)
 
+        self.initialize_word_embeddings(init_method_normal)
         if mpu.is_pipeline_last_stage():
             self.lm_head = BertLMHead(
                 self.word_embeddings_weight().size(0),
@@ -160,8 +161,6 @@ class BertModelBase(PipelinedMegatronModule):
                 init_method)
             self._binary_head_key = 'binary_head'
 
-        self.initialize_word_embeddings(init_method_normal)
-
     def forward(self, bert_model_input, attention_mask,
                 tokentype_ids=None, lm_labels=None):
...
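The ordering matters because BertLMHead's first constructor argument is the vocabulary size, read from the shared word-embedding weight via self.word_embeddings_weight(); before this fix, initialize_word_embeddings ran only after the lm_head was built, so that call touched an embedding that did not exist yet. A minimal toy sketch of the same dependency, using hypothetical names rather than the real Megatron classes:

import torch.nn as nn

# Toy sketch of the ordering dependency fixed above (hypothetical names;
# not the Megatron code). The LM head reads the embedding weight's vocab
# dimension, so the embedding must be created first.
class ToyBert(nn.Module):
    def __init__(self, vocab_size=32, hidden=16):
        super().__init__()
        # Fix: create word_embeddings *before* the head that reads its
        # shape; swapping these two statements raises AttributeError.
        self.word_embeddings = nn.Embedding(vocab_size, hidden)
        self.lm_head = nn.Linear(hidden,
                                 self.word_embeddings.weight.size(0))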