Unverified commit f67dac97, authored by Younes Belkada, committed by GitHub

[`Nllb-Moe`] Fix nllb moe accelerate issue (#23758)

fix nllb moe accelerate issue
parent d685e330
@@ -856,7 +856,7 @@ class NllbMoePreTrainedModel(PreTrainedModel):
     config_class = NllbMoeConfig
     base_model_prefix = "model"
     supports_gradient_checkpointing = True
-    _no_split_modules = ["NllbMoeAttention"]
+    _no_split_modules = ["NllbMoeEncoderLayer", "NllbMoeDecoderLayer"]

     def _init_weights(self, module):
         """Initialize the weights"""
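For context, `_no_split_modules` tells accelerate's device-map planner which module classes must be kept whole on a single device; listing the full `NllbMoeEncoderLayer` / `NllbMoeDecoderLayer` (rather than just the attention block) prevents a layer's attention and expert feed-forward from landing on different GPUs. The following toy sketch (not accelerate's real algorithm; `plan_device_map` and the sizes are illustrative) shows the idea of placing atomic layers greedily under a per-device budget:

```python
def plan_device_map(layers, per_device_budget):
    """Greedily place atomic layers onto numbered devices.

    Each layer is (name, size). Because the whole encoder/decoder layer
    is treated as atomic (as _no_split_modules requests), a layer is
    never split across two devices -- if it does not fit on the current
    device, it moves entirely to the next one.
    """
    device, used, placement = 0, 0, {}
    for name, size in layers:
        if used + size > per_device_budget:
            device, used = device + 1, 0
        placement[name] = device
        used += size
    return placement


# Example: three layers of size 3 with a budget of 4 units per device --
# each layer lands whole on its own device.
plan = plan_device_map([("enc.0", 3), ("enc.1", 3), ("dec.0", 3)], 4)
```

With only `NllbMoeAttention` marked as no-split, accelerate was free to shard the rest of a MoE layer's submodules across devices, which is the failure this commit addresses.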