Commit f155ae89 authored by klhhhhh, committed by Hongxin Liu

[shardformer] support layernorm sharding for ChatGLM

parent 00f6ef15
@@ -417,7 +417,7 @@ class SelfAttention(torch.nn.Module):
         )
 =======
         self.dense = nn.Linear(self.projection_size,
-                               self.hidden_size,
+                               config.hidden_size,
                                bias=config.add_bias_linear,
                                device=device,
                                **_config_to_kwargs(config))
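For context, the change reads the output dimension of the attention's dense projection from `config.hidden_size` instead of the module attribute `self.hidden_size`. A plausible reason is that under tensor-parallel sharding, per-module attributes may be rewritten to per-rank sizes, while the output projection must still map back to the model's full hidden size. The sketch below illustrates that idea only; the `DummyConfig` class, the `world_size` value, and the tensor shapes are assumptions for illustration, not part of this commit.

```python
import torch
import torch.nn as nn

# Hypothetical config standing in for the ChatGLM config object.
class DummyConfig:
    hidden_size = 4096
    add_bias_linear = False

config = DummyConfig()
world_size = 2  # assumed tensor-parallel degree

# Under sharding, the per-rank projection size is a slice of the full width.
projection_size = config.hidden_size // world_size

# The dense layer maps the (possibly sharded) projection back to the full
# hidden size, so its output dimension is taken from the config rather than
# from an attribute that sharding may have rewritten to a per-rank value.
dense = nn.Linear(projection_size,
                  config.hidden_size,
                  bias=config.add_bias_linear)

x = torch.randn(1, 8, projection_size)   # (batch, seq, per-rank width)
out = dense(x)
assert out.shape[-1] == config.hidden_size  # full width restored
```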