"docs/source/deployment/nginx.md" did not exist on "32aa2059addd97be1afce7a199d228191710c294"
Unverified Commit c2a2f19a authored by Alex Brooks's avatar Alex Brooks Committed by GitHub
Browse files

[Bugfix] Fix Tensor Parallelism Padding Consistency in Granite Models (#20843)


Signed-off-by: default avatarAlex-Brooks <Alex.Brooks@ibm.com>
parent 2c11a738
......@@ -273,6 +273,10 @@ class GraniteModel(nn.Module):
self.vocab_size,
config.hidden_size,
org_num_embeddings=config.vocab_size,
padding_size=DEFAULT_VOCAB_PADDING_SIZE
# We need bigger padding if using lora for kernel
# compatibility
if not lora_config else lora_config.lora_vocab_padding_size,
quant_config=quant_config,
)
else:
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment