Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
98f97133
Unverified
Commit
98f97133
authored
Sep 17, 2024
by
Joe Runde
Committed by
GitHub
Sep 17, 2024
Browse files
[Bugfix] Fix TP > 1 for new granite (#8544)
Signed-off-by:
Joe Runde
<
Joseph.Runde@ibm.com
>
parent
56c3de01
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
2 additions
and
1 deletion
+2
-1
vllm/model_executor/models/granite.py
vllm/model_executor/models/granite.py
+2
-1
No files found.
vllm/model_executor/models/granite.py
View file @
98f97133
...
@@ -428,7 +428,8 @@ class GraniteForCausalLM(nn.Module, SupportsLoRA):
...
@@ -428,7 +428,8 @@ class GraniteForCausalLM(nn.Module, SupportsLoRA):
sampling_metadata
:
SamplingMetadata
)
->
Optional
[
torch
.
Tensor
]:
sampling_metadata
:
SamplingMetadata
)
->
Optional
[
torch
.
Tensor
]:
logits
=
self
.
logits_processor
(
self
.
lm_head
,
hidden_states
,
logits
=
self
.
logits_processor
(
self
.
lm_head
,
hidden_states
,
sampling_metadata
)
sampling_metadata
)
logits
/=
self
.
config
.
logits_scaling
if
logits
is
not
None
:
logits
/=
self
.
config
.
logits_scaling
return
logits
return
logits
def
sample
(
def
sample
(
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment