Commit 8b9241be authored by Cade Daniel, committed by GitHub

[Speculative decoding] [Bugfix] Fix overallocation in ngram + spec logprobs (#4672)

parent f942efb5
@@ -138,7 +138,7 @@ class NGramWorker(LoraNotSupportedWorkerBase):
SamplerOutput(
outputs=None,
sampled_token_probs=token_probs[i],
-logprobs=token_logprobs,
+logprobs=token_logprobs[i],
sampled_token_ids=token_ids[i],
))
return outputs, False
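
The effect of the one-line change can be sketched in plain Python. This is a minimal illustration, not the vLLM implementation: `Output` is a hypothetical stand-in for `SamplerOutput`, nested lists stand in for tensors, and the sizes are made up. The hunk does not show what `i` ranges over, so the sketch only assumes that `token_ids`, `token_probs`, and `token_logprobs` share the same leading dimension. It counts how many logprob values each output exposes; it does not measure memory.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Output:
    # Hypothetical stand-in for SamplerOutput, limited to the fields in the hunk.
    sampled_token_probs: List[float]
    logprobs: list
    sampled_token_ids: List[int]


def count_values(x) -> int:
    """Count the scalar leaves of a nested list."""
    if isinstance(x, list):
        return sum(count_values(e) for e in x)
    return 1


def build_outputs(token_ids, token_probs, token_logprobs, index_logprobs: bool):
    """Build one Output per entry, with or without indexing token_logprobs."""
    outputs = []
    for i in range(len(token_ids)):
        outputs.append(
            Output(
                sampled_token_probs=token_probs[i],
                # Before the fix: the whole structure; after: entry i only.
                logprobs=token_logprobs[i] if index_logprobs else token_logprobs,
                sampled_token_ids=token_ids[i],
            ))
    return outputs


# Assumed toy sizes: 3 entries, vocabulary of 4.
n, vocab = 3, 4
token_ids = [[j] for j in range(n)]
token_probs = [[0.0] * vocab for _ in range(n)]
token_logprobs = [[0.0] * vocab for _ in range(n)]

before = build_outputs(token_ids, token_probs, token_logprobs, index_logprobs=False)
after = build_outputs(token_ids, token_probs, token_logprobs, index_logprobs=True)

before_total = sum(count_values(o.logprobs) for o in before)
after_total = sum(count_values(o.logprobs) for o in after)
print(before_total, after_total)
```

Without the index, every output carries all `n * vocab` logprob values, so the total grows with `n * n * vocab`; with `[i]`, each output carries only its own `vocab` values, matching how `token_probs[i]` and `token_ids[i]` are already passed.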