-
Andrew Sansom authored
Fix implementation divergence for BLOOM models between vLLM and HuggingFace when using prompt embeds (#24686) Signed-off-by:Andrew Sansom <andrew@protopia.ai>
ddcec289
Fix implementation divergence for BLOOM models between vLLM and HuggingFace when using prompt embeds (#24686)
Signed-off-by:
Andrew Sansom <andrew@protopia.ai>