Fix implementation divergence for BLOOM models between vLLM and HuggingFace...
Fix implementation divergence for BLOOM models between vLLM and HuggingFace when using prompt embeds (#24686)
Signed-off-by:
Andrew Sansom <andrew@protopia.ai>
Showing
Please register or sign in to comment