perf: Avoid copying inputs_embeds tensors to GPU unless prompt_embeds is enabled (#25739)
Signed-off-by: Andrew Sansom <andrew@protopia.ai>