***Note: for speculative execution to work, user must put all "gen" in "assistant". Show in "assistant" the desired answer format. Each "gen" term should have a stop token. The stream mode is not supported in speculative execution.
***Note: for speculative execution to work, user must put all "gen" in "assistant".
Show in "assistant" the desired answer format. Each "gen" term should have a stop token.
The stream mode is not supported in speculative execution.
s+=sgl.user("Answer questions in the following format:")
s+=sgl.user("Answer questions in the following format:")
s+=sgl.user("Question 1: What is the capital of France?\nQuestion 2: What is the population of this city?\n")
s+=sgl.user("Question 1: What is the capital of France?\nQuestion 2: What is the population of this city?\n")
s+=sgl.assistant("Answer 1: The capital of France is Paris.\nAnswer 2: The population of Paris in 2024 is estimated to be around 2.1 million for the city proper.\n")
s+=sgl.assistant("Answer 1: The capital of France is Paris.\nAnswer 2: The population of Paris in 2024 is estimated to be around 2.1 million for the city proper.\n")