In the example above, the server uses the chat template specified in the model tokenizer.
You can override the chat template if needed when launching the server:
By default, the server uses the chat template specified in the model's tokenizer from Hugging Face. It should work without changes for most official models, such as Llama-2 and Llama-3.
If needed, you can also override the chat template when launching the server: