"pytorch/cuda/moe_compute_kernel.cu" did not exist on "307e0ad90e2e25fd5f9bb13b489ca1a44d895720"
truncate thinking tags in generations (#3145)
* feat: add postprocessing for generated text to strip stop sequences and thinking tokens * nit * fix: trim leading whitespace after stripping thinking tokens from generation * feat: add think_end_token to model_args * nit * nit * nit * add to readme * nit
Showing
Please register or sign in to comment