Default to 'align' mamba cache mode for Mamba-based models when speculative decoding is enabled (#40454)
Signed-off-by: Roi Koren <roik@nvidia.com>