feat: use RNG when dp routing targets are tied; override no-assume-kv-reuse for decode requests (#6253)
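
The tie-breaking half of this change can be illustrated with a minimal sketch. It is not the code from this commit: the function name `pick_dp_target`, the `loads` mapping, and the idea of scoring ranks by a single load number are hypothetical, chosen only to show what breaking ties with an RNG looks like instead of always taking the first tied rank.

```python
import random
from typing import Mapping, Optional


def pick_dp_target(
    loads: Mapping[int, float],
    rng: Optional[random.Random] = None,
) -> int:
    """Return the dp rank with the lowest load, breaking ties at random.

    `loads` maps a hypothetical dp rank id to a hypothetical cost value;
    the real router's scoring is not shown in this commit message.
    """
    if not loads:
        raise ValueError("no routing targets")
    rng = rng or random.Random()
    best = min(loads.values())
    # Collect every rank that shares the best score, in a stable order,
    # so a seeded RNG gives reproducible picks.
    tied = sorted(rank for rank, load in loads.items() if load == best)
    return rng.choice(tied)
```

Without the random pick, tied targets would always resolve to the same rank, which can concentrate requests on one worker while equally good ones stay idle. Passing a seeded `random.Random` keeps the choice reproducible in tests.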
Signed-off-by: Karen Chung <karenc@nvidia.com>