fix(vllm): classify disagg decode-engine queued requests correctly in FPM (#8471)
Signed-off-by:Hongkuan Zhou <hongkuanz@nvidia.com> Signed-off-by:
hongkuanz <hongkuanz@nvidia.com>
Showing
Please register or sign in to comment
Signed-off-by:Hongkuan Zhou <hongkuanz@nvidia.com> Signed-off-by:
hongkuanz <hongkuanz@nvidia.com>