"git@developer.sourcefind.cn:kecinstone/2024-pra-vllm.git" did not exist on "805de738f618f8b47ab0d450423d23db1e636fa2"
[Common] Bucket batch size with higher granularity for THD (#2653)
bucket max_b with more granularity when >512
Signed-off-by:
Charlene Yang <8636796+cyanguwa@users.noreply.github.com>
Showing
Please register or sign in to comment