Improve data-parallel request partitioning for VLLM (#1477)
* add undistribute + use more_itertools * remove divide() util fn * add more_itertools as dependency
Showing
Please register or sign in to comment
* add undistribute + use more_itertools * remove divide() util fn * add more_itertools as dependency