Fix bugs in permutation custom partitioning (#2617)

* Use correct block size for workspace in row id map creation, also shard workspace correctly based on 2nd dim of routing_map/row_id map

Signed-off-by: DoubleCheeseCheetos <hanhdp99@gmail.com>

* reduce size of largest test case on single_GPU scenario to fit on L40 and A100 in CI line up

Signed-off-by: tdophung <hanhdp99@gmail.com>

---------

Signed-off-by: DoubleCheeseCheetos <hanhdp99@gmail.com>
Signed-off-by: tdophung <hanhdp99@gmail.com>
Co-authored-by: DoubleCheeseCheetos <hanhdp99@gmail.com>