Phuong Nguyen authored
962d9c53
* scaling enum abstract
* rm NVTE_ from ScalingMode names
* rework scaling mode enum in grouped gemm
* fix norm sharding
---------
Signed-off-by: Phuong Nguyen <phuonguyen@nvidia.com>