"...composable_kernel.git" did not exist on "f689a1554ee6b958d0b643e3bfea53cf49307e03"
[Bugfix] Fix X_amax Correctness Issue in Group Cast FP8 (#345)
- Modified the `group_per_split_token_cast_to_fp8` function to include a conditional check for batch sizes, ensuring that the scaling factor is applied only when within the valid range. This change enhances the robustness of the FP8 conversion process for grouped per-split tokens.
Showing
Please register or sign in to comment