"vscode:/vscode.git/clone" did not exist on "e624df922d1295fcd31e434f34ae6aa0b30db22d"
[Enhancement] Update group_per_split_token_cast_to_fp8 to support multiple data types (#356)
- Modified the `group_per_split_token_cast_to_fp8` function to support `bfloat16`, `float`, and `float16` data types. - Updated local fragment allocations to use the new `accum_dtype` for consistency. - Enhanced the main execution block to handle different tensor data types based on the specified `dtype`, improving flexibility in tensor operations.
Showing
Please register or sign in to comment