[BugFix] Correct direct copy from bf16 to fp8 (#1090)
* [BugFix] Correct direct copy from bf16 to fp8
* fix lint
* implement overloaded cast codegen for type conversion
* fix lint
* remove test
* fix lint
* trigger CI
* Overload fp8 for implicit conversion
* format
* new format
* fix: Reinterpret types to cute types in GEMM
* new format
* fix lint
* new format
* fix lint
* format
* trigger ci
---------
Co-authored-by: nicunxiao <nicunxiao@bytedance.com>