[TE/JAX] Prototype for New XLA Custom Calls with FFI (#946)
* implemented custom calls with FFI in csrc
* moved the misc headers to misc.h, added ffi.h
* lowered ActLu and DActLu with ffi_lowering
* lowered CastTranspose with ffi_lowering
* enabled cudaGraph
* added a 4D input test case to TestActivationLu
* added operand_output_aliases for CastTranspose
* added env var NVTE_JAX_WITH_FFI, default value = 1
* replaced casting of ActivationEnum with taking its value
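
A minimal sketch of how a toggle such as NVTE_JAX_WITH_FFI could be read, given only what the list above states: the variable exists and defaults to 1. The helper name `ffi_enabled` is hypothetical, and treating any non-zero value as "use the FFI path" is an assumption; the commit message does not show the actual parsing code.

```python
import os


def ffi_enabled(environ=None) -> bool:
    """Hypothetical helper, not taken from the commit.

    Returns True unless NVTE_JAX_WITH_FFI is explicitly set to 0.
    The default of "1" mirrors the default stated in the commit message.
    """
    env = os.environ if environ is None else environ
    # Assumption: the variable holds an integer; non-zero selects the FFI path.
    return int(env.get("NVTE_JAX_WITH_FFI", "1")) != 0


if __name__ == "__main__":
    path = "FFI custom calls" if ffi_enabled() else "legacy custom calls"
    print(f"Lowering would use: {path}")
```

Under that assumption, setting `NVTE_JAX_WITH_FFI=0` before launching the process would fall back to the non-FFI lowering.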
---------
Signed-off-by: Phuong Nguyen <phuonguyen@nvidia.com>