- Introduction to integrating TE into an existing JAX model framework, building a Transformer Layer, and instructions on integrating TE modules like Linear and LayerNorm.
# Third party
# Third party
-[Hugging Face Accelerate + TE](https://github.com/huggingface/accelerate/tree/main/benchmarks/fp8/transformer_engine)
-[Hugging Face Accelerate + TE](https://github.com/huggingface/accelerate/tree/main/benchmarks/fp8/transformer_engine)
- Scripts for training with Accelerate and TE. Supports single GPU, and multi-GPU via DDP, FSDP, and DeepSpeed ZeRO 1-3.
- Scripts for training with Accelerate and TE. Supports single GPU, and multi-GPU via DDP, FSDP, and DeepSpeed ZeRO 1-3.
mv"${WHL_BASE}/${WHL_BASE}.dist-info""${WHL_BASE}/transformer_engine_cu12-${VERSION}.dist-info"|| error_exit "Failed to move ${WHL_BASE}.dist-info to transformer_engine_cu12-${VERSION}.dist-info"
mv"${WHL_BASE}/${WHL_BASE}.dist-info""${WHL_BASE}/transformer_engine_cu12-${VERSION}.dist-info"|| error_exit "Failed to move ${WHL_BASE}.dist-info to transformer_engine_cu12-${VERSION}.dist-info"
wheel pack ${WHL_BASE}|| error_exit "Failed to pack ${WHL_BASE}"
python3 -mwheel pack ${WHL_BASE}|| error_exit "Failed to pack ${WHL_BASE}"
rm dist/*.whl || error_exit "Failed to remove dist/*.whl"
rm dist/*.whl || error_exit "Failed to remove dist/*.whl"
mv*.whl dist/ || error_exit "Failed to move *.whl to dist/"
mv*.whl dist/ || error_exit "Failed to move *.whl to dist/"