Fixing gelu_checkpointing memory issue (#812)
* fixing buffers in transformer kernel when gelu-checkpoint is enabled * fixing the test issue for other memory optimization flags * fixing a bug for when attn_dropout_checkpoint is enabled
Showing
csrc/transformer/ds_transformer_cuda.cpp
100755 → 100644
Please register or sign in to comment