    Fixing gelu_checkpointing memory issue (#812) · 8295d7a8
    Reza Yazdani authored
    * fixing buffers in transformer kernel when gelu-checkpoint is enabled
    
    * fixing the test issue for other memory optimization flags
    
    * fixing a bug for when attn_dropout_checkpoint is enabled
ds_transformer_cuda.cpp 46.2 KB