-
Jared Casper authored
bug fix for previous MR (Flag to call empty_cache() each iteration, to reduce fragmentation) See merge request ADLR/megatron-lm!309
68797d90
bug fix for previous MR (Flag to call empty_cache() each iteration, to reduce fragmentation) See merge request ADLR/megatron-lm!309