• Stas Bekman's avatar
    [Deepspeed] add support for bf16 mode (#14569) · 580dd87c
    Stas Bekman authored
    
    
    * [WIP] add support for bf16 mode
    
    * prep for bf16
    
    * prep for bf16
    
    * fix; zero2/bf16 is ok
    
    * check bf16 is available
    
    * test fixes
    
    * enable zero3_bf16
    
    * config files
    
    * docs
    
    * split stage_dtype; merge back to non-dtype-specific config file
    
    * fix doc
    
    * cleanup
    
    * cleanup
    
    * bfloat16 => bf16 to match the PR changes
    
    * s/zero_gather_fp16_weights_on_model_save/zero_gather_16bit_weights_on_model_save/; s/save_fp16_model/save_16bit_model/
    
    * test fixes/skipping
    
    * move
    
    * fix
    
    * Update docs/source/main_classes/deepspeed.mdx
    Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
    
    * backticks
    
    * cleanup
    
    * cleanup
    
    * cleanup
    
    * new version
    
    * add note about grad accum in bf16
    Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
    580dd87c
deepspeed.mdx 76.9 KB