"examples/vscode:/vscode.git/clone" did not exist on "228cdd6a6ea60da794dab84ebdd14ec4b09b9c00"
  • RhuiDih's avatar
    Enhancing SFT Training Efficiency Using Packing and FlashAttention2 with Position IDs (#31629) · 9cf4f2aa
    RhuiDih authored
    * add DataCollatorBatchFlattening
    
    * Update data_collator.py
    
    * change name
    
    * new FA2 flow if position_ids is provided
    
    * add comments
    
    * minor fix
    
    * minor fix data collator
    
    * add test cases for models
    
    * add test case for data collator
    
    * remove extra code
    
    * formating for ruff check and check_repo.py
    
    * ruff format
    
    ruff format tests src utils
    
    * custom_init_isort.py
    9cf4f2aa
test_modeling_common.py 218 KB