1. 06 Aug, 2024 1 commit
  2. 05 Aug, 2024 6 commits
  3. 02 Aug, 2024 1 commit
  4. 01 Aug, 2024 7 commits
  5. 31 Jul, 2024 4 commits
  6. 30 Jul, 2024 1 commit
    • Fix slow GemmaTokenizer and improve SPM slow -> fast conversion process (#32191) · 6e2d04e4
      Joshua Lochner authored
      * Remove user-defined tokens which can be obtained through merges
      
      * Remove debug line
      
      * formatting
      
      * Refactor spm slow -> fast converter
      
      * revert unnecessary refactor
      
      * set comprehension
      
      * remove test files
      
      * Use `vocab_scores`
      
      * Always replace spiece underline with space in decode
      
      * we no longer need token filtering
      
      * Add save fast load slow unit test
      
      * Remove tokenizers version check
      
      * Remove duplicate code
      
      * Make `<start_of_turn>` and `<end_of_turn>` special tokens
      
      * Bias merge priority with length if score is the same
      
      * Add unit test for merge priority
      
      * CI
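The "Bias merge priority with length if score is the same" step in the commit above can be illustrated with a small sketch. This is not the code from the PR: the function name `rank_merges`, the toy vocabulary, and the exact tie-break direction (longer left piece first, then longer right piece) are assumptions chosen for illustration. It only shows the general idea of deriving BPE-style merges from SentencePiece piece scores and ordering them by score, with piece length as a tie-breaker.

```python
from typing import Dict, List, Tuple


def rank_merges(vocab_scores: Dict[str, float]) -> List[Tuple[str, str]]:
    """Derive candidate merges from piece scores and order them by priority.

    Hypothetical helper: a piece yields a merge (left, right) whenever both
    halves are themselves in the vocabulary. Candidates are ordered by score
    (higher first); ties are broken by the length of the left piece, then of
    the right piece (longer first). The tie-break direction is an assumption.
    """
    vocab = set(vocab_scores)
    candidates = []
    for piece, score in vocab_scores.items():
        for i in range(1, len(piece)):
            left, right = piece[:i], piece[i:]
            if left in vocab and right in vocab:
                candidates.append((left, right, score))

    # Sort on (score, len(left), len(right)), highest first.
    candidates.sort(key=lambda m: (m[2], len(m[0]), len(m[1])), reverse=True)
    return [(left, right) for left, right, _ in candidates]


# Toy scores, invented for this example.
toy_scores = {"a": 0.0, "b": 0.0, "c": 0.0, "ab": -1.0, "bc": -1.0, "abc": -2.0}
merges = rank_merges(toy_scores)
```

In this toy vocabulary, "abc" can be split as ("ab", "c") or ("a", "bc"). Both candidates carry the same score, so the length-based tie-break decides which one ranks first.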
  7. 29 Jul, 2024 5 commits
  8. 26 Jul, 2024 5 commits
  9. 25 Jul, 2024 3 commits
  10. 24 Jul, 2024 5 commits
  11. 23 Jul, 2024 2 commits
    • Updated `ruff` to the latest version (#31926) · d2c687b3
      Sai-Suraj-27 authored
      * Updated ruff version and fixed the required code according to the latest version.
      
      * Updated ruff version and fixed the required code according to the latest version.
      
      * Added noqa directive to ignore 1 error shown by ruff
    • Enhancing SFT Training Efficiency Using Packing and FlashAttention2 with Position IDs (#31629) · 9cf4f2aa
      RhuiDih authored
      * add DataCollatorBatchFlattening
      
      * Update data_collator.py
      
      * change name
      
      * new FA2 flow if position_ids is provided
      
      * add comments
      
      * minor fix
      
      * minor fix data collator
      
      * add test cases for models
      
      * add test case for data collator
      
      * remove extra code
      
      * formatting for ruff check and check_repo.py
      
      * ruff format
      
      ruff format tests src utils
      
      * custom_init_isort.py
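The packing idea named in the commit above (a flattening data collator plus position IDs that let the attention kernel find sequence boundaries) can be sketched in plain Python. This is a minimal illustration, not the collator added by the PR: the names `flatten_batch` and `cu_seqlens_from_position_ids`, the use of `-100` as the ignored label at each sequence start, and the list-based output are all assumptions made for the example.

```python
from typing import Dict, List


def flatten_batch(examples: List[Dict[str, List[int]]],
                  ignore_index: int = -100) -> Dict[str, List[List[int]]]:
    """Concatenate examples into one row without padding (hypothetical helper).

    Position IDs restart at 0 for every example, so boundaries can be
    recovered later. The first label of each example is masked (assumed
    convention) so that no token is predicted across a boundary.
    """
    input_ids, position_ids, labels = [], [], []
    for example in examples:
        ids = list(example["input_ids"])
        if not ids:
            continue  # skip empty examples
        input_ids.extend(ids)
        position_ids.extend(range(len(ids)))
        labels.extend([ignore_index] + ids[1:])
    return {
        "input_ids": [input_ids],
        "position_ids": [position_ids],
        "labels": [labels],
    }


def cu_seqlens_from_position_ids(position_ids: List[int]) -> List[int]:
    """Recover cumulative sequence lengths: each 0 marks a sequence start."""
    starts = [i for i, pos in enumerate(position_ids) if pos == 0]
    return starts + [len(position_ids)]


batch = flatten_batch([{"input_ids": [5, 6, 7]}, {"input_ids": [8, 9]}])
cu_seqlens = cu_seqlens_from_position_ids(batch["position_ids"][0])
```

Here the two examples become a single row of five tokens with position IDs `[0, 1, 2, 0, 1]`, and the boundaries `[0, 3, 5]` are the kind of cumulative lengths a variable-length attention kernel takes as input.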