1. 01 Aug, 2024 4 commits
  2. 31 Jul, 2024 4 commits
  3. 30 Jul, 2024 1 commit
    • Joshua Lochner's avatar
      Fix slow GemmaTokenizer and improve SPM slow -> fast conversion process (#32191) · 6e2d04e4
      Joshua Lochner authored
      * Remove user-defined tokens which can be obtained through merges
      
      * Remove debug line
      
      * formatting
      
      * Refactor spm slow -> fast converter
      
      * revert unnecessary refactor
      
      * set comprehension
      
      * remove test files
      
      * Use `vocab_scores`
      
      * Always replace spiece underline with space in decode
      
      * we no longer need token filtering
      
      * Add save fast load slow unit test
      
      * Remove tokenizers version check
      
      * Remove duplicate code
      
      * Make `<start_of_turn>` and `<end_of_turn>` special tokens
      
      * Bias merge priority with length if score is the same
      
      * Add unit test for merge priority
      
      * CI
      6e2d04e4
  4. 29 Jul, 2024 5 commits
  5. 26 Jul, 2024 5 commits
  6. 25 Jul, 2024 3 commits
  7. 24 Jul, 2024 5 commits
  8. 23 Jul, 2024 12 commits
  9. 22 Jul, 2024 1 commit
    • Marc Sun's avatar
      Add new quant method (#32047) · 96a074fa
      Marc Sun authored
      * Add new quant method
      
      * update
      
      * fix multi-device
      
      * add test
      
      * add offload
      
      * style
      
      * style
      
      * add simple example
      
      * initial doc
      
      * docstring
      
      * style again
      
      * works ?
      
      * better docs
      
      * switch to non persistant
      
      * remove print
      
      * fix init
      
      * code review
      96a074fa