1. 06 Oct, 2023 3 commits
    • Tianqi Liu's avatar
      Update tokenization_code_llama_fast.py (#26576) · 65aabafe
      Tianqi Liu authored
      * Update tokenization_code_llama_fast.py
      
      * Update test_tokenization_code_llama.py
      
      * Update test_tokenization_code_llama.py
      65aabafe
    • Towdo's avatar
      af38c837
    • Ramiro Leal-Cavazos's avatar
      Remove unnecessary `view`s of `position_ids` (#26059) · 8878eb1b
      Ramiro Leal-Cavazos authored
      * Remove unnecessary `view` of `position_ids` in `modeling_llama`
      
      When `position_ids` is `None`, its value is generated using
      `torch.arange`, which creates a tensor of size `(seq_length +
      past_key_values_length) - past_key_values_length = seq_length`. The
      tensor is then unsqueezed, resulting in a tensor of shape `(1,
      seq_length)`. This means that the last `view` to a tensor of shape
      `(-1, seq_length)` is a no-op.
      
      This commit removes the unnecessary view.
      
      * Remove no-op `view` of `position_ids` in rest of transformer models
      8878eb1b
  2. 05 Oct, 2023 11 commits
  3. 04 Oct, 2023 14 commits
  4. 03 Oct, 2023 12 commits