  1. 07 Jun, 2024 2 commits
  2. 06 Jun, 2024 5 commits
  3. 05 Jun, 2024 2 commits
  4. 04 Jun, 2024 4 commits
  5. 03 Jun, 2024 4 commits
  6. 28 May, 2024 6 commits
  7. 27 May, 2024 2 commits
  8. 24 May, 2024 4 commits
  9. 23 May, 2024 5 commits
    • Remove deprecated properties in tokenization_nllb.py and tokenization_nllb_fast.py (#29834) · 6d3d5b10
      Yasmin Moslem authored
      * Fix typo in tokenization_nllb.py
      
      Change `adder_tokens_decoder` into `added_tokens_decoder` and improve the warning's readability.
      
      * Fix typo in tokenization_nllb_fast.py
      
      Change `adder_tokens_decoder` into `added_tokens_decoder` and improve the warning's readability.
      
      * Remove deprecated attributes in tokenization_nllb.py
      
      Remove deprecated attributes: `lang_code_to_id`, `fairseq_tokens_to_ids`, `id_to_lang_code`, and `fairseq_ids_to_tokens`
      
      * Remove deprecated attribute in tokenization_nllb_fast.py
      
      Remove deprecated attribute `lang_code_to_id`
      
      * Remove deprecated properties in tokenization_nllb.py
      
      Remove deprecated properties - fix format
      
      * Remove deprecated properties in tokenization_nllb_fast.py
      
      Remove deprecated properties - fix format
      
      * Update test_tokenization_nllb.py
      
      * update test_tokenization_nllb.py
      
      * Update tokenization_nllb.py
      
      * Update test_tokenization_seamless_m4t.py
      
      * Update test_tokenization_seamless_m4t.py
    • [Port] TensorFlow implementation of Mistral (#29708) · 965e98dc
      Aritra Roy Gosthipaty authored
      
      
      * chore: initial commit
      
      * chore: adding imports and inits
      
      * chore: adding the causal and classification code
      
      * chore: adding names to the layers
      
      * chore: using single self attn layer
      
      * chore: built the model and layers
      
      * chore: start with testing
      
      * chore: docstring change, transpose fix
      
      * fix: rotary embedding
      
      * chore: adding cache implementation
      
      * remove unused torch
      
      * chore: fixing the indexing issue
      
      * make fix-copies
      
      * Use modeling_tf_utils.keras
      
      * make fixup
      
      * chore: fixing tests
      
      * chore: adding past key value logic
      
      * chore: adding multi-label classification test
      
      * fix: switching on the built parameters in the layers
      
      * fixing repo consistency
      
      * ruff formats
      
      * style changes
      
      * fix: tf and pt equivalence
      
      * removing returns from docstrings
      
      * fix docstrings
      
      * fix docstrings
      
      * removing todos
      
      * fix copies
      
      * fix docstring
      
      * fix docstring
      
      * chore: using easier rotate_half
      
      * adding integration tests
      
      * chore: addressing review related to rotary embedding layer
      
      * review changes
      
      * [run-slow] mistral
      
      * skip: test save load after resize token embedding
      
      * style
      
      ---------
      Co-authored-by: Matt <rocketknight1@gmail.com>
    • Update 4 `MptIntegrationTests` expected outputs (#30989) · 2a89673f
      Yih-Dar authored
      
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * [run-slow] mpt
      
      ---------
      Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
    • [tests] add `torch.use_deterministic_algorithms` for XPU (#30774) · 21339a52
      Fanli Lin authored
      * add xpu check
      
      * add marker
      
      * add documentation
      
      * update doc
      
      * fix ci
      
      * remove from global init
      
      * fix
    • Fix accelerate failing tests (#30836) · 8366b572
      Marc Sun authored
      * Fix accelerate tests
      
      * fix clip
      
      * skip dbrx tests
      
      * fix GPTSan
      
      * fix M2M100Model
      
      * same fix as jamba
      
      * fix mt5
      
      * Fix T5Model
      
      * Fix umt5 model
      
      * fix switch_transformers
      
      * fix whisper
      
      * fix gptsan again
      
      * fix siglip recent test
      
      * skip siglip tests
      
      * wrong place fixed
  10. 22 May, 2024 4 commits
  11. 21 May, 2024 2 commits