1. 28 May, 2024 15 commits
  2. 27 May, 2024 7 commits
  3. 24 May, 2024 14 commits
  4. 23 May, 2024 4 commits
    • Matthias Gerstgrasser's avatar
      Bugfix: WandbCallback uploads initial model checkpoint (#30897) · 6657fb5f
      Matthias Gerstgrasser authored
      * fix wandb always uploading initial model
      
      * Update comment.
      
      * Optionally log initial model
      
      * Revert "Optionally log initial model"
      
      This reverts commit 9602cc1fad3feaf218f82a7339a194d3d2fbb946.
      6657fb5f
    • Yasmin Moslem's avatar
      Remove deprecated properties in tokenization_nllb.py and tokenization_nllb_fast.py (#29834) · 6d3d5b10
      Yasmin Moslem authored
      * Fix typo in tokenization_nllb.py
      
      Change `adder_tokens_decoder` into `added_tokens_decoder` and improve the warning's readability.
      
      * Fix typo in tokenization_nllb_fast.py
      
      Change `adder_tokens_decoder` into `added_tokens_decoder` and improve the warning's readability.
      
      * Remove deprecated attributes in tokenization_nllb.py
      
      Remove deprecated attributes: `lang_code_to_id`, `fairseq_tokens_to_ids`, `id_to_lang_code`, and `fairseq_ids_to_tokens`
      
      * Remove deprecated attribute in tokenization_nllb_fast.py
      
      Remove deprecated attribute `lang_code_to_id`
      
      * Remove deprecated properties in tokenization_nllb.py
      
      Remove deprecated properties - fix format
      
      * Remove deprecated properties in tokenization_nllb_fast.py
      
      Remove deprecated properties - fix format
      
      * Update test_tokenization_nllb.py
      
      * update test_tokenization_nllb.py
      
      * Update tokenization_nllb.py
      
      * Update test_tokenization_seamless_m4t.py
      
      * Update test_tokenization_seamless_m4t.py
      6d3d5b10
    • Aritra Roy Gosthipaty's avatar
      [Port] TensorFlow implementation of Mistral (#29708) · 965e98dc
      Aritra Roy Gosthipaty authored
      
      
      * chore: initial commit
      
      * chore: adding imports and inits
      
      * chore: adding the causal and classification code
      
      * chore: adding names to the layers
      
      * chore: using single self attn layer
      
      * chore: built the model and layers
      
      * chore: start with testing
      
      * chore: docstring change, transpose fix
      
      * fix: rotary embedding
      
      * chore: adding cache implementation
      
      * remove unused torch
      
      * chore: fixing the indexing issue
      
      * make fix-copies
      
      * Use modeling_tf_utils.keras
      
      * make fixup
      
      * chore: fixing tests
      
      * chore: adding past key value logic
      
      * chore: adding multi label classfication test
      
      * fix: switching on the built parameters in the layers
      
      * fixing repo consistency
      
      * ruff formats
      
      * style changes
      
      * fix: tf and pt equivalence
      
      * removing returns from docstrings
      
      * fix docstrings
      
      * fix docstrings
      
      * removing todos
      
      * fix copies
      
      * fix docstring
      
      * fix docstring
      
      * chore: using easier rotate_half
      
      * adding integration tests
      
      * chore: addressing review related to rotary embedding layer
      
      * review changes
      
      * [run-slow] mistral
      
      * skip: test save load after resize token embedding
      
      * style
      
      ---------
      Co-authored-by: default avatarMatt <rocketknight1@gmail.com>
      965e98dc
    • Yih-Dar's avatar
      Update 4 `MptIntegrationTests` expected outputs (#30989) · 2a89673f
      Yih-Dar authored
      
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * fix
      
      * [run-slow] mpt
      
      ---------
      Co-authored-by: default avatarydshieh <ydshieh@users.noreply.github.com>
      2a89673f