1. 08 Mar, 2024 12 commits
  2. 07 Mar, 2024 9 commits
  3. 06 Mar, 2024 13 commits
  4. 05 Mar, 2024 6 commits
    • AI4Harmony's avatar
      🌐 [i18n-KO] Translated generation_strategies.md to Korean (#29086) · 7b01579f
      AI4Harmony authored
      
      
      * Update ko _toctree.yml
      
      * Create ko: generation_strategies.md
      
      * Apply suggestions from code review
      Co-authored-by: default avatarSteven Liu <59462357+stevhliu@users.noreply.github.com>
      
      * Apply suggestions from code review
      Co-authored-by: default avatarJungnerd <46880056+jungnerd@users.noreply.github.com>
      
      * Apply suggestions from code review
      Co-authored-by: default avatarJungnerd <46880056+jungnerd@users.noreply.github.com>
      
      ---------
      Co-authored-by: default avatarSteven Liu <59462357+stevhliu@users.noreply.github.com>
      Co-authored-by: default avatarJungnerd <46880056+jungnerd@users.noreply.github.com>
      7b01579f
    • Michael's avatar
      [i18n-zh] Translate add_new_pipeline.md into Chinese (#29432) · 638c423c
      Michael authored
      * [i18n-zh] Translate add_new_pipeline.md into Chinese
      
      * apply suggestions from Fan-Lin
      638c423c
    • Lysandre Debut's avatar
      Automatic safetensors conversion when lacking these files (#29390) · a69cbf4e
      Lysandre Debut authored
      * Automatic safetensors conversion when lacking these files
      
      * Remove debug
      
      * Thread name
      
      * Typo
      
      * Ensure that raises do not affect the main thread
      a69cbf4e
    • Logan Adams's avatar
      Update pytest `import_path` location (#29154) · 9c5e5609
      Logan Adams authored
      * Update to pull function from proper lib
      
      * Fix ruff formatting error
      
      * Remove accidently added file
      9c5e5609
    • AleksanderWWW's avatar
      Fix bug with passing capture_* args to neptune callback (#29041) · 8f3f8e67
      AleksanderWWW authored
      * Fix bug with passing capture_* args to neptune callback
      
      * ruff happy?
      
      * instantiate (frozen)set only once
      
      * code review
      
      * code review 2
      
      * ruff happy?
      
      * code review
      8f3f8e67
    • Arthur's avatar
      [`Add Mamba`] Adds support for the `Mamba` models (#28094) · fb1c62e9
      Arthur authored
      
      
      * initial-commit
      
      * start cleaning
      
      * small nits
      
      * small nits
      
      * current updates
      
      * add kernels
      
      * small refactoring little step
      
      * add comments
      
      * styling
      
      * nit
      
      * nits
      
      * Style
      
      * Small changes
      
      * Push dummy mambda simple slow
      
      * nit
      
      * Use original names
      
      * Use original names and remove norm
      
      * Updates for inference params
      
      * Style nd updates
      
      * nits
      
      * Match logits
      
      * Add a test
      
      * Add expected generated text
      
      * nits doc, imports and styling
      
      * style
      
      * oups
      
      * dont install kernels, invite users to install the required kernels
      
      * let use use the original packages
      
      * styling
      
      * nits
      
      * fix some copieds
      
      * update doc
      
      * fix-copies
      
      * styling done
      
      * nits
      
      * fix import check
      
      * run but wrong cuda ress
      
      * mamba CUDA works :)
      
      * fix the fast path
      
      * config naming nits
      
      * conversion script is not required at this stage
      
      * finish fixing the fast path: generation make sense now!
      
      * nit
      
      * Let's start working on the CIs
      
      * style
      
      * better style
      
      * more nits
      
      * test nit
      
      * quick fix for now
      
      * nits
      
      * nit
      
      * nit
      
      * nit
      
      * nits
      
      * update test rest
      
      * fixup
      
      * update test
      
      * nit
      
      * some fixes
      
      * nits
      
      * update test values
      
      * fix styling
      
      * nit
      
      * support peft
      
      * integrations tests require torchg
      
      * also add slow markers
      
      * styling
      
      * chose forward wisely
      
      * nits
      
      * update tests
      
      * fix gradient checkpointing
      
      * fixup
      
      * nit
      
      * fix doc
      
      * check copies
      
      * fix the docstring
      
      * fix some more tests
      
      * style
      
      * fix beam search
      
      * add init schene
      
      * update
      
      * nit
      
      * fix
      
      * fixup the doc
      
      * fix the doc
      
      * fixup
      
      * tentative update but slow is no longer good
      
      * nit
      
      * should we always use float32?
      
      * nits
      
      * revert wrong changes
      
      * res in float32
      
      * cleanup
      
      * skip fmt for now
      
      * update generation values
      
      * update test values running original model
      
      * fixup
      
      * update tests + rename inference_params to cache_params + make sure training does not use cache_params
      
      * small nits
      
      * more nits
      
      * fix final CIs
      
      * style
      
      * nit doc
      
      * I hope final doc nits
      
      * nit
      
      * 🫠
      
      * final touch!
      
      * fix torch import
      
      * Apply suggestions from code review
      Co-authored-by: default avatarLysandre Debut <hi@lysand.re>
      
      * Apply suggestions from code review
      
      * fix fix and fix
      
      * fix base model prefix!
      
      * nit
      
      * Update src/transformers/models/mamba/__init__.py
      
      * Update docs/source/en/model_doc/mamba.md
      Co-authored-by: default avatarLysandre Debut <hi@lysand.re>
      
      * nit
      
      ---------
      Co-authored-by: default avatarLysandre Debut <hi@lysand.re>
      fb1c62e9