- 28 May, 2024 8 commits
-
-
Pavel Iakubovskii authored
* Add test for multiple images * [run slow] owlv2 * Fix box rescaling * [run slow] owlv2
-
Pavel Iakubovskii authored
Remove float64
-
oOraph authored
* Unit test to verify fix Signed-off-by: Raphael Glon <oOraph@users.noreply.github.com> * fix from_pretrained in offline mode when model is preloaded in cache Signed-off-by: Raphael Glon <oOraph@users.noreply.github.com> * minor: fmt Signed-off-by: Raphael Glon <oOraph@users.noreply.github.com> --------- Signed-off-by: Raphael Glon <oOraph@users.noreply.github.com> Co-authored-by: Raphael Glon <oOraph@users.noreply.github.com>
-
Hengwen Tong authored
* Remove backend checks in training_args.py * Explicitly initialize the device --------- Co-authored-by: tonghengwen <tonghengwen@cambricon.com>
-
AP authored
Update quicktour.md to fix broken link. Missing '/' in the attention mask link in the transformers quicktour.
-
Clint Adams authored
-
Yih-Dar authored
fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
use main Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
- 27 May, 2024 7 commits
-
-
Yih-Dar authored
skip Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Yoach Lacombe authored
* fix pad_to_max_length Whisper * add tests * make style
-
Marc Sun authored
fix quanto tests
-
amyeroberts authored
-
Eitan Turok authored
* Fix link in dbrx.md * remove "though this may not be up to date" --------- Co-authored-by: Lysandre Debut <hi@lysand.re>
-
Yih-Dar authored
[push-ci-image] Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Aymeric Roucher authored
-
- 24 May, 2024 14 commits
-
-
Pablo Montalvo authored
* fix devices and dtype assignments * [run-slow]paligemma
-
Ita Zaporozhets authored
* seems like `split_special_tokens` is used here * split special token * add new line at end of file * moving split special token test to common tests * added assertions * test * fixup * add co-author * passing rest of args to gptsan_japanese, fixing tests * removing direct comparison of fast and slow models * adding test support for UDOP and LayoutXLM * ruff fix * readd check if slow tokenizer * modify test to handle bos tokens * removing commented function * trigger build * applying review feedback - updated docstrings, var names, and simplified tests * ruff fixes * Update tests/test_tokenization_common.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * applying feedback, comments * shutil temp directory fix --------- Co-authored-by: Arthur Zucker <arthur.zucker@gmail.com> Co-authored-by: Ita Zaporozhets <itazaporozhets@Itas-MBP.localdomain> Co-authored-by: itazap <itazap@users.noreply.github.com> Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> Co-authored-by: Ita Zaporozhets <itazaporozhets@Itas-MacBook-Pro.local>
-
BHUVAN M authored
* added interpolation for vitmae model in pytorch as well as tf. * Update modeling_vit_mae.py irregular import fixed * small changes and proper formatting * changes suggested in review. * modified decoder interpolate_func * arguments and docstring fix * Apply suggestions from code review doc fixes Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Younes Belkada authored
* Fix remaining quant tests * Update test_quanto.py
-
Lucain authored
* Fix resume_download future warning * better like this * Add regression test
-
Yih-Dar authored
* allow multi-gpu * allow multi-gpu --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Marc Sun authored
fix awq mistral test
-
Fanli Lin authored
* enable on xpu * fix style * add comment and mps
-
Yixiang Gao authored
* add test that currently fails * test passed * all perceiver passed * fixup, style, quality, repo-consistency, all passed * Apply suggestions from code review: default to False + compute sqrt once only Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * fix a minor bracket * replace dim with self._num_channels * add arguments to the rest preprocessors --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
-
Yih-Dar authored
* fix * [push-ci-image] * run with latest --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Lucain authored
-
Kevin Koehncke authored
Fix training speed regression introduced by "optimize VRAM for calculating pos_bias in LayoutLM v2, v3 (#26139)" (#30988) * Revert "optimize VRAM for calculating pos_bias in LayoutLM v2, v3 (#26139)" This reverts commit a7e0ed82. * Instead of reverting commit, wrap indexing in torch.no_grad context * Apply wrapping in LayoutLMv2 * Add comments explaining reason for no_grad * Fix code format --------- Co-authored-by: Kevin Koehncke <kevin.koehncke@uipath.com>
-
Ita Zaporozhets authored
* add prefix space ignored in llama #29625 * adding test with add_prefix_space=False * ruff --------- Co-authored-by: Ita Zaporozhets <itazaporozhets@Itas-MBP.localdomain>
-
- 23 May, 2024 11 commits
-
-
Matthias Gerstgrasser authored
* fix wandb always uploading initial model * Update comment. * Optionally log initial model * Revert "Optionally log initial model" This reverts commit 9602cc1fad3feaf218f82a7339a194d3d2fbb946.
-
Yasmin Moslem authored
* Fix typo in tokenization_nllb.py Change `adder_tokens_decoder` into `added_tokens_decoder` and improve the warning's readability. * Fix typo in tokenization_nllb_fast.py Change `adder_tokens_decoder` into `added_tokens_decoder` and improve the warning's readability. * Remove deprecated attributes in tokenization_nllb.py Remove deprecated attributes: `lang_code_to_id`, `fairseq_tokens_to_ids`, `id_to_lang_code`, and `fairseq_ids_to_tokens` * Remove deprecated attribute in tokenization_nllb_fast.py Remove deprecated attribute `lang_code_to_id` * Remove deprecated properties in tokenization_nllb.py Remove deprecated properties - fix format * Remove deprecated properties in tokenization_nllb_fast.py Remove deprecated properties - fix format * Update test_tokenization_nllb.py * update test_tokenization_nllb.py * Update tokenization_nllb.py * Update test_tokenization_seamless_m4t.py * Update test_tokenization_seamless_m4t.py
-
Aritra Roy Gosthipaty authored
* chore: initial commit * chore: adding imports and inits * chore: adding the causal and classification code * chore: adding names to the layers * chore: using single self attn layer * chore: built the model and layers * chore: start with testing * chore: docstring change, transpose fix * fix: rotary embedding * chore: adding cache implementation * remove unused torch * chore: fixing the indexing issue * make fix-copies * Use modeling_tf_utils.keras * make fixup * chore: fixing tests * chore: adding past key value logic * chore: adding multi label classification test * fix: switching on the built parameters in the layers * fixing repo consistency * ruff formats * style changes * fix: tf and pt equivalence * removing returns from docstrings * fix docstrings * fix docstrings * removing todos * fix copies * fix docstring * fix docstring * chore: using easier rotate_half * adding integration tests * chore: addressing review related to rotary embedding layer * review changes * [run-slow] mistral * skip: test save load after resize token embedding * style --------- Co-authored-by: Matt <rocketknight1@gmail.com>
-
Yih-Dar authored
* fix * fix * fix * fix * fix * [run-slow] mpt --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Yasmin Moslem authored
* Add a check that warmup_steps is either 0 or >= 1 Update training_args.py to add a check that warmup_steps is either 0 or >= 1. Otherwise, raise an error. * Update src/transformers/training_args.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
-
Fanli Lin authored
* add xpu check * add marker * add documentation * update doc * fix ci * remove from global init * fix
-
Marc Sun authored
* Fix accelerate tests * fix clip * skip dbrx tests * fix GPTSan * fix M2M100Model * same fix as jamba * fix mt5 * Fix T5Model * Fix umt5 model * fix switch_transformers * fix whisper * fix gptsan again * fix siglip recent test * skip siglip tests * wrong place fixed
-
Younes Belkada authored
* Change in quantization docs * Update overview.md * Update docs/source/en/quantization/overview.md Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com> --------- Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
-
Benjamin Warner authored
add torch.compile dynamic support
-
Poedator authored
-
Younes Belkada authored
* refactor quant docs * delete file * rename to overview * fix * fix table * fix * add content * fix library versions * fix table * fix table * fix table * fix table * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * replace to quantization_config * fix aqlm snippet * add DLAI courses * fix * fix table * fix bullet points --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
-