- 17 May, 2024 1 commit
-
-
amyeroberts authored
* Remove deprecated logic and warnings
* Add back some code that seems to be important...
* Let's just add all the nllb stuff back; removing it is a bit more involved
* Remove kwargs
* Remove more kwargs
-
- 10 May, 2024 1 commit
-
-
Aaron Jimenez authored
* update link * run make style
-
- 08 May, 2024 1 commit
-
-
Pavel Iakubovskii authored
* Training script for object detection
* Evaluation script for object detection
* Training script for object detection with eval loop outside trainer
* Trainer DETR finetuning
* No trainer DETR finetuning
* Eval script
* Refine object detection example with trainer
* Remove commented code and enable telemetry
* No trainer example
* Add requirements for object detection examples
* Add test for trainer example
* Readme draft
* Fix uploading to HUB
* Readme improvements
* Update eval script
* Adding tests for object-detection examples
* Add object-detection example
* Add object-detection resources to docs
* Update README with custom dataset instructions
* Update year
* Replace valid with validation
* Update instructions for custom dataset
* Remove eval script
* Remove use_auth_token
* Add copied from and telemetry
* Fixup
* Update readme
* Fix id2label
* Fix links in docs
* Update examples/pytorch/object-detection/run_object_detection.py Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Update examples/pytorch/object-detection/run_object_detection.py Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
* Move description to the top
* Fix Trainer example
* Update no trainer example
* Update albumentations version
---------
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
-
- 02 May, 2024 1 commit
-
-
Bai Li authored
* Fix memory leak with CTC training script on Chinese languages * Fix lint
-
- 01 May, 2024 2 commits
-
-
amyeroberts authored
Fix --model_type in examples
-
Matt authored
* Handle cases when CLS token is absent * Use BOS token as a fallback
-
- 30 Apr, 2024 1 commit
-
-
Anton Vlasjuk authored
* fix seq2seq data collator to respect the given padding strategy; further added tests for the seq2seq data collator in the style of the `data_collator_for_token_classification` tests (pt, tf, np)
* formatting and change bool equals "==" to "is"
* add missed return types in tests
* update numpy test, as it can handle unequal shapes, unlike pt or tf
-
- 26 Apr, 2024 1 commit
-
-
Sanchit Gandhi authored
* [examples] update whisper fine-tuning * deprecate forced/suppress tokens * item assignment * update readme * final fix
-
- 18 Apr, 2024 2 commits
-
-
Zach Mueller authored
* Alias * Note alias * Tests and src * Rest * Clean * Change typing? * Fix tests * Deprecation versions
-
Lysandre authored
-
- 17 Apr, 2024 1 commit
-
-
Yih-Dar authored
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
- 15 Apr, 2024 1 commit
-
-
JINO ROHIT authored
-
- 10 Apr, 2024 1 commit
-
-
Pavel Iakubovskii authored
* Remove unused augmentation
* Fix pad_if_smaller() and remove unused augmentation
* Add indentation
* Fix requirements
* Update dataset use instructions
* Replace transforms with albumentations
* Replace identity transform with None
* Fixing formatting
* Fixed comment place
-
- 09 Apr, 2024 1 commit
-
-
NielsRogge authored
* Undo * Use tokenizer * Undo data collator
-
- 08 Apr, 2024 2 commits
-
-
JINO ROHIT authored
-
Haz Sameen Shahgir authored
updated examples/pytorch/language-modeling scripts and requirements.txt to require datasets>=2.14.0 (#30120)
updated requirements.txt and require_version() calls in examples/pytorch/language-modeling to require datasets>=2.14.0
-
- 05 Apr, 2024 1 commit
-
-
NielsRogge authored
* Add image processor to trainer * Replace tokenizer=image_processor everywhere
-
- 02 Apr, 2024 1 commit
-
-
Mario Šaško authored
-
- 30 Mar, 2024 1 commit
-
-
Jacky Lee authored
* improve: error message for best model metric * update: raise warning instead of error
-
- 21 Mar, 2024 1 commit
-
-
Jacky Lee authored
feat: add support for torch_dtype
Co-authored-by: Jacky Lee <jackylee328@gmail.com>
-
- 20 Mar, 2024 1 commit
-
-
Arthur Zucker authored
-
- 15 Mar, 2024 1 commit
-
-
Quentin Lhoest authored
* Update run_glue.py * Update run_glue.py * Update run_glue_no_trainer.py
-
- 12 Mar, 2024 2 commits
-
-
Joao Gante authored
check max_position_embeddings
-
Hilco van der Wilk authored
* Update legacy Repository usage in `examples/pytorch/text-classification/run_glue_no_trainer.py`. Marked for deprecation here: https://huggingface.co/docs/huggingface_hub/guides/upload#legacy-upload-files-with-git-lfs
* Fix import order
* Replace all example usage of deprecated Repository
* Fix remaining repo call and rename args variable
* Revert removing creation of gitignore files and don't change research examples
-
- 11 Mar, 2024 2 commits
-
-
Yitong Huang authored
* add USE_TORCH_XLA env
* rename torch_tpu to torch_xla
* better is_torch_xla_available; fix some fsdp and performance issues
* fix format
* fix bug when pjrt_device is cpu
* fix bug
* fix the deprecation handling
---------
Co-authored-by: anw90 <ang868@gmail.com>
Co-authored-by: wangang.wa <wangang.wa@alibaba-inc.com>
-
Tanay Mehta authored
* add: initial script to train clm fim
* fix: if training model from scratch, new tokens will be added and embeddings resized
* fix: fixed attention_mask errors when generating FIM data
* fix: file formatted using black
* add: run_fim_no_trainer.py and fixed some comments in run_fim.py
* add: added fim examples to the README.md and ran code fixup
* fix: little bug in both fim training scripts
* fix: remove comment from notebook and added a note on fim related params
* fix: minor typo in README
* add: suggested minor changes to README and run_fim.py
* add: gradient_accumulation_steps and gradient_checkpointing args
* add: improved model embedding resizing
* add: pad_to_multiple_of and attn_implementation params
* add: requested minor changes
* add: deepspeed zero compatibility
* add: resize embeddings layer with zero3 support for fim model initialization
-
- 21 Feb, 2024 1 commit
-
-
Arthur Zucker authored
-
- 19 Feb, 2024 2 commits
-
-
Arthur authored
* change version
* nuke
* this doesn't make sense
* update some requirements.py
* revert + no main
* nits
* change cache number
* more pin
* revert
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Jay Zhou authored
-
- 16 Feb, 2024 1 commit
-
-
Lysandre Debut authored
* Script & Manual edition * Update
-
- 12 Feb, 2024 2 commits
-
-
Klaus Hipp authored
Add language identifiers to code blocks
-
Alexey Fadeev authored
Updated datasets requirements. Need a package version >= 2.14.0
-
- 07 Feb, 2024 1 commit
-
-
Yih-Dar authored
* fix * fix * fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
- 02 Feb, 2024 1 commit
-
-
Klaus Hipp authored
* Fix typos and grammar mistakes in docs and examples
* Fix typos in docstrings and comments
* Fix spelling of `tokenizer` in model tests
* Remove erroneous spaces in decorators
* Remove extra spaces in Markdown link texts
-
- 01 Feb, 2024 1 commit
-
-
zspo authored
Co-authored-by: p_spozzhang <p_spozzhang@tencent.com>
-
- 30 Jan, 2024 1 commit
-
-
Matt authored
* Pin torch to <2.2.0
* Pin torchvision and torchaudio as well
* Playing around with versions to see if this helps
* twiddle something to restart the CI
* twiddle it back
* Try changing the natten version
* make fixup
* Revert "Try changing the natten version" This reverts commit de0d6592c35dc39ae8b5a616c27285db28262d06.
* make fixup
* fix fix fix
* fix fix fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
- 29 Jan, 2024 1 commit
-
-
Klaus Hipp authored
-
- 26 Jan, 2024 1 commit
-
-
Steven Liu authored
* change datasets * fix
-
- 22 Jan, 2024 2 commits
-
-
bofeng huang authored
* Fix lr_scheduler * Fix lr scheduler
-
jheitmann authored
-