- 14 Feb, 2022 3 commits
-
-
Toni Kukurin authored
-
Zhen Wang authored
-
Yih-Dar authored
* Fix wrong checkpoint name: vit * Fix missing import * Fix more missing import * make style * Apply suggestions from code review Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com> Co-authored-by:
ydshieh <ydshieh@users.noreply.github.com> Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com>
-
- 11 Feb, 2022 12 commits
-
-
Sylvain Gugger authored
* Add push to hub to feature extractor * Quality * Clean up
-
Daniel Erenrich authored
"to make ensure" is redundant.
-
Sylvain Gugger authored
* Rework AutoFeatureExtractor.from_pretrained internal * Custom feature extractor * Add more tests * Add support for custom feature extractor code * Clean up
-
Stas Bekman authored
* [research_projects] deal with security alerts * add a note of the original PL ver and warning
-
Stas Bekman authored
* [deepspeed docs] round_robin_gradients * training and/or eval/predict loss is * Update docs/source/main_classes/deepspeed.mdx Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Sylvain Gugger authored
-
Steven Liu authored
-
Joao Gante authored
* Fix TF MT5 vocab resize * more assertive testing
-
Mishig Davaadorj authored
-
Joao Gante authored
* Add informative warning
-
lewtun authored
-
Sylvain Gugger authored
-
- 10 Feb, 2022 9 commits
-
-
Patrick von Platen authored
-
Ngo Quang Huy authored
-
lewtun authored
* Add local and TensorFlow ONNX export examples to docs * Use PyTorch - TensorFlow split
-
NielsRogge authored
Co-authored-by:Niels Rogge <nielsrogge@Nielss-MBP.localdomain>
-
Yih-Dar authored
* Compute loss independent from decoder (as 14139) * fix expected seq_len + style * Apply the same change to TFVisionEncoderDecoderModel * fix style * Add case with labels in equivalence test * uncomment * Add case with labels in equivalence test * add decoder_token_labels * use hf_compute_loss * Apply suggestions from code review Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Add copied from Co-authored-by:
ydshieh <ydshieh@users.noreply.github.com> Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com>
-
Patrick von Platen authored
-
Alberto B茅gu茅 authored
* Add TensorFlow support for ONNX export * Change documentation to mention conversion with Tensorflow * Refactor export into export_pytorch and export_tensorflow * Check model's type instead of framework installation to choose between TF and Pytorch Co-authored-by:
Lysandre Debut <lysandre@huggingface.co> Co-authored-by:
Alberto B茅gu茅 <alberto.begue@della.ai> Co-authored-by:
lewtun <lewis.c.tunstall@gmail.com>
-
Lysandre authored
-
Sylvain Gugger authored
-
- 09 Feb, 2022 16 commits
-
-
Sylvain Gugger authored
* Expand tutorial for custom models * Style * Apply suggestions from code review Co-authored-by:
Lysandre Debut <lysandre.debut@reseau.eseo.fr> Co-authored-by:
Lysandre Debut <lysandre.debut@reseau.eseo.fr>
-
NielsRogge authored
Co-authored-by:Niels Rogge <nielsrogge@Nielss-MBP.localdomain>
-
Stas Bekman authored
* [trainer docs] document how to select specific gpus * expand * add urls * add accelerate launcher
-
Yih-Dar authored
Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Sylvain Gugger authored
* Expose hub test problem * Fix tests
-
Sylvain Gugger authored
-
Sylvain Gugger authored
-
Chan Woo Kim authored
* added classes to get started with constrained beam search * in progress, think i can directly force tokens now but not yet with the round robin * think now i have total control, now need to code the bank selection * technically works as desired, need to optimize and fix design choices leading to undersirable outputs * complete PR #1 without disjunctive decoding * removed incorrect tests * Delete k.txt * Delete test.py * Delete test.sh * revert changes to test scripts * genutils * full implementation with testing, no disjunctive yet * shifted docs * passing all tests realistically ran locally * removing accidentally included print statements * fixed source of error in initial PR test * fixing the get_device() vs device trap * fixed documentation docstrings about constrained_beam_search * fixed tests having failing for Speech2TextModel's floating point inputs * fix cuda long tensor * added examples and testing for them and founx & fixed a bug in beam_search and constrained_beam_search * deleted accidentally added test halting code with assert False * code reformat * Update tests/test_generation_utils.py Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com> * Update tests/test_generation_utils.py Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com> * Update tests/test_generation_utils.py Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com> * Update tests/test_generation_utils.py Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com> * Update tests/test_generation_utils.py * fixing based on comments on PR * took out the testing code that should but work fails without the beam search moditification ; style changes * fixing comments issues * docstrings for ConstraintListState * typo in PhrsalConstraint docstring * docstrings improvements Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com>
-
Clara Meister authored
* typical decoding * changing arg name * add test config params * forgotten arg rename * fix edge case where scores are same * test for typical logits warper * code quality fixes
-
Suraj Patil authored
-
Lysandre Debut authored
-
Sanchit Gandhi authored
* Add Wav2Vec2 Adapter Weights to Flax * Suggested changes
-
Sylvain Gugger authored
* Make sure custom configs work with Transformers * Apply code review suggestions
-
Lysandre Debut authored
* Upgrade black to version ~=22.0 * Check copies * Fix code
-
Leandro von Werra authored
* add model scaling section * Apply suggestions from code review Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * integrate reviewer feedback * initialize GPU properly * add note about BnB optimizer * move doc from `scaling.mdx` to `performance.mdx` * integrate reviewer feedback * revert section levels Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Sylvain Gugger authored
* PoC for a ProcessorMixin class * Documentation * Apply suggestions from code review Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com> Co-authored-by:
Suraj Patil <surajp815@gmail.com> Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com> * Roll out to other processors * Add base feature extractor class in init * Use args and kwargs Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com> Co-authored-by:
Suraj Patil <surajp815@gmail.com> Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com>
-