- 07 Sep, 2022 14 commits
-
-
Steven Liu authored
* skip some code examples for doctests * make style * fix code snippet formatting * separate code snippet into two blocks
-
lewtun authored
-
Colin Dean authored
* Starts on a list of external deps required for dev I've found that I need to install MeCab manually on my AS Mac. * Generalizes OS nascent dependency list Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Yanming Wang authored
* Fix XLA fp16 and bf16 error checking * Update src/transformers/training_args.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Ankur Goyal authored
* [WIP] Skeleton of VisualQuestionAnsweringPipeline extended to support LayoutLM-like models * Fixup * Use the full encoding * Basic refactoring to DocumentQuestionAnsweringPipeline * Cleanup * Improve args, docs, and implement preprocessing * Integrate OCR * Refactor question_answering pipeline * Use refactored QA code in the document qa pipeline * Fix tests * Some small cleanups * Use a string type annotation for Image.Image * Update encoding with image features * Wire through the basic docs * Handle invalid response * Handle empty word_boxes properly * Docstring fix * Integrate Donut model * Fixup * Incorporate comments * Address comments * Initial incorporation of tests * Address Comments * Change assert to ValueError * Comments * Wrap `score` in float to make it JSON serializable * Incorporate AutoModelForDocumentQuestionAnswering changes * Fixup * Rename postprocess function * Fix auto import * Applying comments * Improve docs * Remove extra assets and add copyright * Address comments Co-authored-by: Ankur Goyal <ankur@impira.com>
-
Olatunji Ruwase authored
* [DeepSpeed] Fix performance degradation in sharded models * style * polish Co-authored-by: Stas Bekman <stas@stason.org>
-
Yih-Dar authored
* remove _create_and_check_torch_fx_tracing defined in specific model test files Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Joao Gante authored
-
Matt authored
* Update TF fine-tuning docs * Fix formatting * Add some section headers so the right sidebar works better * Squiggly it * Update docs/source/en/training.mdx Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/training.mdx Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/training.mdx Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/training.mdx Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/training.mdx Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/training.mdx Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/training.mdx Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/training.mdx Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/training.mdx Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/training.mdx Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/training.mdx Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/training.mdx Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/training.mdx Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Explain things in the text, not the comments * Make the two dataset creation methods into a list * Move the advice about collation out of a <Tip> * Edits for clarity * Edits for clarity * Edits for clarity * Replace `to_tf_dataset` with `prepare_tf_dataset` in the fine-tuning pages * Restructure the page a little bit * Restructure the page a little bit * Restructure the page a little bit Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Wang, Yi authored
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com> Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
-
Nicholas Broad authored
* add accelerator.end_training() Some trackers need this to end their runs. * fixup and quality * add space * add space again ?!?
-
Yih-Dar authored
* add check for scheduled CI * Add check to other CIs Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
NielsRogge authored
* Simplify code example * Add seed
-
- 06 Sep, 2022 8 commits
-
-
Ekagra Ranjan authored
Fix incorrect size of input for 1st strided window length in `Perplexity of fixed-length models` (#18906) * update the PPL for stride 512 * fix 1st strided window size * linting * fix typo * styling
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Sylvain Gugger authored
* Further reduce the number of calls to head for cached models/tokenizers/pipelines * Fix tests * Address review comments
-
Alara Dirik authored
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Ekagra Ranjan authored
* use tokenizer to output tensor * add preprocessing for decoder_input_ids for bare T5Model * add preprocessing to tf and flax * linting * linting * Update src/transformers/models/t5/modeling_flax_t5.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/models/t5/modeling_tf_t5.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/models/t5/modeling_t5.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
-
arun99481 authored
Co-authored-by: Arun Rajaram <arunrajaram@Aruns-MacBook-Pro.local>
-
Had authored
* add position bias head masking if heads pruned * fix pruning function in t5 encoder * make style * make fix-copies * Revert added folder Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
-
- 05 Sep, 2022 7 commits
-
-
Joao Gante authored
-
zkep authored
* update the translation
-
Sofia Oliveira authored
* Add type hints to XLM-Roberta-XL models * Format
-
Surya Prakash Sahu authored
-
Patrick von Platen authored
* add first generation tutorial * [Pegasus X] correct naming * [Generation] Remove
-
Lysandre Debut authored
-
Joao Gante authored
* bias as a layer * alias the bias (hah, it rhymes) * add comment with info
-
- 02 Sep, 2022 11 commits
-
-
Steven Liu authored
* 📝 update quicktour * 📝 add trainer section * 🖍 markdown table, apply feedbacks * ✨ make style * add tf training section * make style
-
Steven Liu authored
* 📝 first draft * oops add to toctree * make style * 📝 add inference section * 🖍 make style * 📝 add images * 🖍 apply feedbacks * remove num_labels and pytorch block * apply feedbacks, add colab notebook Co-authored-by: Steven <stevhliu@gmail.com>
-
Steven Liu authored
* 📝 update docs landing page * 🖍 apply feedbacks * apply feedbacks * apply feedbacks, use <br> for list
-
Jason Phang authored
* PegasusX Initial commit * rename * pegasus X implementation * pegx update * pegx fix * pegasus-x fixes * pegx updates * cleanup * cleanup * cleanup * tests * stylefixes * Documentation update * Model hub fix * cleanup * update * update * testfix * Check fix * tweaks for merging * style * style * updates for pr * style * change pegasus-x repo
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Joao Gante authored
-
Stas Bekman authored
-
Sylvain Gugger authored
* Fix number of examples for iterable datasets in multiprocessing * Add stronger check
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Sylvain Gugger authored
* Clean up utils.hub using the latest from hf_hub * Adapt test * Address review comment * Fix test
-