- 19 Jan, 2023 19 commits
-
-
Sylvain Gugger authored
* Add disclaimer for necessary fake models * Address review comments * Use for GPT-NeoX as well
-
Cl茅mentine Fourrier authored
* [FT] First commit for graphormer architecture. The model has no tokenizer, as it uses a collator and preprocessing function for its input management. Architecture to be tested against original one. The arch might need to be changed to fit the checkpoint, but a revert to the original arch will make the code less nice to read. TODO: doc * [FIX] removed test model * [FIX] import error * [FIX] black and flake * [DOC] added paper refs * [FIX] [DOC] * [FIX] black * [DOC] Updated READMEs * [FIX] Order of imports + rm Tokenizer calls * [FIX] Moved assert in class to prevent doc build failure * [FIX] make fix-copies * [Doc] update from code review Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * [FIX] Removed Graphormer from Sequence classification model list * [DOC] Added HF copyright to Cython file * [DOC] Fixed comments * [FIX] typos in class doc + removed config classes. Todo: update doc from paper definitions * [FIX] Removed dependency to fairseq, and replaced all asserts with Exception management * [FIX] Homogeneized initialization of weights to pretrained constructor * [FIX] [CP] Updated multi_hop parameter to get same results as in original implementation * [DOC] Relevant parameter description in the configuration file * [DOC] Updated doc and comments in main graphormer file * [FIX] make style and quality checks * [DOC] Fix doc format * [FIX] [WIP] Updated part of the tests, though still a wip * [FIX] [WIP] * [FIX] repo consistency * [FIX] Changed input names for more understandability * [FIX] [BUG] updated num_classes params for propagation in the model * simplified collator * [FIX] Updated tests to follow new naming pattern * [TESTS] Updated test suite along with model * |FIX] rm tokenizer import * [DOC] add link to graphormerdoc * Changed section in doc from text model to graph model * Apply suggestions from code review Spacing, inits Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * [DOC] Explain algos_graphormer functions * Cython soft import protection * Rm call to Callable in configuration graphormer * [FIX] replaced asserts with Exceptions * Add org to graphormer checkpoints * Prefixed classes with Graphormer * Management of init functions * format * fixes * fix length file * update indent * relaunching ci * Errors for missing cython imports * fix style * fix style doc Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
ydshieh authored
-
Kambe Hiroyuki authored
* Add Japanese translation index.mdx * Fix the year of the license * Change the models list to Japanese
-
Joao Gante authored
-
Younes Belkada authored
fix cvt init
-
Karim Foda authored
* Add hallucination penalty * Make quality changes * Inverse penalty * Fix imports & quality * Fix name spelling issue * set encoder_repetition_penalty and fix quality * Fix failing test * Add to config_common_kwargs * Fix modelling_rag error * Update src/transformers/generation_logits_process.py Co-authored-by:
Joao Gante <joaofranciscocardosogante@gmail.com> * Remove breakpoint * Make style fixes * Update encoder_repetition_penalty default value * Merge latest main changes * Make fixup changes * Add EncoderRepetitionPenaltyLogitsProcessor to generation/__init__.py * Fix repo-inconsistency * Remove venv * Remove tensorflow-macos & add tests * Add documentation * Fix quality issues * move encoder_repetition_penalty to config * Update src/transformers/configuration_utils.py Co-authored-by:
Joao Gante <joaofranciscocardosogante@gmail.com> * Update src/transformers/generation/configuration_utils.py Co-authored-by:
Joao Gante <joaofranciscocardosogante@gmail.com> * Remove encoder_repetition_penalty from tests * Fix type error * Fix format error Co-authored-by:
Joao Gante <joaofranciscocardosogante@gmail.com>
-
Arthur authored
* add draft logit processor * add template functions * update timesapmt processor parameters * draft script * simplify code * cleanup * fixup and clean * update pipeline * style * clean up previous idea * add tokenization utils * update tokenizer and asr output * fit whisper type * style and update test * clean test * style test * update tests * update error test * udpate code (not based on review yet) * update tokenization * update asr pipeline * update code * cleanup and update test * fmt * remove text verificatino * cleanup * cleanup * add model test * update tests * update code add docstring * update code and add docstring * fix pipeline tests * add draft logit processor add template functions update timesapmt processor parameters draft script simplify code cleanup fixup and clean update pipeline style clean up previous idea add tokenization utils update tokenizer and asr output fit whisper type style and update test clean test style test update tests update error test udpate code (not based on review yet) update tokenization update asr pipeline update code cleanup and update test fmt remove text verificatino cleanup cleanup add model test update tests update code add docstring update code and add docstring fix pipeline tests * Small update. * Fixup. * Tmp. * More support. * Making `forced_decoder_ids` non mandatory for users to set. * update and fix first bug * properly process sequence right after merge if last * tofo * allow list inputs + compute begin index better * start adding tests * add the 3 edge cases * style * format sequences * fixup * update * update * style * test passes, edge cases should be good * update last value * remove Trie * update tests and expec ted values * handle bigger chunk_length * clean tests a bit * refactor chunk iter and clean pipeline * update tests * style * refactor chunk iter and clean pipeline * upade * resolve comments * Apply suggestions from code review Co-authored-by:
Nicolas Patry <patry.nicolas@protonmail.com> * take stride right into account * update test expected values * Update code based on review Co-authored-by:
sgugger <sylvain.gugger@gmail.com> * major refactor * add correct strides for tests * Update src/transformers/pipelines/automatic_speech_recognition.py * fix whisper timestamp test Co-authored-by:
Nicolas Patry <patry.nicolas@protonmail.com> Co-authored-by:
sgugger <sylvain.gugger@gmail.com>
-
Matthijs Hollemans authored
-
amyeroberts authored
* Update examples to use image processors * Small fixes * Resolve conflicts
-
amyeroberts authored
-
Maria Khalusova authored
* Extended the CV preprocessing section with more details and refactored the example * added padding to the CV section, though it is a special case * Added a tip about post processing methods * make style * link update * Apply suggestions from review Co-authored-by:
Steven Liu <59462357+stevhliu@users.noreply.github.com> * review feedback Co-authored-by:
Steven Liu <59462357+stevhliu@users.noreply.github.com>
-
Yih-Dar authored
fix device Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Sylvain Gugger authored
-
Matthijs Hollemans authored
-
Yih-Dar authored
* update year Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
fix Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Jitesh Jain authored
* Add Oneformer Model * Add OneFormer Tests * Add UNIVERSAL_SEGMENTATION_MAPPING * Fix config *
馃悰 Fix error encountered while writing tests *馃敤 Fix instance segmentation post processing * Format Files and Add Documentation * Add Documentation mdx file * Run make fixup * Run make fix-copies * Remove unnecessary code * Format modeling_oneformer.py * Add OneFormer to ImageSegmentationPipeline * Format files * Add Demo link to Readme * Fix fomatting errors * Fix test failures * Update Table in index.mdx * Fix version * Fix style * Remove OneFormer from TF * Fix Imports * Fix dummy objects * Fix tests * Add newline * Remove OneFormerFeatureExtractor * Remove CUDA Kernels * Use AutoBackbone for Swin * Fix description * Use Image Processor * Fix copies * Fix formatting * Fix import order * Fix flake8 errors * Fix doc errors * Add Hindi Readme entry * Update supported backbones * Update supported backbones * Undo Changes * Fix type of config * Fix isort * Fix auto.mdx * Fix swin config * Replace DinatBackbone with AutoBackbone * Use SwinBackbone * Use SwinBackbone * Fix conversion script * Fix arguments * Add argument description * Fix style * Add OneFormerProcessor * Fix OneFormerProcessor Tests * Fix mapping * Fix imports * Fix inits * Fix style * Fix comment * Fix docstring * Move OneFormer to MultiModal * Fix Copies * Remove size divisor * Fix check_repo.py * Fix copies * Add Processor for Testing Pipeline * Fix padding for tokens * Fix variables * Fix formatting with correct black version * Add Image Processor Test * Apply suggestions * Revert common modeling * Add check for task * Fix conversion script * Fix initialization order * Fix tests * Undo Pipeline Changes * Fix layers in MLP * Fix copies * Update image paths * Fix copies * Apply suggestions -
Stas Bekman authored
* [issues template] update deepspeed owners add the right contact for deepspeed@accelerate * pr-template
-
- 18 Jan, 2023 16 commits
-
-
Matt authored
* Rewrite a couple of lines in the TF XLA doc to explain that jit_compile can be used in model.compile() too * Remove extra )
-
jeffhataws authored
* Add XLA torchrun support * Clarify that currently DDP doesn't work with torch.distributed XLA backend yet * Enable DDP with torchrun and XLA (now available in PT-XLA 1.13) * Add check for AWS Neuron availability and AWS Neuron specific compiler flag * Change the new test's name to TestTrainerDistributedNeuronCore * Remove "assert" and replace raised exception * Remove compiler flag as it is optional. If needed, will be another PR. * Use TORCHELASTIC_RUN_ID to determine whether torchrun is used
-
dependabot[bot] authored
Bump future in /examples/research_projects/visual_bert Bumps [future](https://github.com/PythonCharmers/python-future) from 0.18.2 to 0.18.3. - [Release notes](https://github.com/PythonCharmers/python-future/releases) - [Changelog](https://github.com/PythonCharmers/python-future/blob/master/docs/changelog.rst) - [Commits](https://github.com/PythonCharmers/python-future/compare/v0.18.2...v0.18.3 ) --- updated-dependencies: - dependency-name: future dependency-type: direct:production ... Signed-off-by:
dependabot[bot] <support@github.com> Signed-off-by:
dependabot[bot] <support@github.com> Co-authored-by:
dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
dependabot[bot] authored
Bumps [future](https://github.com/PythonCharmers/python-future) from 0.18.2 to 0.18.3. - [Release notes](https://github.com/PythonCharmers/python-future/releases) - [Changelog](https://github.com/PythonCharmers/python-future/blob/master/docs/changelog.rst) - [Commits](https://github.com/PythonCharmers/python-future/compare/v0.18.2...v0.18.3 ) --- updated-dependencies: - dependency-name: future dependency-type: direct:production ... Signed-off-by:
dependabot[bot] <support@github.com> Signed-off-by:
dependabot[bot] <support@github.com> Co-authored-by:
dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
Sylvain Gugger authored
* Adapt repository creation to latest hf_hub * Update all examples * Fix other tests, add Flax examples * Address review comments
-
Yih-Dar authored
* fix Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Pengfei Liu authored
* using raw string for regex to search <extra_id> * fix the same issue in test file:`tokenization_t5.py`
-
Wang, Yi authored
"TypeError: unhashable type: 'slice'" Signed-off-by:
Wang, Yi A <yi.a.wang@intel.com> Signed-off-by:
Wang, Yi A <yi.a.wang@intel.com>
-
Peter Lin authored
* Fix git model for generate with beam search. * Update comment * Fix bug on multi batch * Add generate tests * Clean up tests * Fix style Co-authored-by:Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
-
Joao Gante authored
* Fix Flax OPT numerical masking * re-enable test * add fix to bart and reintroduce copied from in opt
-
Jordi Mas authored
* Fix typos in documentation * Small fix * Fix formatting
-
Samuel Xu authored
* Added flax model code * Added tf changes * missed some * Added copy comments * Added style hints * Fixed copy statements * Added suggested fixes * Made some fixes * Style fixup * Added necessary copy statements * Fixing copy statements * Added more copies * Final copy fix * Some bugfixes * Adding imports to init * Fixed up all make fixup errors * Fixed doc errors * Auto model changes
-
Younes Belkada authored
* `blip` support for training * remove labels creation * remove unneeded `decoder_input_ids` creation * final changes - add colab link to documentation - reduction = mean for loss * fix nits * update link * clearer error message
-
Yih-Dar authored
Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Shogo Hida authored
* Create toctree for Japanese translations Signed-off-by:
Shogo Hida <shogo.hida@gmail.com> * Copy English version Signed-off-by:
Shogo Hida <shogo.hida@gmail.com> * Add Japanese translations Signed-off-by:
Shogo Hida <shogo.hida@gmail.com> * Add Japanese translations Signed-off-by:
Shogo Hida <shogo.hida@gmail.com> Signed-off-by:
Shogo Hida <shogo.hida@gmail.com>
-
Wonhyeong Seo authored
docs: ko: installation.mdx
-
- 17 Jan, 2023 5 commits
-
-
layjain authored
* Fixed num_channels!=3 normalization training * empty commit to trigger CI * Empty-Commit for CircleCI * Empty-Commit * Empty Commit try-3: https://discuss.circleci.com/t/github-code-checkout-suddenly-failing/31558 * Empty commit to trigger CI Co-authored-by:
Lay Jain <layjain@basil.csail.mit.edu> Co-authored-by:
ydshieh <ydshieh@users.noreply.github.com>
-
Sherman Siu authored
* Add epsilon- and eta-sampling. Add epsilon- and eta-sampling, following the official code from https://github.com/john-hewitt/truncation-sampling and adapting to be more configurable, as required by Huggingface transformers. * Add unit tests for epsilon- and eta-sampling. * Black: fix code formatting. * Fix docstring spacing. * Clean up newlines. * Fix implementation bugs and their associated tests. * Remove epsilon- and eta-sampling parameters from PretrainedConfig. * Clarify and clean up the documentation. * Remove parameters for PretrainedConfig test.
-
Maria Khalusova authored
* initial commit, refactoring the text generation api reference * removed repetitive code examples * Refactoring the text generation docs to reduce repetition * make style
-
Maria Khalusova authored
* Part of the "text generation" rework: adding a high-level overview of the text generation strategies * code samples update via make style * fixed a few formatting issues * Apply suggestions from review Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * fixed spaces, and switched two links to markdown * Apply Steven's suggestions from review Co-authored-by:
Steven Liu <59462357+stevhliu@users.noreply.github.com> * new lines after headers to fix link rendering * review feedback addressed. added links to image captioning and audio transcription examples * minor capitalization fix * addressed the review feedback * Apply suggestions from review Co-authored-by:
Joao Gante <joaofranciscocardosogante@gmail.com> * Applied review suggestions Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Steven Liu <59462357+stevhliu@users.noreply.github.com> Co-authored-by:
Joao Gante <joaofranciscocardosogante@gmail.com>
-
Maria Khalusova authored
* Added TF example for image classification * Code style polishing * code style polishing * minor polishing * fixed a link in a tip, and a typo in the inference TF content * Apply Amy's suggestions from review Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update docs/source/en/tasks/image_classification.mdx Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com> * review feedback addressed * make style * added PushToHubCallback with save_strategy="no" * minor polishing * added PushToHubCallback with save_strategy=no * minor polishing * Update docs/source/en/tasks/image_classification.mdx * added data augmentation Co-authored-by:
Sayak Paul <spsayakpaul@gmail.com> * make style Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com> Co-authored-by:
Sayak Paul <spsayakpaul@gmail.com>
-