- 24 Aug, 2022 3 commits
-
-
Juyoung Kim authored
* fixed docstring typos * Added missing colon Co-authored-by:김주영 <juyoung@zezedu.com>
-
Daniel Stancl authored
* Add TFXGLM models * Add todo: self.supports_xla_generation = False Co-authored-by:
Daniel Stancl <stancld@Daniels-MacBook-Pro.local> Co-authored-by:
Daniel Stancl <stancld@daniels-mbp.home> Co-authored-by:
Joao Gante <joaofranciscocardosogante@gmail.com> Co-authored-by:
Daniel <daniel.stancl@rossum.ai> Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com>
-
Constantin Hütterer authored
* Add minor doc-string change to include hp_name * fix: missing type-information for kwargs * fix: missing white-space in hyperparameter_search doc-strings
-
- 23 Aug, 2022 4 commits
-
-
Joao Gante authored
-
SaulLu authored
* improve add_tokens documentation * format
-
Nicolas Patry authored
and friends.
-
Yih-Dar authored
Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
- 22 Aug, 2022 1 commit
-
-
Yih-Dar authored
Co-authored-by:
lewtun <lewis.c.tunstall@gmail.com> Co-authored-by:
ydshieh <ydshieh@users.noreply.github.com>
-
- 19 Aug, 2022 1 commit
-
-
Patrick von Platen authored
* add first generation tutorial * [Circle CI] Temporary fix for broken detectron2 import * remove generation
-
- 18 Aug, 2022 7 commits
-
-
Severin Simmler authored
* Fix quantization * Save model * Remove unused comments * Fix formatting
-
lewtun authored
-
regisss authored
-
amyeroberts authored
-
Joao Gante authored
-
Patrick von Platen authored
* add first generation tutorial * [LongT5 Docs] Correct docs * correct expected string * remove incorrect file
-
Matt authored
* Allow users to force TF availability * Correctly name the envvar!
-
- 17 Aug, 2022 2 commits
-
-
amyeroberts authored
* Update methods to optionally rescale This is necessary to allow for casting our images / videos to numpy arrays within the feature extractors' call. We want to do this to make sure the behaviour is as expected when flags like are False. If some transformations aren't applied, then the output type can't be unexpected e.g. a list of PIL images instead of numpy arrays. * Cast images to numpy arrays in call to enable consistent behaviour with different configs * Remove accidental clip changes * Update tests to reflect the scaling logic We write a generic function to handle rescaling of our arrays. In order for the API to be intuitive, we take some factor c and rescale the image values by that. This means, the rescaling done in normalize and to_numpy_array are now done with array * (1/255) instead of array / 255. This leads to small differences in the resulting image. When testing, this was in the order of 1e-8, and so deemed OK
-
Jingya HUANG authored
-
- 16 Aug, 2022 2 commits
-
-
Joao Gante authored
-
Sourab Mangrulkar authored
* mac m1 `mps` integration * Update docs/source/en/main_classes/trainer.mdx Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * addressing comments * Apply suggestions from code review Co-authored-by:
Dan Saattrup Nielsen <47701536+saattrupdan@users.noreply.github.com> * resolve comment Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Dan Saattrup Nielsen <47701536+saattrupdan@users.noreply.github.com>
-
- 14 Aug, 2022 1 commit
-
-
Karim Foda authored
* [Flax] Add remat (gradient checkpointing) * fix variable naming in test * flip: checkpoint using a method * fix naming * fix class naming * apply PVP's suggestions from code review * add gradient_checkpointing to examples * Add gradient_checkpointing to run_mlm_flax * Add remat to longt5 * Add gradient checkpointing test longt5 * Fix args errors * Fix remaining tests * Make fixup & quality fixes * replace kwargs * remove unecessary kwargs * Make fixup changes * revert long_t5_flax changes * Remove return_dict and copy to LongT5 * Remove test_gradient_checkpointing Co-authored-by:sanchit-gandhi <sanchit@huggingface.co>
-
- 12 Aug, 2022 10 commits
-
-
Stas Bekman authored
* [fsmt] deal with -100 indices in decoder ids Fixes: https://github.com/huggingface/transformers/issues/17945 decoder ids get the default index -100, which breaks the model - like t5 and many other models add a fix to replace -100 with the correct pad index. For some reason this use case hasn't been used with this model until recently - so this issue was there since the beginning it seems. Any suggestions to how to add a simple test here? or perhaps we have something similar already? user's script is quite massive. * style
-
Niklas Muennighoff authored
* Update BLOOM parameter counts * Update BLOOM parameter counts
-
NielsRogge authored
* First draft * Improve script * Update script * Make conversion work * Add final_layer_norm attribute to Swin's config * Add DonutProcessor * Convert more models * Improve feature extractor and convert base models * Fix bug * Improve integration tests * Improve integration tests and add model to README * Add doc test * Add feature extractor to docs * Fix integration tests * Remove register_buffer * Fix toctree and add missing attribute * Add DonutSwin * Make conversion script work * Improve conversion script * Address comment * Fix bug * Fix another bug * Remove deprecated method from docs * Make Swin and Swinv2 untouched * Fix code examples * Fix processor * Update model_type to donut-swin * Add feature extractor tests, add token2json method, improve feature extractor * Fix failing tests, remove integration test * Add do_thumbnail for consistency * Improve code examples * Add code example for document parsing * Add DonutSwin to MODEL_NAMES_MAPPING * Add model to appropriate place in toctree * Update namespace to appropriate organization Co-authored-by:Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
-
Younes Belkada authored
* Supporting seq2seq models for `bitsandbytes` integration - `bitsandbytes` integration supports now seq2seq models - check if a model has tied weights as an additional check * small modification - tie the weights before looking at tied weights!
-
Joao Gante authored
* validate generate model_kwargs * generate tests -- not all models have an attn mask
-
Yih-Dar authored
Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Sourab Mangrulkar authored
-
Stas Bekman authored
-
Ian Castillo authored
* Add type hints for Vilt models * Add missing return type for TokenClassification class
-
Arthur authored
* initial commit * add small test * add cross pt tf flag to test * fix quality * style * update test with new repo * fix failing test * update * fix wrong param ordering * style * update based on review * update related to recent new caching mechanism * quality * Update based on review Co-authored-by:
sgugger <sylvain.gugger@gmail.com> * quality and style * Update src/transformers/modeling_flax_utils.py Co-authored-by:
sgugger <sylvain.gugger@gmail.com> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
- 11 Aug, 2022 9 commits
-
-
amyeroberts authored
-
Alara Dirik authored
-
Sylvain Gugger authored
* Fix docstrings with last version of hf-doc-builder styler * Remove empty Parameter block
-
Michael Benayoun authored
* Support audio classification architectures for labels generation, as well as provides a flag to print warnings or not * Use ENV_VARS_TRUE_VALUES
-
iiLaurens authored
* Fix critical trace warnings to allow ONNX export * Force input to `sqrt` to be float type * Cleanup code * Remove unused import statement * Update model sew * Small refactor Co-authored-by:
Michael Benayoun <mickbenayoun@gmail.com> * Use broadcasting instead of repeat * Implement suggestion Co-authored-by:
Michael Benayoun <mickbenayoun@gmail.com> * Match deberta v2 changes in sew_d * Improve code quality * Update code quality * Consistency of small refactor * Match changes in sew_d Co-authored-by:
Michael Benayoun <mickbenayoun@gmail.com>
-
Dan Jones authored
Change BartLearnedPositionalEmbedding's forward method signature to support Opacus training (#18486) * changing BartLearnedPositionalEmbedding forward signature and references to it * removing debugging dead code (thanks style checker) * blackened modeling_bart file * removing copy inconsistencies via make fix-copies * changing references to copied signatures in Bart variants * make fix-copies once more * using expand over repeat (thanks @michaelbenayoun) * expand instead of repeat for all model copies Co-authored-by:Daniel Jones <jonesdaniel@microsoft.com>
-
Wonseok Lee (Jack) authored
* fix typos * fix sequence_length docs of LayoutLMv3Model * delete trailing white spaces * fix layoutlmv3 docs more * apply make fixup & quality * change to two versions of input docstring * apply make fixup & quality
-
Alara Dirik authored
* Fixes resizing bug in OWL-ViT * Defaults to square resize if size is set to an int * Sets do_center_crop default value to False
-
Maxime G authored
* Segformer TF: fix output size in doc * Segformer pytorch: fix output size in doc Co-authored-by:Maxime Gardoni <maxime.gardoni@ecorobotix.com>
-