- 11 Nov, 2021 2 commits
-
-
Suraj Patil authored
* fix inits * fix embed dtype * fix embed dtype * add test to check default dtype * quality * add type conversion methods for flax models * more robust casting * cast sinusoidal positions * update pegasus * update albert * update test * make sure dtype is passed to every module * style * fix electra dense * fix t5 * quality * add more tests * better name * use the dtype for lm head computation * fix albert * style * fix albert embed dtype * more tests * fix vision enc-dec * cleanup * fix embed dtype pegasus * fix default param test * doc * update template * fix final_logits_bias dtype * Apply suggestions from code review Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com> * fix doc * fix doc * add detailed docstring for dtype parameter * remove un-necessary import Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com>
-
Stas Bekman authored
-
- 10 Nov, 2021 6 commits
-
-
Li-Huai (Allan) Lin authored
* Fix index out of range when padding * Apply suggestions from code review * Style
-
Chang Wang authored
-
Ella Charlaix authored
* Add notebook applying Intel Neural Compressor quantization for text classification tasks * Add Optimum notebooks section
-
Li-Huai (Allan) Lin authored
* Fix albert mask token tokenization. * Ensure special tokans sanitized. * Style * Fix * Apply suggestions from code review
-
Nicolas Patry authored
* Adding some quality of life for `pipeline` function. * Update docs/source/main_classes/pipelines.rst Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/pipelines/__init__.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Improve the tests. Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Elad Segal authored
`BatchFeature`: Convert `List[np.ndarray]` to `np.ndarray` before converting to pytorch tensors (#14306) * update * style fix * retrigger checks * check first element * fix syntax error * Update src/transformers/feature_extraction_utils.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * remove import Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
- 09 Nov, 2021 10 commits
-
-
Sylvain Gugger authored
-
Patrick von Platen authored
* [Bert2Bert] allow bert2bert + relative embeddings * up * Update README_ko.md * up * up
-
Steven Liu authored
* rewrite guides for fine-tuning with datasets * simple qa code example * use anonymous rST links * style
-
Suraj Patil authored
-
Yih-Dar authored
Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
karthikrangasai authored
* Update postporcessing accordingly to use SQuAD metric. * Update assets accordingly based on SQuAD metrics. * Fix function naming error.
-
Yih-Dar authored
* Start the work for TFViTModel * Convert to TF code - need to check in the follow up commits * Clean up model code * Expose TFViTModel * make style * make quality * Add test * make style & quality * Fix some imports * fix wrong usage - *kwargs => ** kwargs * Fix Conv2D weight loading (PT->TF) issue * Add tests for images with different sizes + fix model * Fix some common tests for TFViTModel * Use inputs instead of input_ids in test_compile_tf_model * Add a comment about transpose and Conv2D in convert_tf_weight_name_to_pt_weight_name * Avoid transpose in TFViT call * Fix Conv2D issue in load_tf2_weights_in_pytorch_model * Use tf.keras.layers.Conv2D instead of tf.nn.conv2d * Using simpler heuristic to detect Conv2D layer * Change convert_tf_weight_name_to_pt_weight_name to return TransposeType * Check tf_weight_shape is not None before using it * Apply suggestions from code review Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * fix missing comma * fix input dtype Co-authored-by:
ydshieh <ydshieh@users.noreply.github.com> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Apoorv Garg authored
* correct order of overflowing tokens for LayoutLmV2 tokenizer * test to check order of overflowing_tokens for a seq of input_ids * fix up quality * added suggested changes * check that tests the bbox sequence * pair_input test added * pass quality test * check bbox sequence added * unittest method * comments added * add overflowing bbox test * improved "seq_1" Co-authored-by:
SaulLu <55560583+SaulLu@users.noreply.github.com> * improve code quality Co-authored-by:
SaulLu <lucilesaul.com@gmail.com> Co-authored-by:
SaulLu <55560583+SaulLu@users.noreply.github.com>
-
Yih-Dar authored
* Start the work on FlaxVisionEncoderDecoderModel * Add FlaxVisionEncoderDecoderModel * Add VisionEncoderDecoderConfig * Make FlaxVisionEncoderDecoderModel visible to transformers * Add test * Fix wrong getattr usage * Fix tests * Add FlaxAutoModelForVision2Seq * Expose FLAX_MODEL_FOR_VISION_2_SEQ_MAPPING * clean-up * add integration test * update expected logits * update expected scores * Add ViT2GPT2ModelIntegrationTest + some cleaning * Add projection layer + PT/Flax equivalence tests * Fix import * minor changes * make test slow again * Apply suggestions * Add modeling_flax_vision_encoder_decoder to _ignore_modules in get_model_modules() * fix copies * Apply suggestions from code review Co-authored-by:
Suraj Patil <surajp815@gmail.com> * split long strings in multiple lines * decoder_input_ids can't be None * Add back test_configuration_tie * Remove attention_mask parameter * fix test - encoder_last_hidden_state should be encoder_outputs.last_hidden_state instead of the projected vector * Apply suggestions from code review Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com> * Remove more encoder_attention_mask * remove encoder_attention_mask when calling self.decode (in FlaxVisionEncoderDecoderModule) * Fix style + pass 1s instead of None as encoder_attention_mask * fix init_weights * pass None for encoder_attention_mask * pass 1s instead of None as encoder_attention_mask * Fix doc style Co-authored-by:
ydshieh <ydshieh@users.noreply.github.com> Co-authored-by:
Suraj Patil <surajp815@gmail.com> Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com>
-
Reza Yazdani authored
* minor modification to the wav2vec2 modeling file to support tensor-parallelism with DeepSpeed on this HuggingFace model * refine the comments * synch changes * fix comments * refine comments * fix format
-
- 08 Nov, 2021 8 commits
-
-
Jeff Rasley authored
* defer to DS_TEST_PORT if set * style Co-authored-by:Stas Bekman <stas@stason.org>
-
Sylvain Gugger authored
* Dynamic configs * Add config test * Better tests * Add tokenizer and test * Add to from_config * With save
-
nbertagnolli authored
* Changed relative imports to absolute to allow convert_graph_to_onnx.py to be run as a script * isorted code
-
Nicolas Patry authored
* Fixing mutable default argument. * XX. * Revert "XX." This reverts commit 61d4bb333f6d39a7fbe31d161b8bd14787ceec2e.
-
Nicolas Patry authored
* Fixing tests on master. * Better fix. * Lxmert doesn't have feature extractor but is bimodal.
-
Anton Lozhkov authored
* Add paddings to input shapes * Add padding comment
-
Anton Lozhkov authored
-
Patrick von Platen authored
-
- 06 Nov, 2021 4 commits
-
-
Junbum Lee authored
-
NielsRogge authored
-
Nicolas Patry authored
`start_` and `end_` tensors now contain a batch_size at this point.
-
Xing Han Lu authored
-
- 05 Nov, 2021 3 commits
-
-
Sylvain Gugger authored
-
Patrick von Platen authored
-
Sylvain Gugger authored
* Pin TF until tests are fixed * Also pin TF CPU
-
- 04 Nov, 2021 4 commits
-
-
Matt authored
* Removing Keras version pinning * make fixup
-
Chang Wang authored
-
Nicolas Patry authored
Fixes #14272
-
Nicolas Patry authored
* Fiixng slow pipeline tests * Remove the image-segmentaiton override. * Fixing clamping only in training. * Wav2vec2. * Remove last mention of `no_grad`. * Fixing copies. * Rename.
-
- 03 Nov, 2021 3 commits
-
-
Sylvain Gugger authored
* Add more instructions to the release guide * Apply suggestions from code review Co-authored-by:
Lysandre Debut <lysandre@huggingface.co> * Address review comment Co-authored-by:
Lysandre Debut <lysandre@huggingface.co>
-
Sylvain Gugger authored
* Start PR doc * Cleanup the quality checks and document them * Add reference in the contributing guide * Apply suggestions from code review Co-authored-by:
Stas Bekman <stas00@users.noreply.github.com> * Rename file as per review suggestion Co-authored-by:
Stas Bekman <stas00@users.noreply.github.com>
-
Sylvain Gugger authored
* Pin Keras cause they messed their release * Put != instead of < * Try this way * Back to the beginning but more agressive
-