- 14 Sep, 2022 6 commits
-
-
NielsRogge authored
* Clarify cookiecutter questions * Update first question Co-authored-by:Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
-
Sylvain Gugger authored
* Make AutoProcessor a magic loading class for all modalities * Quality
-
Sylvain Gugger authored
-
Ekagra Ranjan authored
-
NielsRogge authored
* First draft * More improvements * Improve model, add custom CUDA code * Import torch before * Add script that imports custom layer * Add everything in new ops directory * Import custom layer in modeling file * Fix ARCHIVE_MAP typo * Creating the custom kernel on the fly. * Import custom layer in modeling file * More improvements * Fix CUDA loading * More improvements * Improve conversion script * Improve conversion script * Make it work until encoder_outputs * Make forward pass work * More improvements * Make logits match original implementation * Make implementation also support single_scale model * Add support for single_scale and dilation checkpoint * Add support for with_box_refine model * Support also two stage model * Improve tests * Fix more tests * Make more tests pass * Upload all models to the hub * Clean up some code * Improve decoder outputs * Rename intermediate hidden states and reference points * Improve model outputs * Move tests to dedicated folder * Improve model outputs * Fix retain_grad test * Improve docs * Clean up and make test_initialization pass * Improve variable names * Add copied from statements * Improve docs * Fix style * Improve docs * Improve docs, move tests to model folder * Fix rebase * Remove DetrForSegmentation from auto mapping * Apply suggestions from code review * Improve variable names and docstrings * Apply some more suggestions from code review * Apply suggestion from code review * better docs and variables names * hint to num_queries and two_stage confusion * remove asserts and code refactor * add exception if two_stage is True and with_box_refine is False * use f-strings * Improve docs and variable names * Fix code quality * Fix rebase * Add require_torch_gpu decorator * Add pip install ninja to CI jobs * Apply suggestion of @sgugger * Remove DeformableDetrForObjectDetection from auto mapping * Remove DeformableDetrModel from auto mapping * Add model to toctree * Add model back to mappings, skip model in pipeline tests * Apply @sgugger's suggestion * Fix imports in the init * Fix copies * Add CPU implementation * Comment out GPU function * Undo previous change * Apply more suggestions * Remove require_torch_gpu annotator * Fix quality * Add logger.info * Fix logger * Fix variable names * Fix initializaztion * Add missing initialization * Update checkpoint name * Add model to doc tests * Add CPU/GPU equivalence test * Add Deformable DETR to pipeline tests * Skip model for object detection pipeline Co-authored-by:
Nicolas Patry <patry.nicolas@protonmail.com> Co-authored-by:
Nouamane Tazi <nouamane98@gmail.com> Co-authored-by:
Sylvain Gugger <Sylvain.gugger@gmail.com>
-
Ahmed Elnaggar authored
FlaxLongT5PreTrainedModel is missing "enable_gradient_checkpointing" function. This gives an error if someone tries to enable gradient checkpointing for longt5. This pull request fixes it.
-
- 13 Sep, 2022 10 commits
-
-
Joao Gante authored
-
Sylvain Gugger authored
-
Yih-Dar authored
Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Wang, Yi authored
only main_process will have HPO, and pass argument to other process Signed-off-by:
Wang, Yi A <yi.a.wang@intel.com> Signed-off-by:
Wang, Yi A <yi.a.wang@intel.com>
-
Rahul A R authored
* fixed bug which caused overwrite_cache to always be True (#18967). * reformatting changes
-
Ankur Goyal authored
Co-authored-by:Ankur Goyal <ankur@impira.com>
-
Yih-Dar authored
Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Partho authored
* added type hints * fixed typo
-
Joao Gante authored
-
Alara Dirik authored
* fix preprocessing for instance segmentation maps * add support for per-image instance2class_id mapping * edit docstrings for clarity
-
- 12 Sep, 2022 14 commits
-
-
Chris Emezue authored
Fix connected to [this issue](https://github.com/huggingface/transformers/issues/18944)
-
Tobias Nusser authored
Fixed typo itmes --> items
-
Joao Gante authored
-
Partho authored
-
Yih-Dar authored
* fix checkpoint name for wav2vec2 conformer Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Joao Gante authored
-
Matt authored
* Use int64 throughout TFLongFormer * make style * Do some more fixed casting in TFLongFormer * Fix some wonky "is None" conditionals * Cast all the dtypes, salt the earth * Fix copies to TFLED as well and do some casting there * dtype fix in TFLongformer test * Make fixup * Expand tolerances on the LED tests too (I think this is a TF32 thing) * Expand test tolerances for LED a tiny bit (probably a Tensorfloat thing again)
-
Sylvain Gugger authored
* Align try_to_load_from_cache with huggingface_hub * Fix tests
-
Matt authored
* Update our TF 2.0 input format tip across all models * make style
-
Shijie Wu authored
-
Yih-Dar authored
* create Past CI results as tables for GitHub issue Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Wang, Yi authored
only main_process will have HPO, and pass argument to other process Signed-off-by:
Wang, Yi A <yi.a.wang@intel.com> Signed-off-by:
Wang, Yi A <yi.a.wang@intel.com>
-
Shijie Wu authored
-
- 10 Sep, 2022 3 commits
-
-
Sylvain Gugger authored
This reverts commit d8cf3b20.
-
Joao Gante authored
-
Joao Gante authored
-
- 09 Sep, 2022 7 commits
-
-
Bram Vanroy authored
* update black target version * add comment as per https://github.com/huggingface/transformers/pull/18955#issuecomment-1242081649 * revert change Will only update to 3.7 after black 2023 upgrade in January
-
Sylvain Gugger authored
-
Matt authored
* Fix train_step and test_step, correctly enable CLIP fit test * Stop using get_args on older Python versions * Don't use get_origin either * UnionType is actually even newer, don't use that either * Apply the same fix to test_loss_computation * Just realized I was accidentally skipping a bunch of tests! * Fix test_loss_computation for models without separable labels * Fix scalar losses in test_step and train_step * Stop committing your breakpoints * Fix Swin loss shape * Fix Tapas loss shape * Shape fixes for TAPAS, DeIT, HuBERT and ViTMAE * Add loss computation to TFMobileBertForPreTraining * make fixup and move copied from statement * make fixup and move copied from statement * Correct copied from * Add labels and next_sentence_label inputs to TFMobileBERT * Make sure total_loss is always defined * Update tests/test_modeling_tf_common.py Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Fix copied from * Ensure CTC models get labels in tests * Ensure CTC models get labels in tests * Fix tests for vit_mae * Fix tests for vit_mae * Fix tests for vit_mae * Reduce batch size for wav2vec2 testing because it was causing OOM * Skip some TAPAS tests that are failing * Skip a failing HuBERT test * make style * Fix mobilebertforpretraining test * Skip Wav2Vec2 tests that use huge amounts of mem * Skip keras_fit for Wav2Vec2 as well Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com>
-
Ekagra Ranjan authored
-
Rafa艂 Jankowski authored
* NeptuneCallback improvements * After review suggestions and deduplication of initial run * Added volatile checkpoints support due to missing post-rebase commit * Update README per review comments - Remove list formatting - Correct Neptune docs link Co-authored-by:Sabine <sabine.nyholm@neptune.ai>
-
Sanchit Gandhi authored
* [JAX] Replace all jax.tree_* calls with jax.tree_util.tree_* * fix double tree_util
-
HuYong authored
* add_ernie * remove Tokenizer in ernie * polish code * format code style * polish code * fix style * update doc * make fix-copies * change model name * change model name * fix dependency * add more copied from * rename ErnieLMHeadModel to ErnieForCausalLM do not expose ErnieLayer update doc * fix * make style * polish code * polish code * fix * fix * fix * fix * fix * final fix Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-