- 19 Jul, 2022 6 commits
-
-
Sylvain Gugger authored
* Initial work * More work * Add tests for custom pipelines on the Hub * Protect import * Make the test work for TF as well * Last PyTorch specific bit * Add documentation * Style * Title in toc * Bad names! * Update docs/source/en/add_new_pipeline.mdx Co-authored-by:
Lysandre Debut <lysandre.debut@reseau.eseo.fr> * Auto stash before merge of "custom_pipeline" and "origin/custom_pipeline" * Address review comments * Address more review comments * Update src/transformers/pipelines/__init__.py Co-authored-by:
Lysandre Debut <lysandre.debut@reseau.eseo.fr> Co-authored-by:
Lysandre Debut <lysandre.debut@reseau.eseo.fr>
-
Patrick von Platen authored
* add first generation tutorial * [from_pretrained] Allow loading models from subfolders * remove gen file * add doc strings * allow download from subfolder * add tests * Apply suggestions from code review Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * apply comments * correct doc string Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Snehan Kekre authored
* Update docs README with instructions on locally previewing docs * Add instructions to install `watchdog` before previewing the docs
-
orgoro authored
-
Sylvain Gugger authored
-
Duong A. Nguyen authored
* remove use_auth_token from from_config * restore use_auth_token from_pretrained run_t5_mlm_flax
-
- 18 Jul, 2022 17 commits
-
-
Sylvain Gugger authored
-
Sourab Mangrulkar authored
* FSDP integration enhancements and fixes * resolving comments * fsdp fp16 mixed precision requires `ShardedGradScaler`
-
Nicola Procopio authored
* added training.mdx * updated training.mdx * updated training.mdx * updated training.mdx * updated _toctree.yml * fixed typos after review Co-authored-by:Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Younes Belkada authored
* minor fixes - add correct revision - corrected dosctring for test - removed a test * contrib credits Co-authored-by:
Yih-Dar <2521628+ydshieh@users.noreply.github.com> Co-authored-by:
Nouamane Tazi <nouamane98@gmail.com> Co-authored-by:
Yih-Dar <2521628+ydshieh@users.noreply.github.com> Co-authored-by:
Nouamane Tazi <nouamane98@gmail.com>
-
Nicola Procopio authored
* added multilingual.mdx * updated multilingual.mdx * italian translation multilingual.mdx * updated _toctree.yml * fixed typos _toctree.yml * fixed typos after review * fixed error after review
-
Nicola Procopio authored
* updated _toctree.yml * added preprocessing * updated preprocessing.mdx * updated preprocessing.mdx updated after review
-
SaulLu authored
-
Sylvain Gugger authored
-
Sylvain Gugger authored
-
Ayan Sengupta authored
-
Yih-Dar authored
Co-authored-by:
Joao Gante <joaofranciscocardosogante@gmail.com> Co-authored-by:
ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
* fix expected loss values Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
Wang, Yi authored
* [HPO] update to sigopt new experiment api * follow https://docs.sigopt.com/experiments Signed-off-by:
Wang, Yi A <yi.a.wang@intel.com> * [HPO] use new API if sigopt version >= 8.0.0 Signed-off-by:
Wang, Yi A <yi.a.wang@intel.com>
-
gcheron authored
Co-authored-by:Guilhem Ch茅ron <guilhemc@authentifier.com>
-
Lysandre Debut authored
* NLLB tokenizer * Apply suggestions from code review - Thanks Stefan! Co-authored-by:
Stefan Schweter <stefan@schweter.it> * Final touches * Style :) * Update docs/source/en/model_doc/nllb.mdx Co-authored-by:
Stefan Schweter <stefan@schweter.it> * Apply suggestions from code review Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * PR reviews * Auto models Co-authored-by:
Stefan Schweter <stefan@schweter.it> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
John Giorgi authored
-
John Giorgi authored
-
- 15 Jul, 2022 2 commits
-
-
Nicolas Patry authored
* Adding support for `device_map` directly in `pipeline(..)` function. * Updating the docstring. * Adding a better docstring * Put back type hints. * Blacked. (`make fixup` didn't work ??!!)
-
Nicolas Patry authored
* Fixing a bug where attention mask was not passed to generate. * Fixing zero-size prompts. * Comment on top.
-
- 13 Jul, 2022 9 commits
-
-
amyeroberts authored
* Initial TF DeiT implementation * Fix copies naming issues * Fix up + docs * Properly same main layer * Name layers properly * Initial TF DeiT implementation * Fix copies naming issues * Fix up + docs * Properly same main layer * Name layers properly * Fixup * Fix import * Fix import * Fix import * Fix weight loading for tests whilst not on hub * Add doc tests and remove to_2tuple * Add back to_2tuple Removing to_2tuple results in many downstream changes needed because of the copies checks * Incorporate updates in Improve vision models #17731 PR * Don't hard code num_channels * Copy PyTorch DeiT embeddings and remove pytorch operations with mask * Fix patch embeddings & tidy up * Update PixelShuffle to move logic into class layer * Update doc strings - remove PT references * Use NHWC format in internal layers * Fix up * Use linear activation layer * Remove unused import * Apply suggestions from code review Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com> Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Move dataclass to top of file * Remove from_pt now weights on hub * Fixup Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Amy Roberts <amyeroberts@users.noreply.github.com>
-
Wei authored
* enable fx2trt * Update perf_train_gpu_one.mdx * Update perf_train_gpu_one.mdx * add lib check * update * format * update * fix import check * fix isort * improve doc * refactor ctx manager * fix isort * black format * isort fix * fix format * update args * update black * cleanups * Update perf_train_gpu_one.mdx * code refactor * code refactor to init * remove redundancy * isort * replace self.args with args Co-authored-by:Stas Bekman <stas@stason.org>
-
Sylvain Gugger authored
* Make sharded checkpoints work in offline mode * Add test
-
Sylvain Gugger authored
This reverts commit 3564c657.
-
Sylvain Gugger authored
-
lmagne authored
* added metadata to training summary * Apply suggestions from code review Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
John Giorgi authored
* Add summarization name mapping for MultiNews * Add summarization name mapping for MultiNews
-
Sebastian Sosa authored
* supported python versions reference * Update CONTRIBUTING.md removing commit hash from link Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Joao Gante authored
-
- 12 Jul, 2022 6 commits
-
-
Joao Gante authored
-
Niklas Muennighoff authored
* Add fp16 option * Fix BLOOM dtype * Formatting * Remove torch_dtype arg * Revert formatting * Apply formatting * Add n_embed backward compat
-
Joao Gante authored
-
wei zhao authored
* Report value for a step instead of epoch. Report an objective function value for a step instead of epoch to optuna. I made this modification for the following reason: If "eval_steps" is less than steps per epoch, there maybe warnings like this: "optuna/trial/_trial.py:592: UserWarning: The reported value is ignored because this `step` 0 is already reported.". So "step" are more appropriate than "epoch" here. * MOD: make style. Co-authored-by:zhaowei01 <zhaowei01@yuanfudao.com>
-
Sijun He authored
-
jianan-gu authored
* enhance ipex import * refine codes * refine style * add link * style Co-authored-by:Stas Bekman <stas@stason.org>
-