- 23 Mar, 2022 2 commits
-
-
Sylvain Gugger authored
* Split file_utils in several submodules * Fixes * Add back more objects * More fixes * Who exactly decided to import that from there? * Second suggestion to code with code review * Revert wront move * Fix imports * Adapt all imports * Adapt all imports everywhere * Revert this import, will fix in a separate commit
-
Lysandre Debut authored
* Updates the default branch from master to main * Links from `master` to `main` * Typo * Update examples/flax/README.md Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
- 21 Mar, 2022 1 commit
-
-
Anton Lozhkov authored
* update results * per-language metrics * Format the per-language metrics
-
- 17 Mar, 2022 1 commit
-
-
Suraj Patil authored
-
- 16 Mar, 2022 3 commits
-
-
Anton Lozhkov authored
* Minor fixes * Fix vocab union * Update examples/research_projects/xtreme-s/README.md Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com> * Update README * unused import Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com>
-
Sanchit Gandhi authored
* Replace all deprecated `jax.ops` operations with jnp's `at` * np to jnp scores * suggested changes
-
Patrick von Platen authored
-
- 15 Mar, 2022 1 commit
-
-
Anton Lozhkov authored
* CTC+classification draft * CTC+classification draft * style * multilingual runs * Fix race condition during processor.from_reatrained * Merge covost experiments * Add README * Quality * Switch to .all configs * Fix typos
-
- 12 Mar, 2022 1 commit
-
-
Stas Bekman authored
* [WIP] add support for bf16 mode * prep for bf16 * prep for bf16 * fix; zero2/bf16 is ok * check bf16 is available * test fixes * enable zero3_bf16 * config files * docs * split stage_dtype; merge back to non-dtype-specific config file * fix doc * cleanup * cleanup * bfloat16 => bf16 to match the PR changes * s/zero_gather_fp16_weights_on_model_save/zero_gather_16bit_weights_on_model_save/; s/save_fp16_model/save_16bit_model/ * test fixes/skipping * move * fix * Update docs/source/main_classes/deepspeed.mdx Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * backticks * cleanup * cleanup * cleanup * new version * add note about grad accum in bf16 Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
- 10 Mar, 2022 2 commits
-
-
Sylvain Gugger authored
-
Sanchit Gandhi authored
-
- 09 Mar, 2022 2 commits
-
-
Shotaro Ishihara authored
at transformers/examples/pytorch/contrastive-image-text
-
Joao Gante authored
-
- 08 Mar, 2022 2 commits
-
-
Joao Gante authored
-
Yeb Havinga authored
Speedup training by using numpy instead of jnp for batch shuffling Co-authored-by:Yeb Havinga <y.t.havinga@mgrid.net>
-
- 04 Mar, 2022 2 commits
-
-
Patrick von Platen authored
-
Sanchit Gandhi authored
-
- 03 Mar, 2022 2 commits
-
-
davidleonfdez authored
-
Sylvain Gugger authored
-
- 02 Mar, 2022 2 commits
-
-
Ross Johnstone authored
-
Joao Gante authored
-
- 01 Mar, 2022 1 commit
-
-
Joao Gante authored
-
- 25 Feb, 2022 1 commit
-
-
Suraj Patil authored
-
- 23 Feb, 2022 1 commit
-
-
Lysandre Debut authored
* Per-folder tests reorganization Co-authored-by:
sgugger <sylvain.gugger@gmail.com> Co-authored-by:
Stas Bekman <stas@stason.org>
-
- 22 Feb, 2022 1 commit
-
-
Yongrae Jo authored
cna -> can
-
- 21 Feb, 2022 4 commits
-
-
Joao Gante authored
* Working example with to_tf_dataset * updated text_classification * more comments
-
Suraj Patil authored
* begin script * update script * fix features and data args * main * add requirements * add column name args * fix captions * don't jit transforms * fix caption * fix labels, handle attention mask * convert pixel values to numpy * labels => input_ids * transform images on the fly * use AutoModel class, create the hybird model outside of the script * fix version message * add readme * Apply suggestions from code review Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com> * adderss review comments * add more comments * allow freezing vision and text models Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com>
-
Ivan Agarsk媒 authored
-
Simon Sardorf authored
Remove input and target reset after preprocessing
-
- 17 Feb, 2022 2 commits
-
-
NielsRogge authored
* Add first draft * Make model importable * Make SwinForMaskedImageModeling importable * Fix imports * Add missing inits * Add support for Swin * Fix bug * Fix bug * Fix another bug * Fix Swin MIM implementation * Fix default encoder stride * Fix Swin * Add print statements for debugging * Add image_size data argument * Fix Swin * Fix image_size * Add print statements for debugging * Fix print statement * Remove print statements * Improve reshaping of bool_masked_pos * Add support for DeiT, fix tests * Improve docstrings * Apply new black version * Improve script * Fix bug * Improve README * Apply suggestions from code review * Remove DS_Store and add to gitignore * Apply suggestions from code review + fix BEiT Flax * Revert BEiT changes * Improve README * Fix code quality * Improve README Co-authored-by:
Niels Rogge <nielsrogge@Nielss-MBP.localdomain> Co-authored-by:
Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
-
NielsRogge authored
Co-authored-by:Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
-
- 15 Feb, 2022 1 commit
-
-
Shamane Siri authored
-
- 11 Feb, 2022 1 commit
-
-
Stas Bekman authored
* [research_projects] deal with security alerts * add a note of the original PL ver and warning
-
- 10 Feb, 2022 1 commit
-
-
Patrick von Platen authored
-
- 09 Feb, 2022 1 commit
-
-
Lysandre Debut authored
* Upgrade black to version ~=22.0 * Check copies * Fix code
-
- 07 Feb, 2022 1 commit
-
-
Anton Lozhkov authored
* Single-epoch run * Apply suggestions from code review Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com> * Infinite dataset * Trainer fix + distributed benchmark * Benchmark fix * unused import * interleaved splits * interleaved splits * has_length util * Move to research projects * Leftover Sized checks * Bump min version * Unused import * Revert trainer changes Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com>
-
- 03 Feb, 2022 1 commit
-
-
davidleonfdez authored
* Add preprocess_logits_for_metrics Trainer param * Compute accuracy in LM examples * Improve comments
-
- 02 Feb, 2022 1 commit
-
-
Sylvain Gugger authored
* Playing * Properly set labels in model config for token classification example * Port to run_ner_no_trainer * Quality
-
- 01 Feb, 2022 2 commits
-
-
Sylvain Gugger authored
* Harder check for IndexErrors in QA scripts * Make test stronger
-
Kamal Raj authored
fix typo
-