- 06 Jul, 2021 6 commits
-
-
SaulLu authored
-
Patrick von Platen authored
-
Suzana Ilić authored
-
Suraj Patil authored
* flax gpt neo * fix query scaling * update generation test * use flax model for test
-
yujun authored
* add RoFormerTokenizerFast into AutoTokenizer * fix typo in roformer docs * make onnx export happy * update RoFormerConfig embedding_size * use jieba not rjieba * fix 12244 and make test_alignement passed * update ARCHIVE_MAP * make style & quality & fixup * update * make style & quality & fixup * make style quality fixup * update * suggestion from LysandreJik Co-authored-by:
Lysandre Debut <lysandre@huggingface.co> * make style * use rjieba Co-authored-by:
Lysandre Debut <lysandre@huggingface.co>
-
Suraj Patil authored
* fix saving and loading * update readme
-
- 05 Jul, 2021 12 commits
-
-
Patrick von Platen authored
* fix_torch_device_generate_test * remove @ * adapt flax pretrain script
-
Patrick von Platen authored
* fix_torch_device_generate_test * remove @ * up
-
Patrick von Platen authored
* fix_torch_device_generate_test * remove @ * push
-
Patrick von Platen authored
* fix_torch_device_generate_test * remove @ * add logging steps * correct training scripts * correct readme * correct
-
Matt authored
* NER example for Tensorflow * Style pass * Style pass * Added metric computation on the evaluation set * Style pass * Fixed label masking * Style pass * Style pass
-
Patrick von Platen authored
* fix_torch_device_generate_test * remove @ * upload * finish dataset streaming * adapt readme * finish * up * up * up * up * Apply suggestions from code review * finish * make style * make style2 * finish Co-authored-by: Patrick von Platen <patrick@huggingface.co>
-
Navjot authored
-
Suraj Patil authored
* begin script * clean example, add readme * update readme * remove decay mask * remove masking * update readme & make flake happy
-
Akmal authored
-
sadakmed authored
* create LxmertModelIntegrationTest * implementation using numpy seeding to fix inputs params. * fix code quality * isort check
-
Suraj Patil authored
* clip style example * fix post init * add requirements * update readme, few small fixes
-
Lysandre Debut authored
-
- 04 Jul, 2021 1 commit
-
-
Patrick von Platen authored
-
- 02 Jul, 2021 7 commits
-
-
Omar Sanseviero authored
-
Patrick von Platen authored
-
Patrick von Platen authored
-
Patrick von Platen authored
Thanks a lot @BirgerMoell
-
Lysandre Debut authored
-
Matthew LeMay authored
-
Sylvain Gugger authored
-
- 01 Jul, 2021 12 commits
-
-
Stas Bekman authored
* fix lm_head.decoder.weight ignore_key handling * fix the mutable class variable * Update src/transformers/models/roberta/modeling_roberta.py Co-authored-by:
Lysandre Debut <lysandre@huggingface.co> * replicate the comment * make deterministic Co-authored-by:
Lysandre Debut <lysandre@huggingface.co>
-
Teven authored
* fixing bug with param count without embeddings * Update src/transformers/modeling_utils.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * style Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Souvic Chakraborty authored
* Validation split added: custom data files Validation split added in case of no validation file and loading custom data * Updated documentation with custom file usage Updated documentation with custom file usage * Update README.md * Update README.md * Update README.md * Made some suggested stylistic changes * Used logger instead of print. Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Made similar changes to add validation split In case of a missing validation file, a validation split will be used now. * max_train_samples to be used for training only max_train_samples got misplaced, now corrected so that it is applied on training data only, not whole data. * styled * changed ordering * Improved language of documentation Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Improved language of documentation Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Fixed styling issue * Update run_mlm.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Thibault FEVRY authored
* Import fix_inits handling of duplicate definitions. * Style fix
-
Patrick von Platen authored
* Add TPU README * Apply suggestions from code review * Update examples/research_projects/jax-projects/README.md * Update examples/research_projects/jax-projects/README.md Co-authored-by:
Stefan Schweter <stefan@schweter.it> Co-authored-by:
Stefan Schweter <stefan@schweter.it>
-
Patrick von Platen authored
-
Suzana Ilić authored
-
Jin Young (Daniel) Sohn authored
torch_xla currently has its own synchronization primitives, so use xm.rendezvous(tag) instead.
-
Lysandre Debut authored
-
Patrick von Platen authored
* fix_torch_device_generate_test * remove @ * fix test
-
Patrick von Platen authored
* fix_torch_device_generate_test * remove @ * upload * finish doc * Apply suggestions from code review Co-authored-by:
Omar Sanseviero <osanseviero@users.noreply.github.com> Co-authored-by:
Lysandre Debut <lysandre@huggingface.co> Co-authored-by:
Julien Chaumond <chaumond@gmail.com> * finish Co-authored-by:
Omar Sanseviero <osanseviero@users.noreply.github.com> Co-authored-by:
Lysandre Debut <lysandre@huggingface.co> Co-authored-by:
Julien Chaumond <chaumond@gmail.com>
-
SaulLu authored
* add a test for a WordLevel tokenizer * adapt common test to new tokenizer
-
- 30 Jun, 2021 2 commits
-
-
Patrick von Platen authored
* fix_torch_device_generate_test * remove @ * start flax wav2vec2 * save intermediate * forward pass has correct shape * add weight norm * add files * finish ctc * make style * finish gumbel quantizer * correct docstrings * correct some more files * fix vit * finish quality * correct tests * correct docstring * correct tests * start wav2vec2 pretraining script * save intermediate * start pretraining script * finalize pretraining script * finish * finish * small typo * finish * correct * Apply suggestions from code review Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Suraj Patil <surajp815@gmail.com> * make style * push Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Suraj Patil <surajp815@gmail.com>
-
Suraj Patil authored
* add philosophy doc * fix typos * update doc * Apply suggestions from code review Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com> * address Patricks suggestions * add a training example and fix typos * jit the training step * jit train step * fix example code * typo * Apply suggestions from code review Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com>
-