- 02 May, 2022 8 commits
-
-
Sylvain Gugger authored
-
yujun authored
* add torch.no_grad when in eval mode * make style quality
-
Martin Pömsl authored
-
Sanchit Gandhi authored
* [FlaxSpeechEncoderDecoder] Fix bug in `decoder_module` * [FlaxEncoderDecoder] Fix bug in `decoder_module`
-
Sylvain Gugger authored
-
Michael Benayoun authored
* Add meta proxy * Uses meta data to trace data dependent control-flow * Remove commented class * Handles torch creating functions * Added type annotation to fix tracing * Tracing works for everything but T5 and GPT-J * Almost all previously supported models pass * All architectures can be traced except T5 * Intermediate commit to have a trace of the comparison operators for HFProxy * Everything works, except loss computation * Everything works * Removed unused import * Overridden methods do not use underlying ops (linear and torch.matmul), and model attributes are copied to the traced version * Fix torch_matmul_override * Change attributes reference to deepcopy * Remove breakpoint and add torch_index_override * Small fix * Fix typo * Replace asserts by explicit exceptions
-
Sanchit Gandhi authored
-
Manan Dey authored
-
- 30 Apr, 2022 2 commits
-
-
Omar U. Espejel authored
* Add translating guide
-
Yih-Dar authored
* Add the check * add missing ckpts * add a list to ignore * call the added check script * better regex pattern Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
- 29 Apr, 2022 9 commits
-
-
Sylvain Gugger authored
* Result of new doc style with fixes * Add last two files * Bump hf-doc-builder
-
Sylvain Gugger authored
* Replace dict/BatchEncoding instance checks by Mapping * Typo
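A minimal sketch of why a `Mapping` check is broader than a `dict`/`BatchEncoding` check. `FakeBatchEncoding` is a hypothetical stand-in built on `UserDict`, and both helper functions are illustrative names, not the library's code:

```python
from collections import UserDict
from collections.abc import Mapping
from types import MappingProxyType


class FakeBatchEncoding(UserDict):
    """Hypothetical stand-in for a dict-like container that is not a dict subclass."""


def is_dict_like_old(obj) -> bool:
    # Old style: enumerate every accepted concrete class explicitly.
    return isinstance(obj, (dict, FakeBatchEncoding))


def is_dict_like_new(obj) -> bool:
    # New style: accept anything implementing the read-only mapping protocol.
    return isinstance(obj, Mapping)


plain = {"input_ids": [1, 2, 3]}
encoding = FakeBatchEncoding(plain)
proxy = MappingProxyType(plain)  # a read-only view, neither dict nor UserDict

for candidate in (plain, encoding, proxy):
    print(type(candidate).__name__, is_dict_like_old(candidate), is_dict_like_new(candidate))
```

The read-only proxy is rejected by the explicit class list but accepted by the `Mapping` check, which is the kind of case an abstract-base-class test covers without listing each type.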
-
Nicolas Patry authored
This reverts commit 4f3a14e3.
-
Nicolas Patry authored
-
tarzan authored
-
Pavel Belevich authored
-
Joao Gante authored
-
Zachary Mueller authored
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
- 28 Apr, 2022 6 commits
-
-
Sylvain Gugger authored
-
Zachary Mueller authored
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
amyeroberts authored
-
conan1024hao authored
* Add parameter --config_overrides for run_mlm_wwm.py * linter
-
Yih-Dar authored
* fix * Apply suggestions from code review Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
- 27 Apr, 2022 14 commits
-
-
Dat Quoc Nguyen authored
The emoji version must be either 0.5.4 or 0.6.0. Newer emoji versions have been updated to newer versions of the Emoji Charts and are thus not consistent with the one used for pre-processing the pre-training Tweet corpus (i.e., not consistent with the vocab).
-
Yih-Dar authored
* Add -e flag * add check * create new keys * run python setup.py build install * add comments * change to develop Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Sylvain Gugger authored
* Fix doc notebooks links * Remove missing section
-
Zachary Mueller authored
* Fixup all examples
-
Sylvain Gugger authored
* Fix multiple deletions of the same files in save_pretrained * Add is_main_process argument
-
Sylvain Gugger authored
-
Mishig Davaadorj authored
BertModelForSequenceClassification -> BertForSequenceClassification
-
Antoni Baum authored
* Fix `distributed_concat` with scalar tensor * Update trainer_pt_utils.py
-
NielsRogge authored
* Add fix * Apply suggestion from code review Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local>
-
Leonid Boytsov authored
1. Fixes evaluation errors that pop up when you train/eval on SQuAD v2 (one newly encountered, and one previously reported in "Running SQuAD 1.0 sample command raises IndexError" #15401 but not completely fixed). 2. Removes boolean arguments that don't use store_true. Please don't use these: ANY non-empty string is converted to True in this case, which is clearly not the desired behavior (and it creates a LOT of confusion). 3. All no-trainer test scripts now save metric values in the same way (with the right prefix, eval_), which is consistent with the trainer-based versions. 4. Adds the forgotten model.eval() in the no-trainer versions. This improved some results, but not all of them (see the discussion at the end). Please see the F1 scores and the discussion below.
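The boolean-argument pitfall in point 2 can be reproduced with the standard `argparse` module. This is a minimal sketch, not code from the example scripts: the flag names `--buggy_flag` and `--safe_flag` are hypothetical.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    parser = argparse.ArgumentParser()
    # Pitfall: type=bool applies bool() to the raw string, and any
    # non-empty string (including "False") is truthy.
    parser.add_argument("--buggy_flag", type=bool, default=False)
    # Safer: the flag takes no value; its presence alone sets it to True.
    parser.add_argument("--safe_flag", action="store_true")
    return parser


parser = build_parser()
buggy = parser.parse_args(["--buggy_flag", "False"])
safe_off = parser.parse_args([])
safe_on = parser.parse_args(["--safe_flag"])

print(buggy.buggy_flag)    # True, even though the user typed "False"
print(safe_off.safe_flag)  # False
print(safe_on.safe_flag)   # True
```

With `store_true` there is no string to misinterpret, which is why the commit removes boolean arguments declared the other way.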
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
NielsRogge authored
* Add first draft * Improve script and README * Improve README * Apply suggestions from code review * Improve script, add link to resulting model * Add corresponding test * Adjust learning rate
-
Anton Lozhkov authored
* Avoid repeated per-lang filtering * Language groups and logits preprocessing * Style
-
- 26 Apr, 2022 1 commit
-
-
Yongliang Shen authored
-