- 28 May, 2020 5 commits
-
Suraj Patil authored
-
Anthony MOI authored
-
Suraj Patil authored
-
Lavanya Shukla authored
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
-
Iz Beltagy authored
* adding freeze roberta models
* model cards
* lint
-
- 27 May, 2020 10 commits
-
Patrick von Platen authored
* improve memory benchmarking
* correct typo
* fix current memory
* check torch memory allocated
* better pytorch function
* add total cached gpu memory
* add total gpu required
* improve torch gpu usage
* update memory usage
* finalize memory tracing
* save intermediate benchmark class
* fix conflict
* improve benchmark
* improve benchmark
* finalize
* make style
* improve benchmarking
* correct typo
* make train function more flexible
* fix csv save
* better repr of bytes
* better print
* fix __repr__ bug
* finish plot script
* rename plot file
* delete csv and small improvements
* fix in plot
* fix in plot
* correct usage of timeit
* remove redundant line
* remove redundant line
* fix bug
* add hf parser tests
* add versioning and platform info
* make style
* add gpu information
* ensure backward compatibility
* finish adding all tests
* Update src/transformers/benchmark/benchmark_args.py
* Update src/transformers/benchmark/benchmark_args_utils.py
* delete csv files
* fix isort ordering
* add out of memory handling
* add better train memory handling

Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
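For readers who want to try the reworked benchmarks, a minimal sketch under the assumption that the PyTorchBenchmark / PyTorchBenchmarkArguments names from this change are available; the model id and sizes are illustrative:

```python
from transformers import PyTorchBenchmark, PyTorchBenchmarkArguments

# Illustrative model id and sizes; any checkpoint works the same way.
args = PyTorchBenchmarkArguments(
    models=["bert-base-uncased"],
    batch_sizes=[8],
    sequence_lengths=[128, 512],
)
benchmark = PyTorchBenchmark(args)
results = benchmark.run()  # reports speed plus the new memory measurements
```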
-
Suraj Patil authored
* LongformerForSequenceClassification
* better naming x => hidden_states, fix typo in doc
* Update src/transformers/modeling_longformer.py
* Update src/transformers/modeling_longformer.py

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
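A hedged usage sketch of the new head; the checkpoint id is the public Longformer base model, and the tuple-style outputs reflect the library at this point:

```python
from transformers import LongformerTokenizer, LongformerForSequenceClassification

tokenizer = LongformerTokenizer.from_pretrained("allenai/longformer-base-4096")
model = LongformerForSequenceClassification.from_pretrained("allenai/longformer-base-4096")

# encode_plus was the standard encoding call in this era of the library
inputs = tokenizer.encode_plus("A long document ...", return_tensors="pt")
logits = model(**inputs)[0]  # models returned plain tuples at this point
```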
-
Suraj Patil authored
-
Lysandre Debut authored
* per_device instead of per_gpu/error thrown when argument unknown
* [docs] Restore examples.md symlink
* Correct absolute links so that symlink to the doc works correctly
* Update src/transformers/hf_argparser.py
* Warning + reorder
* Docs
* Style
* not for squad

Co-authored-by: Julien Chaumond <chaumond@gmail.com>
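A minimal sketch of the renamed arguments; output_dir and the batch sizes below are illustrative values:

```python
from transformers import TrainingArguments

# per_device_* replaces the older per_gpu_* names; HfArgumentParser now
# raises on unknown arguments instead of silently ignoring them.
args = TrainingArguments(
    output_dir="./results",            # illustrative path
    per_device_train_batch_size=8,
    per_device_eval_batch_size=16,
)
```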
-
Mehrdad Farahani authored
HooshvareLab/bert-base-parsbert-uncased
-
Patrick von Platen authored
-
Darek Kłeczek authored
Co-authored-by: kldarek <darekmail>
-
Darek Kłeczek authored
Model card for cased model
-
Sam Shleifer authored
-
Hao Tan authored
The option `--do_lower_case` is currently required by the uncased models (i.e., bert-base-uncased, bert-large-uncased).

Results:
* BERT-BASE without --do_lower_case: 'exact': 73.83, 'f1': 82.22
* BERT-BASE with --do_lower_case: 'exact': 81.02, 'f1': 88.34
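To see why the flag matters, a small sketch assuming the standard BertTokenizer behaviour; the commented outputs are the expected behaviour, not verified here:

```python
from transformers import BertTokenizer

# The uncased checkpoints ship a lowercased vocabulary.
lower = BertTokenizer.from_pretrained("bert-base-uncased", do_lower_case=True)
print(lower.tokenize("Hello World"))  # ['hello', 'world'], matches the vocab

raw = BertTokenizer.from_pretrained("bert-base-uncased", do_lower_case=False)
print(raw.tokenize("Hello World"))    # cased tokens no longer line up with the
                                      # vocabulary, which is what hurts SQuAD F1
```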
-
- 26 May, 2020 11 commits
-
Bayartsogt Yadamsuren authored
Here I am uploading a Mongolian masked language model (ALBERT) to your platform. https://en.wikipedia.org/wiki/Mongolia
-
Wissam Antoun authored
* updated aubmindlab/bert-base-arabert model card
* updated aubmindlab/bert-base-arabertv01 model card
-
Oleksandr Bushkovskyi authored
Add language metadata, training and evaluation corpora details. Add example output. Fix inconsistent use of quotes.
-
Manuel Romero authored
-
Manuel Romero authored
-
Patrick von Platen authored
* revert convenience method
* clean docs a bit
-
ohmeow authored
* adding BART summarization how-to community notebook
* Update notebooks/README.md

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
-
Bram Vanroy authored
* make transformers-cli cross-platform

  Using "scripts" is a useful option in setup.py, particularly when you want access to non-Python scripts. However, in this case we want an entry point into some of our own Python scripts. To do this in a concise, cross-platform way, we can use entry_points.console_scripts. This change is necessary to provide the CLI on different platforms, which "scripts" does not ensure. Usage remains the same, but the transformers-cli script has to be moved (be part of the library) and renamed (underscore + extension).

* make style & quality
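A sketch of the mechanism described above; the entry-point module path is illustrative of the change, not quoted from the diff:

```python
from setuptools import setup

setup(
    name="transformers",
    # ... other metadata elided ...
    entry_points={
        "console_scripts": [
            # setuptools generates a transformers-cli executable on every platform
            "transformers-cli=transformers.commands.transformers_cli:main"
        ]
    },
)
```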
-
Patrick von Platen authored
* add new longformer for question answering model
* add new config as well
* fix links
* fix links part 2
-
ZhuBaohe authored
* fix
* fix1
-
ZhuBaohe authored
-
- 25 May, 2020 11 commits
-
Sam Shleifer authored
-
Julien Chaumond authored
-
Patrick von Platen authored
* fix reformer num buckets
* fix
* adapt docs
* set num buckets in config
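A minimal sketch of fixing the bucket count via the config, assuming the public Reformer checkpoint id; the value 64 is illustrative:

```python
from transformers import ReformerConfig, ReformerModelWithLMHead

config = ReformerConfig.from_pretrained("google/reformer-crime-and-punishment")
config.num_buckets = 64  # illustrative value; pins the LSH bucket count up front
model = ReformerModelWithLMHead(config)
```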
-
Elman Mansimov authored
-
Suraj Patil authored
-
Oliver Guhr authored
It looks like the conference has changed the link to the paper.
-
Sho Arora authored
-
Manuel Romero authored
-
Ali Safaya authored
-
Antonis Maronikolakis authored
-
Suraj Patil authored
* added LongformerForQuestionAnswering
* add LongformerForQuestionAnswering
* fix import for LongformerForMaskedLM
* add LongformerForQuestionAnswering
* hardcoded sep_token_id
* compute attention_mask if not provided
* combine global_attention_mask with attention_mask when provided
* update example in docstring
* add assert error messages, better attention combine
* add test for longformerForQuestionAnswering
* typo
* cast global_attention_mask to long
* make style
* Update src/transformers/configuration_longformer.py
* Update src/transformers/configuration_longformer.py
* fix the code quality
* Merge branch 'longformer-for-question-answering' of https://github.com/patil-suraj/transformers into longformer-for-question-answering

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
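A hedged end-to-end sketch of the new head; the fine-tuned TriviaQA checkpoint id is assumed to be available on the model hub, and the question/context strings are illustrative:

```python
import torch
from transformers import LongformerTokenizer, LongformerForQuestionAnswering

ckpt = "allenai/longformer-large-4096-finetuned-triviaqa"
tokenizer = LongformerTokenizer.from_pretrained(ckpt)
model = LongformerForQuestionAnswering.from_pretrained(ckpt)

question = "Who wrote the Longformer paper?"
context = "The Longformer paper was written by Iz Beltagy, Matthew E. Peters, and Arman Cohan."
encoding = tokenizer.encode_plus(question, context, return_tensors="pt")

# per the commit, attention_mask and global_attention_mask are computed and
# combined automatically when not supplied
start_scores, end_scores = model(**encoding)[:2]
start = torch.argmax(start_scores)
end = torch.argmax(end_scores) + 1
print(tokenizer.decode(encoding["input_ids"][0][start:end]))
```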
-
- 23 May, 2020 1 commit
-
Bharat Raghunathan authored
-
- 22 May, 2020 2 commits
-
Bijay Gurung authored
* Add Type Hints to modeling_utils.py

  Closes #3911. Adds type hints to methods in `modeling_utils.py`. Note: coverage isn't 100%; internal methods were mostly skipped.

* Reformat according to `black` and `isort`
* Use typing.Iterable instead of Sequence
* Parameterize Iterable by its generic type
* Use typing.Optional when None is the default value
* Adhere to style guideline
* Update src/transformers/modeling_utils.py
* Update src/transformers/modeling_utils.py

Co-authored-by: Julien Chaumond <chaumond@gmail.com>
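An illustrative fragment showing the annotation conventions listed above; the helper below is hypothetical, not an actual modeling_utils method:

```python
from typing import Iterable, Optional

import torch

# Hypothetical helper, shown only to illustrate the conventions above:
# Iterable parameterized by its element type, and Optional wherever the
# default value is None.
def scale_tensors(tensors: Iterable[torch.Tensor],
                  factor: Optional[float] = None) -> None:
    for t in tensors:
        t.mul_(factor if factor is not None else 1.0)
```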
-
Funtowicz Morgan authored
* Warn the user that max_len is on the path to being deprecated.
* Ensure better backward compatibility when max_len is provided to a tokenizer.
* Make sure to override the parameter and not the actual instance value.
* Format & quality
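A sketch of the backward-compatible behaviour described above, assuming max_len still maps onto the tokenizer's max_len attribute; the warning text is paraphrased, not quoted:

```python
import warnings

from transformers import BertTokenizer

with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    tokenizer = BertTokenizer.from_pretrained("bert-base-uncased", max_len=512)

print(tokenizer.max_len)                 # 512: the kwarg is still honoured
print([str(w.message) for w in caught])  # now includes a deprecation notice
```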
-