"vscode:/vscode.git/clone" did not exist on "002915aa2ad7d7826294fabd7ba4e6297772768c"
- 30 Mar, 2022 1 commit
-
-
Stas Bekman authored
* [examples] max samples can't be bigger than then len of dataset * do tf and flax
-
- 01 Mar, 2022 1 commit
-
-
Joao Gante authored
-
- 12 Jan, 2022 1 commit
-
-
Russell Klopfer authored
* use block_size instead of max_seq_length * fixup * remove pad_to_block_size Co-authored-by:Russell Klopfer <russell@kloper.us>
-
- 06 Dec, 2021 1 commit
-
-
Julien Chaumond authored
* Replace outdated model tags with their now-canonical pipeline types * spam the CI till it's green
-
- 22 Nov, 2021 1 commit
-
-
Nicholas Broad authored
* remove sum for list flattening * change to chain(*) * make chain object a list * delete empty lines per sgugger's suggestions Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by:
Nicholas Broad <nicholas@nmbroad.com> Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
- 11 Nov, 2021 1 commit
-
-
Matt authored
* Fixing requirements for TF LM models and use correct model mappings * make style
-
- 21 Oct, 2021 1 commit
-
-
Christopher Akiki authored
-
- 31 Aug, 2021 1 commit
-
-
Matt authored
-
- 28 Aug, 2021 1 commit
-
-
Stefan Schweter authored
* examples: only use keep_linebreaks when reading TXT files for all CLM examples * examples: only use keep_linebreaks when reading TXT files for all CLM examples * examples: only use keep_linebreaks when reading TXT files for all CLM examples
-
- 27 Aug, 2021 1 commit
-
-
Stefan Schweter authored
* examples: add keep_linebreaks option to text dataset loader for all CLM examples * examples: introduce new keep_linebreaks option as data argument in CLM examples
-
- 28 Jul, 2021 1 commit
-
-
Elysium1436 authored
* Fixed train_test_split test_size argument * `Seq2SeqTrainer` set max_length and num_beams only when non None (#12899) * set max_length and num_beams only when non None * fix instance variables * fix code style * [FLAX] Minor fixes in CLM example (#12914) * readme: fix retrieval of vocab size for flax clm example * examples: fix flax clm example when using training/evaluation files * Fix module path for symbolic_trace example Co-authored-by:
cchen-dialpad <47165889+cchen-dialpad@users.noreply.github.com> Co-authored-by:
Stefan Schweter <stefan@schweter.it> Co-authored-by:
Sylvain Gugger <sylvain.gugger@gmail.com>
-
- 08 Jul, 2021 1 commit
-
-
Sylvain Gugger authored
-
- 01 Jul, 2021 1 commit
-
-
Souvic Chakraborty authored
* Validation split added: custom data files Validation split added in case of no validation file and loading custom data * Updated documentation with custom file usage Updated documentation with custom file usage * Update README.md * Update README.md * Update README.md * Made some suggested stylistic changes * Used logger instead of print. Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Made similar changes to add validation split In case of a missing validation file, a validation split will be used now. * max_train_samples to be used for training only max_train_samples got misplaced, now corrected so that it is applied on training data only, not whole data. * styled * changed ordering * Improved language of documentation Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Improved language of documentation Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Fixed styling issue * Update run_mlm.py Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
- 28 Jun, 2021 1 commit
-
-
Matt authored
* Tensorflow MLM example * Add CLM example * Style fixes, adding missing checkpoint code from the CLM example * Fix TPU training, avoid massive dataset warnings * Fix incorrect training length calculation for multi-GPU training * Fix incorrect training length calculation for multi-GPU training * Refactors and nitpicks from the review * Style pass * Adding README
-