- 09 May, 2023 2 commits
-
-
Sylvain Gugger authored
-
Sebastian authored
Call str on id to make sure any ints are converted into the expected format for squad datasets
-
- 05 May, 2023 1 commit
-
-
Ashwin Mathur authored
* Add run_mim_no_trainer.py draft from #20412 Add parse_args method and copy over other dependencies Add Method call for sending telemetry Initialize Accelerator Make one log on every process Set seed and Handle repository creation Initialize dataset and Set validation split Create Config Adapt Config Update Config Create Feature Extractor Create model Set column names Create transforms Create mask generator Create method to preprocess images Shuffle datasets if needed and set transforms Create Dataloaders Add optimizer Add learning rate scheduler Prepare everything with our accelerator Tie weights for TPU training Recalculate training steps and training epochs Set accelerator checkpointing steps Initialize trackers and store configuration Set total batch size Fix typo: mlm -> mim Log info at the start of training Load in the weights and states from previous save update the progress_bar if load from checkpoint Define train loop Add evaluation loop to training Add to parse_args method Push repo to hub Save accelerator state End training and save model and feature extractor Remove unused imports Fix trailing whitespace * Update code based on comments, Rename feature_extractor to image_processor * Fix linting * Add argument for learning rate * Add argument for setting number of training epochs * Remove incorrect logger argument * Convert max_train_steps to int for tqdm --------- Co-authored-by:Saad Mahmud <shuvro.mahmud79@gmail.com>
-
- 03 May, 2023 1 commit
-
-
Robert Stone authored
Migration to Evaluate for metric is not quite complete
-
- 02 May, 2023 1 commit
-
-
regisss authored
Save the tokenizer and image preprocessor after training a model with the contrastive image-text example (#23035) Save tokenizer and image preprocessor
-
- 13 Apr, 2023 1 commit
-
-
Sylvain Gugger authored
-
- 11 Apr, 2023 1 commit
-
-
Sylvain Gugger authored
* Replace -100s in predictions by the pad token * Style * Try to catch them all
-
- 05 Apr, 2023 1 commit
-
-
Mikel Penagarikano authored
* Update run_speech_recognition_ctc.py Make sure all processes wait until data is saved before loading the processor from the output_dit * Make sure all processes wait until data is saved before loading the processor from the output_dit * Update run_speech_recognition_ctc.py * Update run_speech_recognition_seq2seq.py
-
- 04 Apr, 2023 1 commit
-
-
Maziyar Panahi authored
Add id2label and label2id to config in run_xnil
-
- 29 Mar, 2023 1 commit
-
-
Sabine authored
-
- 23 Mar, 2023 1 commit
-
-
Sylvain authored
-
- 22 Mar, 2023 3 commits
-
-
Connor Henderson authored
allow only test_file in pytorch and flax summarization
-
Wang, Yi authored
* add low_cpu_mem_usage option in run_clm.py example which will benefit LLM loading Signed-off-by:
Wang, Yi A <yi.a.wang@intel.com> * update all the example and README under language-modeling Signed-off-by:
Wang, Yi A <yi.a.wang@intel.com> --------- Signed-off-by:
Wang, Yi A <yi.a.wang@intel.com>
-
jiqing-feng authored
-
- 14 Mar, 2023 1 commit
-
-
Sylvain Gugger authored
-
- 08 Mar, 2023 1 commit
-
-
bofeng huang authored
* Add specaugment to run_speech_recognition_seq2seq.py * Remove useless argument: text_column * Fix quality * Update return_attention_mask condition * Update specaugment arguments only for whisper models * Remove SpecAugment arguments from ModelArguments, only leave default values for simplicity * Apply suggestions from code review Co-authored-by:
Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> * Update apply_spec_augment only for whisper models * Apply suggestions from code review Co-authored-by:
Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> * Rename return_attention_mask to forward_attention_mask to avoid confusion with wav2vec2 models --------- Co-authored-by:
Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
-
- 27 Feb, 2023 1 commit
-
-
bofeng huang authored
* Override the decoding parameters of Seq2SeqTrainer * Fix quality * Fix max_length parameter * Fix quality * Remove redundant parameter max_length * Separate the preprocess of train and validation to use different max_target_length
-
- 24 Feb, 2023 1 commit
-
-
Sanchit Gandhi authored
* [Examples] Generalise run audio classification for log-mel models * batch feature extractor * make style
-
- 22 Feb, 2023 2 commits
-
-
Sylvain Gugger authored
* Respect documentation on passive log level * Fix test and set log level in examples * Add doc
-
Aaron Gokaslan authored
-
- 16 Feb, 2023 1 commit
-
-
regisss authored
-
- 13 Feb, 2023 1 commit
-
-
Warren Green authored
-
- 10 Feb, 2023 1 commit
-
-
steventk-g authored
Update run_mae.py
-
- 09 Feb, 2023 1 commit
-
-
lee1jun authored
Update run_speech_recognition_ctc.py There should be `# limitations under the License` line at the end of the documentation section.
-
- 08 Feb, 2023 1 commit
-
-
Stefan Schweter authored
docs: fix some references in PyTorch text classification readme
-
- 07 Feb, 2023 1 commit
-
-
Jeroen Van Der Donckt authored
-
- 06 Feb, 2023 2 commits
-
-
Sylvain Gugger authored
* Result of black 23.1 * Update target to Python 3.7 * Switch flake8 to ruff * Configure isort * Configure isort * Apply isort with line limit * Put the right black version * adapt black in check copies * Fix copies
-
Stas Bekman authored
-
- 31 Jan, 2023 1 commit
-
-
Quentin Lhoest authored
* simplify column_names in run_clm * simplify column_names in run_mlm * minor
-
- 30 Jan, 2023 1 commit
-
-
Stas Bekman authored
* [run_clm example] add streaming dataset support * unrefactor kwargs * fix * fix * require datasets>=2.0.0 * port to mlm
-
- 23 Jan, 2023 2 commits
-
-
Sylvain Gugger authored
-
Mostafa Elhoushi authored
-
- 19 Jan, 2023 1 commit
-
-
amyeroberts authored
* Update examples to use image processors * Small fixes * Resolve conflicts
-
- 18 Jan, 2023 1 commit
-
-
Sylvain Gugger authored
* Adapt repository creation to latest hf_hub * Update all examples * Fix other tests, add Flax examples * Address review comments
-
- 06 Jan, 2023 1 commit
-
-
Observer46 authored
fix args passed to predict function
-
- 05 Jan, 2023 2 commits
-
-
Roy Hvaara authored
[NumPy] Remove references to deprecated NumPy type aliases. This change replaces references to a number of deprecated NumPy type aliases (np.bool, np.int, np.float, np.complex, np.object, np.str) with their recommended replacement (bool, int, float, complex, object, str). NumPy 1.24 drops the deprecated aliases, so we must remove uses before updating NumPy. Co-authored-by:
Peter Hawkins <phawkins@google.com> Co-authored-by:
Peter Hawkins <phawkins@google.com>
-
Magnus Pierrau authored
Added mask_time_prob and mask_time_length arguments to wav2vec2 pretraining script and readme - new branch
-
- 03 Jan, 2023 1 commit
-
-
Wang, Yi authored
* [run_clm example] add torch_dtype option for model load. for BLOOM 175B model. peak memory will reduce about 350G for inference. the weight of BLOOM in model hub is bfloat16 Signed-off-by:
Wang, Yi A <yi.a.wang@intel.com> * add other type in option * fix style Signed-off-by:
Wang, Yi A <yi.a.wang@intel.com>
-
- 24 Dec, 2022 1 commit
-
-
Márton Makrai authored
-
- 21 Dec, 2022 1 commit
-
-
NielsRogge authored
Update big table Co-authored-by:Niels Rogge <nielsrogge@Nielss-MBP.localdomain>
-