- 20 Apr, 2023 1 commit
-
-
Sayak Paul authored
minor refactoring to allow compatible datasets to work.
-
- 17 Apr, 2023 2 commits
-
-
Zachary Mueller authored
Remove accelerate from tf
-
Matt authored
-
- 14 Apr, 2023 1 commit
-
-
Sayak Paul authored
* add: tokenizer training script for TF TPU LM training. * add: script for preparing the TFRecord shards. * add: sequence of execution to readme. * remove limit from the tfrecord shard name. * Add initial train_model.py * Add basic training arguments and model init * Get up to the point of writing the data collator * Pushing progress so far! * Complete first draft of model training code * feat: grouping of texts efficiently. Co-authored-by:
Matt <rocketknight1@gmail.com> * Add proper masking collator and get training loop working * fix: things. * Read sample counts from filenames * Read sample counts from filenames * Draft README * Improve TPU warning * Use distribute instead of distribute.experimental * Apply suggestions from code review Co-authored-by:
Matt <Rocketknight1@users.noreply.github.com> * Modularize loading and add MLM probability as arg * minor refactoring to better use the cli args. * readme fillup. * include tpu and inference sections in the readme. * table of contents. * parallelize maps. * polish readme. * change script name to run_mlm.py * address PR feedback (round I). --------- Co-authored-by:
Matt <rocketknight1@gmail.com> Co-authored-by:
Matt <Rocketknight1@users.noreply.github.com>
-
- 13 Apr, 2023 1 commit
-
-
Sylvain Gugger authored
-
- 24 Mar, 2023 2 commits
-
-
Joao Gante authored
-
Sylvain Gugger authored
* Pin tensorflow-text to go with tensorflow * Make it more convenient to pin TensorFlow * setup don't like f-strings
-
- 14 Mar, 2023 1 commit
-
-
Sylvain Gugger authored
-
- 07 Mar, 2023 1 commit
-
-
Matt authored
* Stop requiring Torch for our TF examples! * Slight tweak to logging in the example itself
-
- 06 Mar, 2023 1 commit
-
-
Matt authored
* Initial commit * stash commit * Add model checkpointing and pushing * Fix model name inference * Update README * Update README * Remove a couple of Torch references * Update copyright date * make fixup * Update PushToHubCallback args! * Remove the torch summary * Add strategy.scope
-
- 01 Mar, 2023 1 commit
-
-
Matt authored
* Add check for different embedding types in examples * Correctly update summarization example
-
- 22 Feb, 2023 1 commit
-
-
Aaron Gokaslan authored
-
- 06 Feb, 2023 1 commit
-
-
Sylvain Gugger authored
* Result of black 23.1 * Update target to Python 3.7 * Switch flake8 to ruff * Configure isort * Configure isort * Apply isort with line limit * Put the right black version * adapt black in check copies * Fix copies
-
- 01 Feb, 2023 1 commit
-
-
amyeroberts authored
* TF image classification script * Update requirements * Fix up * Add tests * Update test fetcher Co-authored-by:
Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Fix directory path * Adding `zero-shot-object-detection` pipeline doctest. (#20274) * Adding `zero-shot-object-detection` pipeline doctest. * Remove nested_simplify. * Add generate kwargs to `AutomaticSpeechRecognitionPipeline` (#20952) * Add generate kwargs to AutomaticSpeechRecognitionPipeline * Add test for generation kwargs * Trigger CI * Data collator returns np * Update feature extractor -> image processor * Bug fixes - updates to reflect changes in API * Update flags to match PT & run faster * Update instructions - Maria's comment * Update examples/tensorflow/image-classification/README.md * Remove slow decorator --------- Co-authored-by:
Nicolas Patry <patry.nicolas@protonmail.com> Co-authored-by:
bofeng huang <bofenghuang7@gmail.com> Co-authored-by:
Sylvain Gugger <Sylvain.gugger@gmail.com>
-
- 24 Jan, 2023 1 commit
-
-
Matt authored
Return NP instead of TF tensors for our data loading pipeline
-
- 23 Jan, 2023 1 commit
-
-
Sylvain Gugger authored
-
- 05 Jan, 2023 1 commit
-
-
Roy Hvaara authored
[NumPy] Remove references to deprecated NumPy type aliases. This change replaces references to a number of deprecated NumPy type aliases (np.bool, np.int, np.float, np.complex, np.object, np.str) with their recommended replacement (bool, int, float, complex, object, str). NumPy 1.24 drops the deprecated aliases, so we must remove uses before updating NumPy. Co-authored-by:
Peter Hawkins <phawkins@google.com> Co-authored-by:
Peter Hawkins <phawkins@google.com>
-
- 01 Dec, 2022 1 commit
-
-
Sylvain Gugger authored
-
- 18 Nov, 2022 1 commit
-
-
Sylvain Gugger authored
-
- 03 Nov, 2022 1 commit
-
-
Sylvain Gugger authored
* Only resize embeddings when necessary * Add comment
-
- 01 Nov, 2022 1 commit
-
-
Sylvain Gugger authored
-
- 10 Oct, 2022 1 commit
-
-
Lysandre authored
-
- 22 Sep, 2022 1 commit
-
-
Matt authored
-
- 14 Sep, 2022 1 commit
-
-
Lysandre authored
-
- 10 Aug, 2022 1 commit
-
-
Matt authored
* Finished QA example * Dodge a merge conflict * Update text classification and LM examples * Update NER example * New Keras metrics WIP, fix NER example * Update NER example * Update MC, summarization and translation examples * Add XLA warnings when shapes are variable * Make sure batch_size is consistently scaled by num_replicas * Add PushToHubCallback to all models * Add docs links for KerasMetricCallback * Add docs links for prepare_tf_dataset and jit_compile * Correct inferred model names * Don't assume the dataset has 'lang' * Don't assume the dataset has 'lang' * Write metrics in text classification * Add 'framework' to TrainingArguments and TFTrainingArguments * Export metrics in all examples and add tests * Fix training args for Flax * Update command line args for translation test * make fixup * Fix accidentally running other tests in fp16 * Remove do_train/do_eval from run_clm.py * Remove do_train/do_eval from run_mlm.py * Add tensorflow tests to circleci * Fix circleci * Update examples/tensorflow/language-modeling/run_mlm.py Co-authored-by:
Joao Gante <joaofranciscocardosogante@gmail.com> * Update examples/tensorflow/test_tensorflow_examples.py Co-authored-by:
Joao Gante <joaofranciscocardosogante@gmail.com> * Update examples/tensorflow/translation/run_translation.py Co-authored-by:
Joao Gante <joaofranciscocardosogante@gmail.com> * Update examples/tensorflow/token-classification/run_ner.py Co-authored-by:
Joao Gante <joaofranciscocardosogante@gmail.com> * Fix save path for tests * Fix some model card kwargs * Explain the magical -1000 * Actually enable tests this time * Skip text classification PR until we fix shape inference * make fixup Co-authored-by:
Joao Gante <joaofranciscocardosogante@gmail.com>
-
- 06 Aug, 2022 2 commits
-
-
Julien Chaumond authored
* zero chance anyone's using that constant no? * `transformers-cli login` => `huggingface-cli login` * `transformers-cli repo create` => `huggingface-cli repo create` * `make style`
-
Julien Chaumond authored
* Delete valohai.yaml * NLP => ML * typo * website supports https * datasets * 60k + modalities * unrelated link fixing for accelerate * Ok those links were actually broken * Fix link * Make `AutoTokenizer` auto-link * wording tweak * add at least one non-nlp task
-
- 01 Aug, 2022 1 commit
-
-
Sylvain Gugger authored
* Fix ROUGE add example check and update README * Stay consistent in values
-
- 29 Jul, 2022 1 commit
-
-
Sylvain Gugger authored
* Preliminary work on tokenizers * Quality + fix tests * Treat processors * Fix pad * Remove all uses of in tests, docs and examples * Replace all as_target_tokenizer * Fix tests * Fix quality * Update examples/flax/image-captioning/run_image_captioning_flax.py Co-authored-by:
amyeroberts <amy@huggingface.co> * Style Co-authored-by:
amyeroberts <amy@huggingface.co>
-
- 28 Jul, 2022 1 commit
-
-
Vijay S Kalmath authored
* Migrate metric to Evaluate library in tf examples Currently tensorflow examples use `load_metric` function from Datasets library , commit migrates function call to `load` function to Evaluate library. Fix for #18306 * Migrate metric to Evaluate library in tf examples Currently tensorflow examples use `load_metric` function from Datasets library , commit migrates function call to `load` function to Evaluate library. Fix for #18306 * Migrate `metric` to Evaluate for all tf examples Currently tensorflow examples use `load_metric` function from Datasets library , commit migrates function call to `load` function to Evaluate library.
-
- 27 Jul, 2022 1 commit
-
-
Lysandre authored
-
- 13 Jul, 2022 1 commit
-
-
John Giorgi authored
* Add summarization name mapping for MultiNews * Add summarization name mapping for MultiNews
-
- 16 Jun, 2022 1 commit
-
-
Sylvain Gugger authored
-
- 07 Jun, 2022 1 commit
-
-
Sylvain Gugger authored
* Add examples telemetry * Alternative approach * Add to all other examples * Add to templates as well * Put framework separately * Same for TensorFlow
-
- 12 May, 2022 2 commits
-
-
Sylvain Gugger authored
* Black preview * Fixup too! * Fix check copies * Use the same version as the CI * Bump black
-
Lysandre Debut authored
-
- 27 Apr, 2022 1 commit
-
-
Leonid Boytsov authored
1. Fixes evaluation errors popping up when you train/eval on squad v2 (one was newly encountered and one that was previously reported Running SQuAD 1.0 sample command raises IndexError #15401 but not completely fixed). 2. Removes boolean arguments that don't use store_true. Please, don't use these: *ANY non-empty string is being converted to True in this case and this clearly is not the desired behavior (and it creates a LOT of confusion). 3. All no-trainer test scripts are now saving metric values in the same way (with the right prefix eval_), which is consistent with the trainer-based versions. 4. Adds forgotten model.eval() in the no-trainer versions. This improved some results, but not everything (see the discussion in the end). Please, see the F1 scores and the discussion below.
-
- 19 Apr, 2022 1 commit
-
-
Wonjae Kim authored
-
- 06 Apr, 2022 1 commit
-
-
Lysandre Debut authored
-
- 04 Apr, 2022 1 commit
-
-
Karim Foda authored
* fix formatting and remove use_auth * Add use_auth_token to Flax examples
-