1. 18 Apr, 2024 1 commit
  2. 17 Apr, 2024 1 commit
• Upgrading to tokenizers 0.19.0 (#30289) · 8e5f76f5
      Nicolas Patry authored
      * [DO NOT MERGE] Testing tokenizers 0.19.0rc0
      
      * Accounting for the breaking change.
      
      * Ruff.
      
* Upgrading to tokenizers `0.19` (new release with `prepend_scheme` fixed
and a new surface for the BPE tiktoken bug).
  3. 20 Mar, 2024 1 commit
  4. 12 Mar, 2024 1 commit
  5. 21 Feb, 2024 1 commit
  6. 19 Feb, 2024 1 commit
  7. 16 Feb, 2024 1 commit
  8. 02 Feb, 2024 1 commit
• [Docs] Fix spelling and grammar mistakes (#28825) · 721ee783
      Klaus Hipp authored
      * Fix typos and grammar mistakes in docs and examples
      
      * Fix typos in docstrings and comments
      
      * Fix spelling of `tokenizer` in model tests
      
      * Remove erroneous spaces in decorators
      
      * Remove extra spaces in Markdown link texts
  9. 01 Feb, 2024 1 commit
  10. 29 Jan, 2024 1 commit
  11. 19 Jan, 2024 1 commit
  12. 15 Jan, 2024 1 commit
  13. 11 Jan, 2024 1 commit
• Set `cache_dir` for `evaluate.load()` in example scripts (#28422) · 95091e15
      Alex Hedges authored
      While using `run_clm.py`,[^1] I noticed that some files were being added
      to my global cache, not the local cache. I set the `cache_dir` parameter
      for the one call to `evaluate.load()`, which partially solved the
      problem. I figured that while I was fixing the one script upstream, I
      might as well fix the problem in all other example scripts that I could.
      
      There are still some files being added to my global cache, but this
      appears to be a bug in `evaluate` itself. This commit at least moves
      some of the files into the local cache, which is better than before.
      
      To create this PR, I made the following regex-based transformation:
      `evaluate\.load\((.*?)\)` -> `evaluate\.load\($1,
      cache_dir=model_args.cache_dir\)`. After using that, I manually fixed
      all modified files with `ruff` serving as useful guidance. During the
      process, I removed one existing usage of the `cache_dir` parameter in a
      script that did not have a corresponding `--cache-dir` argument
      declared.
      
      [^1]: I specifically used `pytorch/language-modeling/run_clm.py` from
      v4.34.1 of the library. For the original code, see the following URL:
      https://github.com/huggingface/transformers/tree/acc394c4f5e1283c19783581790b3dc3105a3697/examples/pytorch/language-modeling/run_clm.py.
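The regex-based transformation described above can be sketched with `re.sub`; the surrounding source line is invented for illustration, and note that in a Python replacement string the captured arguments are referenced as `\1` rather than `$1`:

```python
import re

# Capture the existing arguments to evaluate.load(...) in group 1,
# then re-emit the call with cache_dir appended.
pattern = r"evaluate\.load\((.*?)\)"
replacement = r"evaluate.load(\1, cache_dir=model_args.cache_dir)"

source = 'metric = evaluate.load("accuracy")'
print(re.sub(pattern, replacement, source))
# -> metric = evaluate.load("accuracy", cache_dir=model_args.cache_dir)
```

As the commit notes, a mechanical rewrite like this still needs a manual pass afterwards (e.g. for calls spanning multiple lines), which is where `ruff` served as guidance.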
  14. 13 Dec, 2023 1 commit
  15. 12 Dec, 2023 1 commit
  16. 07 Dec, 2023 1 commit
  17. 17 Nov, 2023 1 commit
  18. 16 Nov, 2023 2 commits
• [`Styling`] stylify using ruff (#27144) · 651408a0
      Arthur authored
      
      
      * try to stylify using ruff
      
      * might need to remove these changes?
      
* use ruff format and ruff check
      
* use isinstance instead of type comparison
      
      * use # fmt: skip
      
      * use # fmt: skip
      
      * nits
      
* some styling changes
      
      * update ci job
      
      * nits isinstance
      
      * more files update
      
      * nits
      
      * more nits
      
      * small nits
      
      * check and format
      
      * revert wrong changes
      
      * actually use formatter instead of checker
      
      * nits
      
      * well docbuilder is overwriting this commit
      
      * revert notebook changes
      
      * try to nuke docbuilder
      
      * style
      
* fix feature extraction test
      
* remove `indent-width = 4`
      
      * fixup
      
      * more nits
      
      * update the ruff version that we use
      
      * style
      
      * nuke docbuilder styling
      
* leave the print for detected changes
      
      * nits
      
      * Remove file I/O
Co-authored-by: charliermarsh <charlie.r.marsh@gmail.com>
      
      * style
      
      * nits
      
      * revert notebook changes
      
      * Add # fmt skip when possible
      
      * Add # fmt skip when possible
      
      * Fix
      
      * More `  # fmt: skip` usage
      
      * More `  # fmt: skip` usage
      
      * More `  # fmt: skip` usage
      
* Nits
      
      * more fixes
      
      * fix tapas
      
      * Another way to skip
      
      * Recommended way
      
* Fix two more files
      
* Remove asynch
      
      ---------
Co-authored-by: charliermarsh <charlie.r.marsh@gmail.com>
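Several of the steps above migrate toward the per-statement `# fmt: skip` directive, which tells `ruff format` (and black) to leave a statement's manual layout untouched instead of disabling formatting over a whole region with `# fmt: off`/`# fmt: on`. A minimal sketch, with an invented mapping:

```python
# Without the trailing directive, a formatter would collapse the extra
# alignment spaces in this literal; `# fmt: skip` preserves them.
LABEL_TO_ID = {"negative": 0, "neutral":  1, "positive": 2}  # fmt: skip

print(LABEL_TO_ID["positive"])
# -> 2
```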
• Phuc Van Phan authored
  19. 15 Nov, 2023 1 commit
  20. 02 Nov, 2023 1 commit
  21. 27 Oct, 2023 1 commit
  22. 12 Oct, 2023 1 commit
  23. 10 Oct, 2023 1 commit
  24. 03 Oct, 2023 1 commit
  25. 29 Sep, 2023 1 commit
• [Flax Examples] Seq2Seq ASR Fine-Tuning Script (#21764) · 68e85fc8
      Sanchit Gandhi authored
      * from seq2seq speech
      
      * [Flax] Example script for speech seq2seq
      
      * tests and fixes
      
      * make style
      
      * fix: label padding tokens
      
      * fix: label padding tokens over list
      
      * update ln names for Whisper
      
      * try datasets iter loader
      
      * create readme and append results
      
      * style
      
      * make style
      
      * adjust lr
      
      * use pt dataloader
      
      * make fast
      
      * pin gen max len
      
      * finish
      
      * add pt to requirements for test
      
      * fix pt -> torch
      
      * add accelerate
  26. 22 Sep, 2023 1 commit
  27. 18 Sep, 2023 1 commit
  28. 11 Sep, 2023 2 commits
  29. 04 Sep, 2023 1 commit
  30. 21 Aug, 2023 1 commit
  31. 07 Aug, 2023 1 commit
• Allow `trust_remote_code` in example scripts (#25248) · 14510938
      Jackmin801 authored
      * pytorch examples
      
      * pytorch mim no trainer
      
      * cookiecutter
      
      * flax examples
      
      * missed line in pytorch run_glue
      
      * tensorflow examples
      
      * tensorflow run_clip
      
      * tensorflow run_mlm
      
      * tensorflow run_ner
      
      * tensorflow run_clm
      
      * pytorch example from_configs
      
      * pytorch no trainer examples
      
      * Revert "tensorflow run_clip"
      
      This reverts commit 261f86ac1f1c9e05dd3fd0291e1a1f8e573781d5.
      
      * fix: duplicated argument
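The pattern added across the scripts above is a CLI flag that is forwarded to the model-loading call. A hedged sketch with stdlib `argparse` (the actual scripts use `HfArgumentParser` dataclass fields, and the model call at the end is shown only as a comment):

```python
import argparse

parser = argparse.ArgumentParser()
parser.add_argument(
    "--trust_remote_code",
    action="store_true",
    help="Allow executing custom modeling code downloaded from the Hub.",
)

# Simulate invoking a script with the flag set.
args = parser.parse_args(["--trust_remote_code"])
print(args.trust_remote_code)
# -> True

# The value would then be forwarded to loading calls, e.g.:
# AutoModel.from_pretrained(model_name, trust_remote_code=args.trust_remote_code)
```

Defaulting the flag to off (`store_true`) matters here: executing remote code must be an explicit opt-in by the user.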
  32. 02 Aug, 2023 1 commit
  33. 28 Jul, 2023 2 commits
  34. 17 Jul, 2023 1 commit
  35. 12 Jul, 2023 1 commit
  36. 07 Jun, 2023 1 commit
  37. 22 May, 2023 1 commit