- 08 May, 2023 1 commit
-
-
Sylvain Gugger authored
-
- 04 May, 2023 1 commit
-
-
Sylvain Gugger authored
-
- 03 May, 2023 1 commit
-
-
Sylvain Gugger authored
-
- 20 Apr, 2023 1 commit
-
-
amyeroberts authored
* Pin optax version * Pin flax too * Fixup
-
- 18 Apr, 2023 1 commit
-
-
Zachary Mueller authored
-
- 17 Apr, 2023 1 commit
-
-
Zachary Mueller authored
* Use accelerate for device management * Add accelerate to setup Co-authored-by:Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
- 13 Apr, 2023 1 commit
-
-
Sylvain Gugger authored
-
- 07 Apr, 2023 1 commit
-
-
Sylvain Gugger authored
-
- 06 Apr, 2023 1 commit
-
-
Nicolas Patry authored
* Adding Llama FastTokenizer support. - Requires https://github.com/huggingface/tokenizers/pull/1183 version - Only support byte_fallback for llama, raise otherwise (safety net). - Lots of questions are special tokens How to test: ```python from transformers.convert_slow_tokenizer import convert_slow_tokenizer from transformers import AutoTokenizer from tokenizers import Tokenizer tokenizer = AutoTokenizer.from_pretrained("huggingface/llama-7b") if False: new_tokenizer = Tokenizer.from_file("tok.json") else: new_tokenizer = convert_slow_tokenizer(tokenizer) new_tokenizer.save("tok.json") strings = [ "This is a test", "生活的真谛是", "生活的真谛是[MASK]。", # XXX: This one is problematic because of special tokens # "<s> Something something", ] for string in strings: encoded = tokenizer(string)["input_ids"] encoded2 = new_tokenizer.encode(string).ids assert encoded == encoded2, f"{encoded} != {encoded2}" decoded = tokenizer.decode(encoded) decoded2 = new_tokenizer.decode(encoded2) assert decoded.strip() == decoded2, f"{repr(decoded)} != {repr(decoded2)}" ``` The converter + some test script. The test script. Tmp save. Adding Fast tokenizer + tests. Adding the tokenization tests. Correct combination. Small fix. Fixing tests. Fixing with latest update. Rebased. fix copies + normalized added tokens + copies. Adding doc. TMP. Doc + split files. Doc. Versions + try import. Fix Camembert + warnings -> Error. Fix by ArthurZucker. Not a decorator. * Fixing comments. * Adding more to docstring. * Doc rewriting.
-
- 03 Apr, 2023 2 commits
-
-
Xuehai Pan authored
* [setup] migrate setup script to `pyproject.toml` * [setup] cleanup configurations * remove unused imports
-
Xuehai Pan authored
* [setup] drop deprecated `distutils` usage * drop deprecated `distutils.util.strtobool` usage * fix import order * reformat docstring by `doc-builder`
-
- 29 Mar, 2023 2 commits
-
-
Sylvain Gugger authored
-
Sylvain Gugger authored
-
- 24 Mar, 2023 2 commits
-
-
Joao Gante authored
-
Sylvain Gugger authored
* Pin tensorflow-text to go with tensorflow * Make it more convenient to pin TensorFlow * setup don't like f-strings
-
- 22 Mar, 2023 1 commit
-
-
Stas Bekman authored
* [deepspeed] offload + non-cpuadam optimizer exception doc * deps
-
- 21 Mar, 2023 2 commits
-
-
Ali Hassani authored
-
Yih-Dar authored
* time to say goodbye, torch 1.7 and 1.8 * clean up torch_int_div * clean up is_torch_less_than_1_8-9 * update --------- Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
- 17 Mar, 2023 1 commit
-
-
Ali Hassani authored
* Add kernel size to NATTEN's QK arguments. The new NATTEN 0.14.5 supports PyTorch 2.0, but also adds an additional argument to the QK operation to allow optional RPBs. This ends up failing NATTEN tests. This commit adds NATTEN back to circleci and adds the arguments to get it working again. * Force NATTEN >= 0.14.5
-
- 14 Mar, 2023 1 commit
-
-
Sylvain Gugger authored
-
- 02 Mar, 2023 1 commit
-
-
amyeroberts authored
* Use PyAV instead of Decord * Get frame indices * Fix number of frames * Update src/transformers/models/videomae/image_processing_videomae.py * Fix up * Fix copies * Update timesformer doctests * Update docstrings
-
- 16 Feb, 2023 1 commit
-
-
Sylvain Gugger authored
-
- 13 Feb, 2023 1 commit
-
-
Stas Bekman authored
* Update setup.py * suggestions
-
- 09 Feb, 2023 1 commit
-
-
Sylvain Gugger authored
* Fix inclusion of non py files in package * No need for the **
-
- 06 Feb, 2023 1 commit
-
-
Sylvain Gugger authored
* Result of black 23.1 * Update target to Python 3.7 * Switch flake8 to ruff * Configure isort * Configure isort * Apply isort with line limit * Put the right black version * adapt black in check copies * Fix copies
-
- 31 Jan, 2023 1 commit
-
-
NielsRogge authored
* First draft * Add initial draft of conversion script * Convert all weights * Fix config * Add image processor * Fix DetaImageProcessor * Run make fix copies * Remove timm dependency * Fix dummy objects * Improve loss function * Remove conv_encoder attribute * Update conversion scripts * Improve postprocessing + docs * Fix copied from statements * Add tests * Improve postprocessing * Improve postprocessing * Update READMEs * More improvements * Fix rebase * Add is_torchvision_available * Add torchvision dependency * Fix typo and README * Fix bug * Add copied from * Fix style * Apply suggestions * Fix thanks to @ydshieh * Fix another dependency check * Simplify image processor * Add scipy * Improve code * Add threshold argument * Fix bug * Set default threshold * Improve integration test * Add another integration test * Update setup.py * Address review * Improve deformable attention function * Improve copied from * Use relative imports * Address review * Replace assertions * Address review * Update dummies * Remove dummies * Address comments, update READMEs * Remove custom kernel code * Add image processor tests * Add requires_backends * Add minor comment * Update scripts * Update organization name * Fix defaults, add doc tests * Add id2label for object 365 * Fix tests * Update task guide
-
- 30 Jan, 2023 1 commit
-
-
Sylvain Gugger authored
-
- 23 Jan, 2023 1 commit
-
-
Sylvain Gugger authored
-
- 18 Jan, 2023 1 commit
-
-
Sylvain Gugger authored
* Adapt repository creation to latest hf_hub * Update all examples * Fix other tests, add Flax examples * Address review comments
-
- 31 Dec, 2022 1 commit
-
-
Hao Wang authored
* update pyknp to rhoknp * fix linter * fix linter * fix linter * fix linter * fix linter * support rhoknp==1.1.0, fix testcase
-
- 16 Dec, 2022 1 commit
-
-
Yih-Dar authored
* Install sentencepiece in DS CI image * update Co-authored-by:ydshieh <ydshieh@users.noreply.github.com>
-
- 08 Dec, 2022 1 commit
-
-
Nathan Raw authored
*
🚧 wip video classification pipeline *🚧 wip - add is_decord_available check *🐛 add missing import *✅ add tests *🔧 add decord to setup extras *🚧 add is_decord_available *✨ add video-classification pipeline *📝 add video classification pipe to docs *🐛 add missing VideoClassificationPipeline import *📌 add decord install in test runner *✅ fix url inputs to video-classification pipeline *✨ updates from review *📝 add video cls pipeline to docs *📝 add docstring *🔥 remove unused import *🔥 remove some code *📝 docfix
-
- 06 Dec, 2022 1 commit
-
-
Sylvain Gugger authored
-
- 01 Dec, 2022 1 commit
-
-
Sylvain Gugger authored
-
- 29 Nov, 2022 1 commit
-
-
Pi Esposito authored
* add minimal working gpt2 tokenizer * graph mode and output equivalence tests working * not today tensorflow. serialization test passing! * fix style, documentation, docstrings and all that jazz * passing consistency checks * move keras nlp to tf dependencies * fix tf modeling utils and gpt2 attention to enable compiling * fix (I hope) keras nlp dependencies * rever changes on generation * remove debug prints * remove redundant tf dummy objects * add from config, get config and max length settings to address review * let flake ignore the error on distillation you are welcome * test from config * add padding test * address sgugger review
-
- 18 Nov, 2022 4 commits
-
-
Ali Hassani authored
* Add DiNAT * Adds DiNAT + tests * Minor fixes * Added HF model * Add natten to dependencies. * Cleanup * Minor fixup * Reformat * Optional NATTEN import. * Reformat & add doc to _toctree * Reformat (finally) * Dummy objects for DiNAT * Add NAT + minor changes Adds NAT as its own independent model + docs, tests Adds NATTEN to ext deps to ensure ci picks it up. * Remove natten from `all` and `dev-torch` deps, add manual pip install to ci tests * Minor fixes. * Fix READMEs. * Requested changes to docs + minor fixes. * Requested changes. * Add NAT/DiNAT tests to layoutlm_job * Correction to Dinat doc. * Requested changes.
-
Sylvain Gugger authored
-
Sylvain Gugger authored
-
Sylvain Gugger authored
-
- 15 Nov, 2022 1 commit
-
-
Sylvain Gugger authored
* Try PT1.13 by removing torch scatter * Skip failing tests * Style * Remvoe testing extras for repo utils * Try with all decorators * Try to wipe the cache * Fix all tests? * Try this way * Fix comma * Update to main * Try with less deps * Quality
-