1. 29 Jun, 2023 1 commit
  2. 26 Jun, 2023 1 commit
  3. 19 Jun, 2023 1 commit
  4. 13 Jun, 2023 1 commit
  5. 08 Jun, 2023 1 commit
  6. 23 May, 2023 1 commit
    • Matt's avatar
      TF version compatibility fixes (#23663) · 876d9a32
      Matt authored
      * New TF version compatibility fixes
      
      * Remove dummy print statement, move expand_1d
      
      * Make a proper framework inference function
      
      * Make a proper framework inference function
      
      * ValueError -> TypeError
      876d9a32
  7. 11 May, 2023 1 commit
    • Alessandro Pietro Bardelli's avatar
      Better check for packages availability (#23163) · 83eda643
      Alessandro Pietro Bardelli authored
      * Better check for packages availability
      
      * amend _optimumneuron_available
      
      * amend torch_version
      
      * amend PIL detection and lint
      
      * lint
      
      * amend _faiss_available
      
      * remove overloaded signatures of _is_package_available
      
      * fix sklearn and decord detection
      
      * remove unused checks
      
      * revert
      83eda643
  8. 10 May, 2023 1 commit
    • Sylvain Gugger's avatar
      Test composition (#23214) · 33357243
      Sylvain Gugger authored
      
      
      * Remove nestedness in tool config
      
      * Really do it
      
      * Use remote tools descriptions
      
      * Work
      
      * Clean up eval
      
      * Changes
      
      * Tools
      
      * Tools
      
      * tool
      
      * Fix everything
      
      * Use last result/assign for evaluation
      
      * Prompt
      
      * Remove hardcoded selection
      
      * Evaluation for chat agents
      
      * correct some spelling
      
      * Small fixes
      
      * Change summarization model (#23172)
      
      * Fix link displayed
      
      * Update description of the tool
      
      * Fixes in chat prompt
      
      * Custom tools, custom prompt
      
      * Tool clean up
      
      * save_pretrained and push_to_hub for tool
      
      * Fix init
      
      * Tests
      
      * Fix tests
      
      * Tool save/from_hub/push_to_hub and tool->load_tool
      
      * Clean push_to_hub and add app file
      
      * Custom inference API for endpoints too
      
      * Clean up
      
      * old remote tool and new remote tool
      
      * Make a requirements
      
      * return_code adds tool creation
      
      * Avoid redundancy between global variables
      
      * Remote tools can be loaded
      
      * Tests
      
      * Text summarization tests
      
      * Quality
      
      * Properly mark tests
      
      * Test the python interpreter
      
      * And the CI shall be green.
      
      * fix loading of additional tools
      
      * Work on RemoteTool and fix tests
      
      * General clean up
      
      * Guard imports
      
      * Fix tools
      
      * docs: Fix broken link in 'How to add a model...'  (#23216)
      
      fix link
      
      * Get default endpoint from the Hub
      
      * Add guide
      
      * Simplify tool config
      
      * Docs
      
      * Some fixes
      
      * Docs
      
      * Docs
      
      * Docs
      
      * Fix code returned by agent
      
      * Try this
      
      * Match args with signature in remote tool
      
      * Should fix python interpreter for Python 3.8
      
      * Fix push_to_hub for tools
      
      * Other fixes to push_to_hub
      
      * Add API doc page
      
      * Docs
      
      * Docs
      
      * Custom tools
      
      * Pin tensorflow-probability (#23220)
      
      * Pin tensorflow-probability
      
      * [all-test]
      
      * [all-test] Fix syntax for bash
      
      * PoC for some chaining API
      
      * Text to speech
      
      * J'ai pris des libertés
      
      * Rename
      
      * Basic python interpreter
      
      * Add agents
      
      * Quality
      
      * Add translation tool
      
      * temp
      
      * GenQA + LID + S2T
      
      * Quality + word missing in translation
      
      * Add open assistance, support f-strings in evaluate
      
      * captioning + s2t fixes
      
      * Style
      
      * Refactor descriptions and remove chain
      
      * Support errors and rename OpenAssistantAgent
      
      * Add setup
      
      * Deal with typos + example of inference API
      
      * Some rename + README
      
      * Fixes
      
      * Update prompt
      
      * Unwanted change
      
      * Make sure everyone has a default
      
      * One prompt to rule them all.
      
      * SD
      
      * Description
      
      * Clean up remote tools
      
      * More remote tools
      
      * Add option to return code and update doc
      
      * Image segmentation
      
      * ControlNet
      
      * Gradio demo
      
      * Diffusers protection
      
      * Lib protection
      
      * ControlNet description
      
      * Cleanup
      
      * Style
      
      * Remove accelerate and try to be reproducible
      
      * No randomness
      
      * Male Basic optional in token
      
      * Clean description
      
      * Better prompts
      
      * Fix args eval in interpreter
      
      * Add tool wrapper
      
      * Tool on the Hub
      
      * Style post-rebase
      
      * Big refactor of descriptions, batch generation and evaluation for agents
      
      * Make problems easier - interface to debug
      
      * More problems, add python primitives
      
      * Back to one prompt
      
      * Remove dict for translation
      
      * Be consistent
      
      * Add prompts
      
      * New version of the agent
      
      * Evaluate new agents
      
      * New endpoints agents
      
      * Make all tools a dict variable
      
      * Typo
      
      * Add problems
      
      * Add to big prompt
      
      * Harmonize
      
      * Add tools
      
      * New evaluation
      
      * Add more tools
      
      * Build prompt with tools descriptions
      
      * Tools on the Hub
      
      * Let's chat!
      
      * Cleanup
      
      * Temporary bs4 safeguard
      
      * Cache agents and clean up
      
      * Blank init
      
      * Fix evaluation for agents
      
      * New format for tools on the Hub
      
      * Add method to reset state
      
      * Remove nestedness in tool config
      
      * Really do it
      
      * Use remote tools descriptions
      
      * Work
      
      * Clean up eval
      
      * Changes
      
      * Tools
      
      * Tools
      
      * tool
      
      * Fix everything
      
      * Use last result/assign for evaluation
      
      * Prompt
      
      * Remove hardcoded selection
      
      * Evaluation for chat agents
      
      * correct some spelling
      
      * Small fixes
      
      * Change summarization model (#23172)
      
      * Fix link displayed
      
      * Update description of the tool
      
      * Fixes in chat prompt
      
      * Custom tools, custom prompt
      
      * Tool clean up
      
      * save_pretrained and push_to_hub for tool
      
      * Fix init
      
      * Tests
      
      * Fix tests
      
      * Tool save/from_hub/push_to_hub and tool->load_tool
      
      * Clean push_to_hub and add app file
      
      * Custom inference API for endpoints too
      
      * Clean up
      
      * old remote tool and new remote tool
      
      * Make a requirements
      
      * return_code adds tool creation
      
      * Avoid redundancy between global variables
      
      * Remote tools can be loaded
      
      * Tests
      
      * Text summarization tests
      
      * Quality
      
      * Properly mark tests
      
      * Test the python interpreter
      
      * And the CI shall be green.
      
      * Work on RemoteTool and fix tests
      
      * fix loading of additional tools
      
      * General clean up
      
      * Guard imports
      
      * Fix tools
      
      * Get default endpoint from the Hub
      
      * Simplify tool config
      
      * Add guide
      
      * Docs
      
      * Some fixes
      
      * Docs
      
      * Docs
      
      * Fix code returned by agent
      
      * Try this
      
      * Docs
      
      * Match args with signature in remote tool
      
      * Should fix python interpreter for Python 3.8
      
      * Fix push_to_hub for tools
      
      * Other fixes to push_to_hub
      
      * Add API doc page
      
      * Fixes
      
      * Doc fixes
      
      * Docs
      
      * Fix audio
      
      * Custom tools
      
      * Audio fix
      
      * Improve custom tools docstring
      
      * Docstrings
      
      * Trigger CI
      
      * Mode docstrings
      
      * More docstrings
      
      * Improve custom tools
      
      * Fix for remote tools
      
      * Style
      
      * Fix repo consistency
      
      * Quality
      
      * Tip
      
      * Cleanup on doc
      
      * Cleanup toc
      
      * Add disclaimer for starcoder vs openai
      
      * Remove disclaimer
      
      * Small fixed in the prompts
      
      * 4.29
      
      * Update src/transformers/tools/agents.py
      Co-authored-by: default avatarLysandre Debut <lysandre.debut@reseau.eseo.fr>
      
      * Complete documentation
      
      * Small fixes
      
      * Agent evaluation
      
      * Note about gradio-tools & LC
      
      * Clean up agents and prompt
      
      * Apply suggestions from code review
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      
      * Apply suggestions from code review
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      
      * Note about gradio-tools & LC
      
      * Add copyrights and address review comments
      
      * Quality
      
      * Add all language codes
      
      * Add remote tool tests
      
      * Move custom prompts to other docs
      
      * Apply suggestions from code review
      Co-authored-by: default avataramyeroberts <22614925+amyeroberts@users.noreply.github.com>
      
      * TTS tests
      
      * Quality
      
      ---------
      Co-authored-by: default avatarLysandre <hi@lyand.re>
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      Co-authored-by: default avatarPhilipp Schmid <32632186+philschmid@users.noreply.github.com>
      Co-authored-by: default avatarConnor Henderson <connor.henderson@talkiatry.com>
      Co-authored-by: default avatarLysandre <lysandre.debut@reseau.eseo.fr>
      Co-authored-by: default avatarLysandre <lysandre@huggingface.co>
      Co-authored-by: default avataramyeroberts <22614925+amyeroberts@users.noreply.github.com>
      33357243
  9. 09 May, 2023 2 commits
    • Sylvain Gugger's avatar
      Revert "[Doctests] Refactor doctests + add CI" (#23245) · 69ee4624
      Sylvain Gugger authored
      Revert "[Doctests] Refactor doctests + add CI (#22987)"
      
      This reverts commit 627f4479.
      69ee4624
    • Arthur's avatar
      [Doctests] Refactor doctests + add CI (#22987) · 627f4479
      Arthur authored
      
      
      * intiial commit
      
      * new styling
      
      * update
      
      * just run doctest in CI
      
      * remove more test for fast dev
      
      * update
      
      * update refs
      
      * update path and fetch upstream
      
      * update documentatyion trests
      
      * typo
      
      * parse pwd
      
      * don't check for files that are in hidden folders
      
      * just give paths relative to transformers
      
      * update
      
      * update
      
      * update
      
      * major refactoring
      
      * make sure options is ok
      
      * lest test that mdx is tested
      
      * doctest glob
      
      * nits
      
      * update doctest nightly
      
      * some cleaning
      
      * run correct test on diff
      
      * debug
      
      * run on a single worker
      
      * skip_cuda_test tampkate
      
      * updates
      
      * add rA and continue on failure
      
      * test options
      
      * parse `py` codeblock?
      
      * we don't need to replace ignore results, don't remember whyu I put it
      
      * cleanup
      
      * more cleaning
      
      * fix arg
      
      * more cleaning
      
      * clean an todo
      
      * more pre-processing
      
      * doctest-module has none so extra `- ` is needed
      
      * remove logs
      
      * nits
      
      * doctest-modules ....
      
      * oups
      
      * let's use sugar
      
      * make dataset go quiet
      
      * add proper timeout
      
      * nites
      
      * spleling timeout
      
      * update
      
      * properly skip tests that have CUDSA
      
      * proper skipping
      
      * cleaning main and get tests to run
      
      * remove make report?
      
      * remove tee
      
      * some updates
      
      * tee was removed but is the full output still available?
      
      * [all-test]
      
      * only our tests
      
      * don't  touch tee in this PR
      
      * no atee-sys
      
      * proper sub
      
      * monkey
      
      * only replace call
      
      * fix sub
      
      * nits
      
      * nits
      
      * fix invalid syntax
      
      * add skip cuda doctest env variable
      
      * make sure all packages are installed
      
      * move file
      
      * update check repo
      
      * revert changes
      
      * nit
      
      * finish cleanup
      
      * fix re
      
      * findall
      
      * update don't test init files
      
      * ignore pycache
      
      * `-ignore-pycache` when running pytests
      
      * try to fix the import missmatch error
      
      * install dec
      
      * pytest is required as doctest_utils imports things from it
      
      * the only log issues were dataset, ignore results should work
      
      * more cleaning
      
      * Update .circleci/create_circleci_config.py
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * Apply suggestions from code review
      Co-authored-by: default avataramyeroberts <22614925+amyeroberts@users.noreply.github.com>
      
      * [ydshieh] empty string if cuda is found
      
      * [ydshieh] fix condition
      
      * style
      
      * [ydshieh] fix
      
      * Add comment
      
      * style
      
      * style
      
      * show failure
      
      * trigger CI
      
      ---------
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      Co-authored-by: default avatarYih-Dar <2521628+ydshieh@users.noreply.github.com>
      Co-authored-by: default avataramyeroberts <22614925+amyeroberts@users.noreply.github.com>
      Co-authored-by: default avatarydshieh <ydshieh@users.noreply.github.com>
      627f4479
  10. 27 Apr, 2023 1 commit
  11. 26 Apr, 2023 1 commit
  12. 20 Apr, 2023 1 commit
  13. 18 Apr, 2023 1 commit
    • Sylvain Gugger's avatar
      Use code on the Hub from another repo (#22814) · 5f9b825c
      Sylvain Gugger authored
      * initial work
      
      * Add other classes
      
      * Refactor code
      
      * Move warning and fix dynamic pipeline
      
      * Issue warning when necessary
      
      * Add test
      
      * Do not skip auto tests
      
      * Fix failing tests
      
      * Refactor and address review comments
      
      * Address review comments
      5f9b825c
  14. 17 Apr, 2023 2 commits
  15. 12 Apr, 2023 1 commit
    • pioliverse's avatar
      add model resources for CPMAnt (new) (#20906) · 523ca4e0
      pioliverse authored
      
      
      * resolve conflicts
      
      * rebase and make style
      
      * test
      
      * test
      
      * test
      
      * rebase and make style
      
      * rebase and make style
      
      * tests
      
      * tests
      
      * rewrite some functions
      
      * rebase and make style
      
      * fix load_tf_weights_in_cpmant
      
      * reformat some unrelated files
      
      * upgrade quality
      
      * fix some bugs & docstring
      
      * add models and tests
      
      * solve conflicts
      
      * resolve conflicts
      
      * resolve conflicts
      
      * resolve conflicts
      
      * resolve conflicts
      
      * tests
      
      * resolve conflicts
      
      * resolve conflicts
      
      * fix load_tf_weights_in_cpmant
      
      * reformat some unrelated files
      
      * upgrade quality
      
      * fix some bugs & docstring
      
      * save resolution
      
      * make style
      
      * delete redefinition code
      
      * reformat function
      
      * reformat
      
      * resolve conflicts
      
      * resolve conflicts
      
      * resolve conflicts
      
      * resolve conflicts
      
      * resolve conflicts
      
      * tests
      
      * resolve conflicts
      
      * resolve conflicts
      
      * fix load_tf_weights_in_cpmant
      
      * reformat some unrelated files
      
      * upgrade quality
      
      * resolve conflicts
      
      * resolve conflicts
      
      * resolve conflicts
      
      * resolve conflicts
      
      * resolve conflicts
      
      * fix load_tf_weights_in_cpmant
      
      * reformat some unrelated files
      
      * upgrade quality
      
      * resolve conflicts
      
      * make style
      
      * fix bugs and refactor
      
      * modify docstrings and make style
      
      * unify import format in __init__.py
      
      * fix import-altclp bug
      
      * fix copies to update index.md
      
      * fix unused config parameters
      
      * fix unused config parameters
      
      * fix unused config parameters
      
      * update README_ja.md
      
      * dummy commit for unit test
      
      * fix attention mask
      
      * add CPMAntTokenizer&-Fast to auto-mapping
      
      * drop redundant changes in README_ko
      
      * fix  defaults in docstring
      
      * fix use_cache and some docstring
      
      * add missing args in tokenizer
      
      * modify tester inheritance
      
      * add is_jieba_available
      
      * fix some bugs
      
      * make style and fix-copies
      
      * add doctests
      
      * skip integration tests
      
      * add is_jieba_available
      
      * fix bugs in common tests
      
      * adjust docstrings and make style
      
      * add argument docstring
      
      * adjust code to some specifications
      
      * make style and fix-copies
      
      * add fast tokenization test
      
      * dummy commit for unit test
      
      * dummy commit for unit test
      
      * dummy commit for unit test
      
      * normalize some comments and names
      
      * Bert->CPMAnt
      
      * camel names and drop redundant codes
      
      * make style and fix-coies
      
      * add CpmTokenizerFast _import_structure
      
      * drop cpmanttokenizerfast in model_doc
      
      * fix some problems
      
      * fix CPMAnt tokenization for common test
      
      * make style and fixup
      
      * fix copies and fixup
      
      * fix bugs in tokenization test
      
      * dummy commit for connection failure in unittest
      
      * fix copies
      
      * drop trailing comma
      
      * fix decorator in tests
      
      * dummy commit for connection failure in unittest
      
      ---------
      Co-authored-by: default avatarGong Baitao <gongbaitao11@gmail.com>
      523ca4e0
  16. 03 Apr, 2023 1 commit
  17. 21 Mar, 2023 1 commit
  18. 16 Feb, 2023 1 commit
  19. 07 Feb, 2023 1 commit
    • Sylvain Gugger's avatar
      Cleanup quality (#21493) · 67d07487
      Sylvain Gugger authored
      * Remove mentions of flake8/isort
      
      * Clean up inits
      
      * Deall with all other inits
      
      * Last special rule for dummy files
      67d07487
  20. 06 Feb, 2023 1 commit
  21. 31 Jan, 2023 1 commit
    • NielsRogge's avatar
      Add DETA (#20983) · 5451f889
      NielsRogge authored
      * First draft
      
      * Add initial draft of conversion script
      
      * Convert all weights
      
      * Fix config
      
      * Add image processor
      
      * Fix DetaImageProcessor
      
      * Run make fix copies
      
      * Remove timm dependency
      
      * Fix dummy objects
      
      * Improve loss function
      
      * Remove conv_encoder attribute
      
      * Update conversion scripts
      
      * Improve postprocessing + docs
      
      * Fix copied from statements
      
      * Add tests
      
      * Improve postprocessing
      
      * Improve postprocessing
      
      * Update READMEs
      
      * More improvements
      
      * Fix rebase
      
      * Add is_torchvision_available
      
      * Add torchvision dependency
      
      * Fix typo and README
      
      * Fix bug
      
      * Add copied from
      
      * Fix style
      
      * Apply suggestions
      
      * Fix thanks to @ydshieh
      
      * Fix another dependency check
      
      * Simplify image processor
      
      * Add scipy
      
      * Improve code
      
      * Add threshold argument
      
      * Fix bug
      
      * Set default threshold
      
      * Improve integration test
      
      * Add another integration test
      
      * Update setup.py
      
      * Address review
      
      * Improve deformable attention function
      
      * Improve copied from
      
      * Use relative imports
      
      * Address review
      
      * Replace assertions
      
      * Address review
      
      * Update dummies
      
      * Remove dummies
      
      * Address comments, update READMEs
      
      * Remove custom kernel code
      
      * Add image processor tests
      
      * Add requires_backends
      
      * Add minor comment
      
      * Update scripts
      
      * Update organization name
      
      * Fix defaults, add doc tests
      
      * Add id2label for object 365
      
      * Fix tests
      
      * Update task guide
      5451f889
  22. 19 Jan, 2023 1 commit
    • Clémentine Fourrier's avatar
      Graphormer model for Graph Classification (#20968) · 87208a05
      Clémentine Fourrier authored
      
      
      * [FT] First commit for graphormer architecture.
      
      The model has no tokenizer, as it uses a collator and preprocessing function for its input management.
      Architecture to be tested against original one.
      The arch might need to be changed to fit the checkpoint, but a revert to the original arch will make the code less nice to read.
      TODO: doc
      
      * [FIX] removed test model
      
      * [FIX] import error
      
      * [FIX] black and flake
      
      * [DOC] added paper refs
      
      * [FIX] [DOC]
      
      * [FIX] black
      
      * [DOC] Updated READMEs
      
      * [FIX] Order of imports + rm Tokenizer calls
      
      * [FIX] Moved assert in class to prevent doc build failure
      
      * [FIX] make fix-copies
      
      * [Doc] update from code review
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * [FIX] Removed Graphormer from Sequence classification model list
      
      * [DOC] Added HF copyright to Cython file
      
      * [DOC] Fixed comments
      
      * [FIX] typos in class doc + removed config classes.
      
      Todo: update doc from paper definitions
      
      * [FIX] Removed dependency to fairseq, and replaced all asserts with Exception management
      
      * [FIX] Homogeneized initialization of weights to pretrained constructor
      
      * [FIX] [CP] Updated multi_hop parameter to get same results as in original implementation
      
      * [DOC] Relevant parameter description in the configuration file
      
      * [DOC] Updated doc and comments in main graphormer file
      
      * [FIX] make style and quality checks
      
      * [DOC] Fix doc format
      
      * [FIX] [WIP] Updated part of the tests, though still a wip
      
      * [FIX] [WIP]
      
      * [FIX] repo consistency
      
      * [FIX] Changed input names for more understandability
      
      * [FIX] [BUG] updated num_classes params for propagation in the model
      
      * simplified collator
      
      * [FIX] Updated tests to follow new naming pattern
      
      * [TESTS] Updated test suite along with model
      
      * |FIX] rm tokenizer import
      
      * [DOC] add link to graphormerdoc
      
      * Changed section in doc from text model to graph model
      
      * Apply suggestions from code review
      
      Spacing, inits
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * [DOC] Explain algos_graphormer functions
      
      * Cython soft import protection
      
      * Rm call to Callable in configuration graphormer
      
      * [FIX] replaced asserts with Exceptions
      
      * Add org to graphormer checkpoints
      
      * Prefixed classes with Graphormer
      
      * Management of init functions
      
      * format
      
      * fixes
      
      * fix length file
      
      * update indent
      
      * relaunching ci
      
      * Errors for missing cython imports
      
      * fix style
      
      * fix style doc
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      87208a05
  23. 18 Jan, 2023 1 commit
    • jeffhataws's avatar
      Add AWS Neuron torchrun support (#20806) · c59d71b2
      jeffhataws authored
      * Add XLA torchrun support
      
      * Clarify that currently DDP doesn't work with torch.distributed XLA backend yet
      
      * Enable DDP with torchrun and XLA (now available in PT-XLA 1.13)
      
      * Add check for AWS Neuron availability and AWS Neuron specific compiler flag
      
      * Change the new test's name to TestTrainerDistributedNeuronCore
      
      * Remove "assert" and replace raised exception
      
      * Remove compiler flag as it is optional. If needed, will be another PR.
      
      * Use TORCHELASTIC_RUN_ID to determine whether torchrun is used
      c59d71b2
  24. 14 Dec, 2022 1 commit
  25. 08 Dec, 2022 2 commits
    • Nathan Raw's avatar
      Add video classification pipeline (#20151) · 9e56aff5
      Nathan Raw authored
      * 🚧 wip video classification pipeline
      
      * 🚧 wip - add is_decord_available check
      
      * 🐛 add missing import
      
      *  add tests
      
      * 🔧 add decord to setup extras
      
      * 🚧 add is_decord_available
      
      *  add video-classification pipeline
      
      * 📝 add video classification pipe to docs
      
      * 🐛 add missing VideoClassificationPipeline import
      
      * 📌 add decord install in test runner
      
      *  fix url inputs to video-classification pipeline
      
      *  updates from review
      
      * 📝 add video cls pipeline to docs
      
      * 📝 add docstring
      
      * 🔥 remove unused import
      
      * 🔥 remove some code
      
      * 📝 docfix
      9e56aff5
    • Sylvain Gugger's avatar
      Migrate torchdynamo to torch.compile (#20634) · 9cc65f87
      Sylvain Gugger authored
      * Migrate torchdynamo to torch.compile
      
      * Add docstring and generic option
      
      * Properly use the function...
      
      * Reorg args
      9cc65f87
  26. 05 Dec, 2022 1 commit
  27. 29 Nov, 2022 1 commit
    • Pi Esposito's avatar
      add in layer gpt2 tokenizer (#20421) · fb2b45e5
      Pi Esposito authored
      * add minimal working gpt2 tokenizer
      
      * graph mode and output equivalence tests working
      
      * not today tensorflow. serialization test passing!
      
      * fix style, documentation, docstrings and all that jazz
      
      * passing consistency checks
      
      * move keras nlp to tf dependencies
      
      * fix tf modeling utils and gpt2 attention to enable compiling
      
      * fix (I hope) keras nlp dependencies
      
      * rever changes on generation
      
      * remove debug prints
      
      * remove redundant tf dummy objects
      
      * add from config, get config and max length settings to address review
      
      * let flake ignore the error on distillation you are welcome
      
      * test from config
      
      * add padding test
      
      * address sgugger review
      fb2b45e5
  28. 21 Nov, 2022 1 commit
  29. 18 Nov, 2022 2 commits
    • Ali Hassani's avatar
      Add Neighborhood Attention Transformer (NAT) and Dilated NAT (DiNAT) models (#20219) · fc4a993e
      Ali Hassani authored
      * Add DiNAT
      
      * Adds DiNAT + tests
      
      * Minor fixes
      
      * Added HF model
      
      * Add natten to dependencies.
      
      * Cleanup
      
      * Minor fixup
      
      * Reformat
      
      * Optional NATTEN import.
      
      * Reformat & add doc to _toctree
      
      * Reformat (finally)
      
      * Dummy objects for DiNAT
      
      * Add NAT + minor changes
      
      Adds NAT as its own independent model + docs, tests
      Adds NATTEN to ext deps to ensure ci picks it up.
      
      * Remove natten from `all` and `dev-torch` deps, add manual pip install to ci tests
      
      * Minor fixes.
      
      * Fix READMEs.
      
      * Requested changes to docs + minor fixes.
      
      * Requested changes.
      
      * Add NAT/DiNAT tests to layoutlm_job
      
      * Correction to Dinat doc.
      
      * Requested changes.
      fc4a993e
    • atturaioe's avatar
      Add AnyPrecisionAdamW optimizer (#18961) · 84c9cc6d
      atturaioe authored
      * Add AnyPrecisionAdamW optimizer
      
      * Add optim_args argument to TrainingArgs
      
      * Add tests for AnyPrecisionOptimizer
      
      * Change AnyPrecisionAdam default params to float32
      
      * Move default_anyprecision_kwargs in trainer test
      
      * Rename AnyPrecisionAdamW
      84c9cc6d
  30. 15 Nov, 2022 1 commit
  31. 14 Nov, 2022 1 commit
  32. 27 Oct, 2022 1 commit
  33. 26 Oct, 2022 1 commit
    • raghavanone's avatar
      Change the import of kenlm from github to pypi (#19770) · 7829c890
      raghavanone authored
      * Change the import of kenlm from github to pypi
      
      * Change the import of kenlm from github to pypi in circleci config
      
      * Fix code quality issues
      
      * Fix isort issue, add kenlm in extras for audio
      
      * Add kenlm to deps
      
      * Add kenlm to deps
      
      * Commit 'make fixup' changes
      
      * Remove version from kenlm deps
      
      * commit make fixup changes
      
      * Remove manual installation of kenlm
      
      * Remove manual installation of kenlm
      
      * Remove manual installation of kenlm
      7829c890
  34. 24 Oct, 2022 1 commit
  35. 21 Oct, 2022 1 commit
  36. 12 Oct, 2022 1 commit