1. 14 Sep, 2023 1 commit
    • Matt's avatar
      Overhaul Conversation class and prompt templating (#25323) · 866df66f
      Matt authored
      
      
      * First commit while I figure this out
      
      * make fixup
      
      * Remove unused method
      
      * Store prompt attrib
      
      * Fix prompt argument for tests
      
      * Make same changes in fast tokenizer
      
      * Remove global prompts from fast tokenizer too
      
      * stash commit
      
      * stash commit
      
      * Migrate PromptConfig to its True Final Location
      
      * Replace Conversation entirely with the new class
      
      * Import/dependency fixes
      
      * Import/dependency fixes
      
      * Change format for lots of default prompts
      
      * More default prompt fixups
      
      * Revert llama old methods so we can compare
      
      * Fix some default configs
      
      * Fix some default configs
      
      * Fix misspelled kwarg
      
      * Fixes for Blenderbot
      
      * make fixup
      
      * little rebase cleanup
      
      * Add basic documentation
      
      * Quick doc fix
      
      * Truncate docstring for now
      
      * Add handling for the case when messages is a single string
      
      * Quick llama merges
      
      * Update conversational pipeline and tests
      
      * Add a couple of legacy properties for backward compatibility
      
      * More legacy handling
      
      * Add docstring for build_conversation_input_ids
      
      * Restructure PromptConfig
      
      * Let's start T E M P L A T I N G
      
      * Refactor all default configs to use templates instead
      
      * Revert changes to the special token properties since we don't need them anymore
      
      * More class templates
      
      * Make the sandbox even sandier
      
      * Everything replaced with pure templating
      
      * Remove docs for PromptConfig
      
      * Add testing and optional requirement boilerplate
      
      * Fix imports and make fixup
      
      * Fix LLaMA tests and add Conversation docstring
      
      * Finally get LLaMA working with the template system
      
      * Finally get LLaMA working with the template system
      
      * make fixup
      
      * make fixup
      
      * fmt-off for the long lists of test tokens
      
      * Rename method to apply_chat_template for now
      
      * Start on documentation
      
      * Make chat_template a property that reads through to the default if it's not set
      
      * Expand docs
      
      * Expand chat templating doc some more
      
      * trim/lstrip blocks by default and update doc
      
      * Few doc tweaks
      
      * rebase cleanup
      
      * Clarify docstring
      
      * rebase cleanup
      
      * rebase cleanup
      
      * make fixup
      
      * Quick doc edit
      
      * Reformat the standard template to match ChatML
      
      * Re-add PEFT check
      
      * Update docs/source/en/chat_templating.md
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      
      * Add apply_chat_template to the tokenizer doc
      
      * make fixup
      
      * Add doc links
      
      * Fix chat links
      
      * Fix chat links
      
      * Explain system messages in the doc
      
      * Add chat template test
      
      * Proper save-loading for chat template attribute
      
      * Add test skips for layout models
      
      * Remove _build_conversation_input_ids, add default_chat_template to code_llama
      
      * Make sure all LLaMA models are using the latest template
      
      * Remove default_system_prompt block in code_llama because it has no default prompt
      
      * Update ConversationPipeline preprocess
      
      * Add correct #Copied from links to the default_chat_templates
      
      * Remove unneeded type checking line
      
      * Add a dummy mark_processsed method
      
      * Reorganize Conversation to have **deprecated_kwargs
      
      * Update chat_templating.md
      
      * Quick fix to LLAMA tests
      
      * Small doc tweaks
      
      * Add proper docstrings and "copied from" statements to all default chat templates
      
      * Merge use_default_system_prompt support for code_llama too
      
      * Improve clarity around self.chat_template
      
      * Docstring fix
      
      * Fix blenderbot default template
      
      * More doctest fix
      
      * Break out some tokenizer kwargs
      
      * Update doc to explain default templates
      
      * Quick tweaks to tokenizer args
      
      * Cleanups for tokenizer args
      
      * Add note about cacheing
      
      * Quick tweak to the chat-templating doc
      
      * Update the LLaMA template with error checking and correct system message embedding
      
      * make fixup
      
      * make fixup
      
      * add requires_jinja
      
      * Cleanup to expected output formatting
      
      * Add cacheing
      
      * Fix typo in llama default template
      
      * Update LLaMA tests
      
      * Update documentation
      
      * Improved legacy handling in the Conversation class
      
      * Update Jinja template with proper error handling
      
      * Quick bugfix
      
      * Proper exception raising
      
      * Change cacheing behaviour so it doesn't try to pickle an entire Jinja env
      
      * make fixup
      
      * rebase cleanup
      
      ---------
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      866df66f
  2. 24 Aug, 2023 1 commit
  3. 22 Aug, 2023 1 commit
    • Arthur's avatar
      Fix bloom add prefix space (#25652) · e20fab0b
      Arthur authored
      * properly support Sequence of pretokenizers
      
      * actual fix
      
      * make sure the fix works. Tests are not working for sure!
      
      * hacky way
      
      * add TODO
      
      * update
      
      * add a todo
      
      * nits
      
      * rename test
      
      * nits
      
      * rename test
      e20fab0b
  4. 18 Aug, 2023 1 commit
  5. 02 Aug, 2023 1 commit
  6. 31 Jul, 2023 1 commit
  7. 27 Jul, 2023 1 commit
    • Sanchit Gandhi's avatar
      Add bloom flax (#25094) · e9310363
      Sanchit Gandhi authored
      
      
      * First commit
      
      * step 1 working
      
      * add alibi
      
      * placeholder for `scan`
      
      * add matrix mult alibi
      
      * beta scaling factor for bmm
      
      * working v1 - simple forward pass
      
      * move layer_number from attribute to arg in call
      
      * partial functioning scan
      
      * hacky working scan
      
      * add more modifs
      
      * add test
      
      * update scan for new kwarg order
      
      * fix position_ids problem
      
      * fix bug in attention layer
      
      * small fix
      
      - do the alibi broadcasting only once
      
      * prelim refactor
      
      * finish refactor
      
      * alibi shifting
      
      * incorporate dropout_add to attention module
      
      * make style
      
      * make padding work again
      
      * update
      
      * remove bogus file
      
      * up
      
      * get generation to work
      
      * clean code a bit
      
      * added small tests
      
      * adding albii test
      
      * make CI tests pass:
      
      - change init weight
      - add correct tuple for output attention
      - add scan test
      - make CI tests work
      
      * fix few nits
      
      * fix nit onnx
      
      * fix onnx nit
      
      * add missing dtype args to nn.Modules
      
      * remove debugging statements
      
      * fix scan generate
      
      * Update modeling_flax_bloom.py
      
      * Update test_modeling_flax_bloom.py
      
      * Update test_modeling_flax_bloom.py
      
      * Update test_modeling_flax_bloom.py
      
      * fix small test issue + make style
      
      * clean up
      
      * Update tests/models/bloom/test_modeling_flax_bloom.py
      Co-authored-by: default avatarSanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
      
      * fix function name
      
      * small fix test
      
      * forward contrib credits from PR17761
      
      * Fix failing test
      
      * fix small typo documentation
      
      * fix non passing test
      
      - remove device from build alibi
      
      * refactor call
      
      - refactor `FlaxBloomBlockCollection` module
      
      * make style
      
      * upcast to fp32
      
      * cleaner way to upcast
      
      * remove unused args
      
      * remove layer number
      
      * fix scan test
      
      * make style
      
      * fix i4 casting
      
      * fix slow test
      
      * Update src/transformers/models/bloom/modeling_flax_bloom.py
      Co-authored-by: default avatarSanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
      
      * remove `layer_past`
      
      * refactor a bit
      
      * fix `scan` slow test
      
      * remove useless import
      
      * major changes
      
      - remove unused code
      - refactor a bit
      - revert import `torch`
      
      * major refactoring
      
      - change build alibi
      
      * remove scan
      
      * fix tests
      
      * make style
      
      * clean-up alibi
      
      * add integration tests
      
      * up
      
      * fix batch norm conversion
      
      * style
      
      * style
      
      * update pt-fx cross tests
      
      * update copyright
      
      * Update src/transformers/modeling_flax_pytorch_utils.py
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      
      * per-weight check
      
      * style
      
      * line formats
      
      ---------
      Co-authored-by: default avataryounesbelkada <younesbelkada@gmail.com>
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      Co-authored-by: default avatarYounes Belkada <49240599+younesbelkada@users.noreply.github.com>
      Co-authored-by: default avatarhaileyschoelkopf <haileyschoelkopf@users.noreply.github.com>
      Co-authored-by: default avatarSylvain Gugger <35901082+sgugger@users.noreply.github.com>
      e9310363
  8. 16 Jun, 2023 1 commit
  9. 16 May, 2023 1 commit
  10. 21 Mar, 2023 1 commit
  11. 28 Feb, 2023 1 commit
    • Yih-Dar's avatar
      馃敟Rework pipeline testing by removing `PipelineTestCaseMeta` 馃殌 (#21516) · 871c31a6
      Yih-Dar authored
      
      
      * Add PipelineTesterMixin
      
      * remove class PipelineTestCaseMeta
      
      * move validate_test_components
      
      * Add for ViT
      
      * Add to SPECIAL_MODULE_TO_TEST_MAP
      
      * style and quality
      
      * Add feature-extraction
      
      * update
      
      * raise instead of skip
      
      * add tiny_model_summary.json
      
      * more explicit
      
      * skip tasks not in mapping
      
      * add availability check
      
      * Add Copyright
      
      * A way to diable irrelevant tests
      
      * update with main
      
      * remove disable_irrelevant_tests
      
      * skip tests
      
      * better skip message
      
      * better skip message
      
      * Add all pipeline task tests
      
      * revert
      
      * Import PipelineTesterMixin
      
      * subclass test classes with PipelineTesterMixin
      
      * Add pipieline_model_mapping
      
      * Fix import after adding pipieline_model_mapping
      
      * Fix style and quality after adding pipieline_model_mapping
      
      * Fix one more import after adding pipieline_model_mapping
      
      * Fix style and quality after adding pipieline_model_mapping
      
      * Fix test issues
      
      * Fix import requirements
      
      * Fix mapping for MobileViTModelTest
      
      * Update
      
      * Better skip message
      
      * pipieline_model_mapping could not be None
      
      * Remove some PipelineTesterMixin
      
      * Fix typo
      
      * revert tests_fetcher.py
      
      * update
      
      * rename
      
      * revert
      
      * Remove PipelineTestCaseMeta from ZeroShotAudioClassificationPipelineTests
      
      * style and quality
      
      * test fetcher for all pipeline/model tests
      
      ---------
      Co-authored-by: default avatarydshieh <ydshieh@users.noreply.github.com>
      871c31a6
  12. 22 Feb, 2023 1 commit
  13. 06 Feb, 2023 1 commit
    • Sylvain Gugger's avatar
      Update quality tooling for formatting (#21480) · 6f79d264
      Sylvain Gugger authored
      * Result of black 23.1
      
      * Update target to Python 3.7
      
      * Switch flake8 to ruff
      
      * Configure isort
      
      * Configure isort
      
      * Apply isort with line limit
      
      * Put the right black version
      
      * adapt black in check copies
      
      * Fix copies
      6f79d264
  14. 27 Dec, 2022 1 commit
  15. 09 Nov, 2022 1 commit
  16. 10 Oct, 2022 1 commit
  17. 04 Oct, 2022 1 commit
  18. 26 Sep, 2022 1 commit
  19. 15 Sep, 2022 1 commit
  20. 12 Aug, 2022 1 commit
  21. 29 Jul, 2022 1 commit
    • Michael Benayoun's avatar
      [FX] Symbolic trace for Bloom (#18356) · 4e2f4a92
      Michael Benayoun authored
      * Bloom model can now be traced
      
      * Bloom traced model can be torch scripted and serialized
      
      * Bloom can be traced with variable keyword arguments
      
      * Enable XLNet support
      
      * Disable XLNet for now
      4e2f4a92
  22. 18 Jul, 2022 1 commit
  23. 11 Jul, 2022 1 commit
  24. 23 Jun, 2022 1 commit
  25. 14 Jun, 2022 2 commits
  26. 09 Jun, 2022 1 commit