1. 25 Oct, 2023 1 commit
    • Younes Belkada's avatar
      [`core`] Refactor of `gradient_checkpointing` (#27020) · 06e782da
      Younes Belkada authored
      * v1
      
      * fix
      
      * remove `create_custom_forward`
      
      * fixup
      
      * fixup
      
      * add test and fix all failing GC tests
      
      * remove all remaining `create_custom_forward` methods
      
      * fix idefics bug
      
      * fixup
      
      * replace with `__call__`
      
      * add comment
      
      * quality
      06e782da
  2. 25 Aug, 2023 1 commit
    • David Reguera's avatar
      Add type hints for several pytorch models (batch-3) (#25705) · 4d9e45f3
      David Reguera authored
      * Add missing type hints for ErnieM family
      
      * Add missing type hints for EsmForProteinFolding model
      
      * Add missing type hints for Graphormer model
      
      * Add type hints for InstructBlipQFormer model
      
      * Add missing type hints for LayoutLMForMaskedLM model
      
      * Add missing type hints for LukeForEntitySpanClassification model
      4d9e45f3
  3. 08 Aug, 2023 1 commit
  4. 04 Aug, 2023 1 commit
  5. 02 Aug, 2023 1 commit
    • Euan Ong's avatar
      Fix return_dict_in_generate bug in InstructBlip generate function (#25246) · 1baeed5b
      Euan Ong authored
      Fix bug in InstructBlip generate function
      
      Previously, the postprocessing conducted on generated sequences in InstructBlip's generate function assumed these sequences were tensors (i.e. that `return_dict_in_generate == False`).
      
      This commit checks whether the result of the call to the wrapped language model `generate()` is a tensor, and if not attempts to postprocess the sequence attribute of the returned results object.
      1baeed5b
  6. 21 Jul, 2023 1 commit
  7. 18 Jul, 2023 1 commit
  8. 11 Jul, 2023 1 commit
  9. 28 Jun, 2023 2 commits
  10. 26 Jun, 2023 2 commits
  11. 22 Jun, 2023 1 commit
  12. 21 Jun, 2023 1 commit
  13. 13 Jun, 2023 1 commit
    • Sylvain Gugger's avatar
      Tied params cleanup (#24211) · 695928e1
      Sylvain Gugger authored
      * First test
      
      * Add info for all models
      
      * style
      
      * Repo consistency
      
      * Fix last model and cleanup prints
      
      * Repo consistency
      
      * Use consistent function for detecting tied weights
      695928e1
  14. 31 May, 2023 1 commit
  15. 23 May, 2023 1 commit
  16. 01 May, 2023 1 commit
  17. 07 Apr, 2023 1 commit
  18. 04 Apr, 2023 1 commit
  19. 31 Mar, 2023 1 commit
  20. 28 Feb, 2023 2 commits
  21. 27 Feb, 2023 1 commit
  22. 10 Feb, 2023 1 commit
  23. 09 Feb, 2023 1 commit
    • NielsRogge's avatar
      Add BLIP-2 (#21441) · d7f1e7c0
      NielsRogge authored
      
      
      * First draft
      
      * More improvements
      
      * More improvements
      
      * Improve conversion script
      
      * Convert all weights
      
      * Make forward pass work
      
      * Make logits match
      
      * More improvements
      
      * More improvements
      
      * More improvements
      
      * Use get_input_embeddings
      
      * Improve some more
      
      * Improve model tests
      
      * Improve model tests
      
      * More improvements
      
      * Fix processor
      
      * Update files
      
      * Update prepare_inputs_for_generation
      
      * More improvements
      
      * Fix copies
      
      * More fixes
      
      * Make fixup
      
      * More improvements
      
      * Add support for seq2seq language model
      
      * More improvements
      
      * Fix test
      
      * More improvements
      
      * Improve conversion script
      
      * Remove some todo's
      
      * Fix README's
      
      * Improve conversion script
      
      * Fix generation
      
      * Fix style and remove Blip2Model
      
      * Fix model outputs
      
      * More improvements
      
      * Set eos_token_id in config
      
      * Fix quality
      
      * Small improvements
      
      * Add processor tests
      
      * More improvements
      
      * Apply suggestions
      
      * Apply suggestions
      
      * Add integration test
      
      * Update image URL
      
      * Add integration test
      
      * Fix model_type
      
      * Update style
      
      * Improve docs
      
      * Add doc tests
      
      * Fix copies
      
      * Remove tests which are passing
      
      * Improve some more
      
      * Add tests for seq2seq language models
      
      * Minor fix
      
      * Convert more checkpoints
      
      * finalize CI
      
      * Fix blip and blip2 processors
      
      * add `accelerate` support for `blip2`
      
      * clean up
      
      * make style
      
      * Update conversion script
      
      * Update conversion script some more
      
      * Update organization
      
      * revert toc file
      
      * add blip-2 to toc file
      
      * Some more improvements
      
      * Fix docstring
      
      * Improve docs
      
      ---------
      Co-authored-by: default avatarydshieh <ydshieh@users.noreply.github.com>
      Co-authored-by: default avataryounesbelkada <younesbelkada@gmail.com>
      d7f1e7c0