1. 10 Jun, 2024 3 commits
  2. 07 Jun, 2024 12 commits
  3. 06 Jun, 2024 18 commits
  4. 05 Jun, 2024 7 commits
    • Skip failing JetMOE generation tests (#31266) · 940fde8d
      amyeroberts authored
      Skip failing tests for now
    • Reduce by 2 the memory requirement in `generate()` 🔥🔥🔥 (#30536) · bd5091df
      Cyril Vallez authored
      * Fix contrastive_search for new cache structure, and improve performance by removing inefficient torch.stack(torch.split(x, top_k, dim=0))
      
      * Fix _contrastive_search for non-standard cache using ellipsis slicing
      
      * Fix all outputs.logits memory leaks for all decoding strategies!
      
      * Fix small error in _contrastive_search()
      
      * Make all necessary changes and revert for the new class
      
      * Apply coding style
      
      * Remove pipes in type hints for compatibility
      
      * correct type hint
      
      * apply style
      
      * Use DynamicCache by default and solve conflicts
      
      * Fix rebase issues
      
      * Add `_supports_dynamic_cache_class` in models for models that support DynamicCache but not other caches to make DynamicCache the default for more models
      
      * Create generation config to return legacy format by default, or to choose not to
      
      * style
      
      * Fix case when use_cache is False
      
      * Remove default DynamicCache in assisted_decoding if assistant_model does not support it + fix _seen_tokens when cropping cache
      
      * Update prepare_inputs_for_generation() for case with empty DynamicCache
      
      * Correct return of args in _assisted_decoding
      
      * Remove EfficientDynamicCache as it is no longer needed
      
      * Correct mistake in generation config
      
      * Move cache logic of assisted decoding to AssistedCandidateGenerator.__init__
      
      * change DynamicCache function names from "split" to "batch_split" for readability + apply coding style
      
      * Remove `_supports_dynamic_cache_class` attribute after rebase
      
      * Correct missing line lost in conflict resolution during rebasing
      
      * Add special case for Jamba
      
      * Fix jamba test
      
      * Coding style
      
      * coding style
      
      * Correct missing import in rebasing
      
      * Simplify _validate_model_kwargs based on removal of _supports_dynamic_cache attribute
      
      * Simplify code paths in _contrastive_search
      
      * coding style
      
      * Update docstrings of cache methods
      
      * Update prepare_inputs_for_generation() -> past_key_values are always Cache objects
    • Yih-Dar's avatar
    • Dhaivat Bhatt's avatar
    • Add missing Flaubert tokenizer tests (#30492) · 464d986b
      bastrob authored
      * add flaubert tokenization test, enrich inheritance in FlaubertTokenizer.
      
      * fix quality code ci
      
      * ensure parameter consistency
      
      * fix ci
      
      * fix copyright year and flatten vocab list.
      
      * fix style
    • enable deterministic mode for npu (#31253) · 41cf4097
      Huazhong Ji authored
    • doc: add info about wav2vec2 bert in older wav2vec2 models. (#31120) · 4a602492
      Vaibhav Srivastav authored
      
      
      * doc: add info about wav2vec2 bert in older wav2vec2 models.
      
      * apply suggestions from review.
      
      * forward contrib credits from review
      
      ---------
      Co-authored-by: Sanchit Gandhi <sanchit-gandhi@users.noreply.github.com>