- 11 Oct, 2023 1 commit
-
-
Billy Bradley authored
In assisted decoding, pass model_kwargs to model's forward call (fix prepare_input_for_generation in all models) (#25242)
* In assisted decoding, pass model_kwargs to model's forward call
  Previously, assisted decoding would ignore any additional kwargs that it doesn't explicitly handle. This was inconsistent with other generation methods, which pass the model_kwargs through prepare_inputs_for_generation and forward the returned dict to the model's forward call.
  The prepare_inputs_for_generation method needs to be amended in all models, as previously it only kept the last input ID when a past_key_values was passed.
* Improve variable names in _extend_attention_mask
* Refactor extending token_type_ids into a function
* Replace deepcopy with copy to optimize performance
* Update new persimmon model with llama changes for assisted generation
* Update new mistral model for assisted generation with prepare_inputs_for_generation
* Update position_ids creation in falcon prepare_inputs_for_generation to support assisted generation
-
- 14 Sep, 2023 1 commit
-
-
Dong-Yong Lee authored
* Fix GPTNeoX beam search when using parallelize
* Fix beam search idx device when using model parallel
* remove onnx related stuff
  Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* fix: move test_beam_search_on_multi_gpu to GenerationTesterMixin
* fix: add right item to _no_split_modules of MegaPreTrainedModel
* fix: add num_beams within parallelized beam_search test
  Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
-
- 12 Sep, 2023 2 commits
-
-
pokjay authored
* Fix issues in test_exponential_decay_length_penalty
  Fix tests which were broken and add validation of negative scores. Current test didn't take into account that ExponentialDecayLengthPenalty updates the score inplace, resulting in updates to base tested Tensor. In addition, the gt assert had empty Tensors due to indexing along the batch dimension. Test is currently expected to fail to show ExponentialDecayLengthPenalty issues with negative scores
* Fix ExponentialDecayLengthPenalty negative logits issue
  In cases where the scores are negative, ExponentialDecayLengthPenalty decreases the score of eos_token_id instead of increasing it. To fix this issue we compute the penalty of the absolute value and add it to the original score.
* Add examples for ExponentialDecayLengthPenalty
* Fix styling issue in ExponentialDecayLengthPenalty doc
* Apply suggestions from code review
  Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* Style and quality fix
* Fix example outputs
---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
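Note on the negative-logits fix described above: the following is a minimal plain-Python sketch of the arithmetic only, not the library's implementation. The function names `scale_only` and `abs_penalty` and the scalar inputs are hypothetical, chosen for illustration; the real processor operates on tensors of scores. It assumes the earlier behaviour multiplied the EOS score by `factor ** steps`, which is what the commit message implies.

```python
def scale_only(score: float, factor: float, steps: int) -> float:
    """Assumed earlier behaviour: multiply the EOS score by factor**steps.

    With factor > 1 this pushes a negative score further down.
    """
    return score * factor**steps


def abs_penalty(score: float, factor: float, steps: int) -> float:
    """Behaviour described in the fix: compute the penalty on |score|
    and add it to the original score, so the score always moves up."""
    return score + abs(score) * (factor**steps - 1)


if __name__ == "__main__":
    # Hypothetical values: factor 1.5, two steps past the regulation start.
    for s in (-2.0, 2.0):
        print(s, scale_only(s, 1.5, 2), abs_penalty(s, 1.5, 2))
```

For a positive score the two formulas agree; they differ only when the score is negative, which is the case the commit targets.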
-
Joao Gante authored
-
- 23 Aug, 2023 1 commit
-
-
Joao Gante authored
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
-
- 16 Aug, 2023 1 commit
-
-
Joao Gante authored
-
- 10 Aug, 2023 1 commit
-
-
Joao Gante authored
* strict gen config save; Add tests
* add note that the warning will be an exception in v4.34
-
- 09 Aug, 2023 1 commit
-
-
hukuda222 authored
* aligned sample_beam specs with beam_search
* pull origin main
* Revert "pull origin main"
  This reverts commit 06d356f1137bb52272e120a03636598c44449cf3.
* update test_utils.py
* fix format
* remove comment
---------
Co-authored-by: Shogo Fujita <shogo.fujita@legalontech.jp>
-
- 06 Aug, 2023 1 commit
-
-
Guillaume "Vermeille" Sanchez authored
-
- 26 Jul, 2023 1 commit
-
-
Yih-Dar authored
* update
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
- 20 Jul, 2023 2 commits
-
-
Benjamin Badger authored
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
-
Joao Gante authored
-
- 27 Jun, 2023 2 commits
-
-
Xiaoli Wang authored
* Fix TypeError: Object of type int64 is not JSON serializable
* Convert numpy.float64 and numpy.int64 to float and int for json serialization
* Black reformatted examples/pytorch/token-classification/run_ner_no_trainer.py
* make style
-
Joao Gante authored
* add exception
* update docs
-
- 23 Jun, 2023 1 commit
-
-
Bowen Bao authored
* Replace python random with torch.rand to enable dynamo.export
* revert changes to flax model code
* Remove unused random import
* Fix torch template
* Move torch.manual_seed(0) to right location
-
- 21 Jun, 2023 1 commit
-
-
Joao Gante authored
-
- 13 Jun, 2023 1 commit
-
-
Joao Gante authored
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
-
- 07 Jun, 2023 1 commit
-
-
Joao Gante authored
increase atol
-
- 24 May, 2023 1 commit
-
-
Matt authored
* Rework TF type hints to use | None instead of Optional[] for tf.Tensor
* Rework TF type hints to use | None instead of Optional[] for tf.Tensor
* Don't forget the imports
* Add the imports to tests too
* make fixup
* Refactor tests that depended on get_type_hints
* Better test refactor
* Fix an old hidden bug in the test_keras_fit input creation code
* Fix for the Deit tests
-
- 18 May, 2023 2 commits
-
-
Yih-Dar authored
* fix
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Joao Gante authored
-
- 16 May, 2023 1 commit
-
-
Joao Gante authored
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
-
- 08 May, 2023 1 commit
-
-
Joao Gante authored
* starcoder has joined the chat
* indexing that works for all
-
- 03 May, 2023 2 commits
-
-
Mayank Agarwal authored
* add code for numReturnSeq
* add flax support for num return sequences
* Make Fix up for changes
* add test for num return sequences
* lint
-
Joao Gante authored
-
- 29 Apr, 2023 1 commit
-
-
Joao Gante authored
-
- 24 Apr, 2023 1 commit
-
-
Joao Gante authored
* temperature controls speed
-
- 18 Apr, 2023 2 commits
-
-
Joao Gante authored
* working mvp
* remove breakpoint
* fix commit
* standardize outputs
* tmp commit
* tests almost ready
* tmp commit
* skip a few models
* Add streaming; Docs and examples
* document limitations
* PR commits
* Amy PR comments
-
Yih-Dar authored
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
- 13 Apr, 2023 1 commit
-
-
Joao Gante authored
-
- 05 Apr, 2023 1 commit
-
-
Joao Gante authored
-
- 04 Apr, 2023 1 commit
-
-
Joao Gante authored
-
- 03 Apr, 2023 1 commit
-
-
Joao Gante authored
* haha text go brrr (but in gradio)
-
- 30 Mar, 2023 1 commit
-
-
Joao Gante authored
* haha tokens go brrrr
-
- 23 Mar, 2023 1 commit
-
-
Joao Gante authored
-
- 22 Mar, 2023 2 commits
-
-
Joao Gante authored
* tmp commit
* beef up llama tests
-
Joao Gante authored
* Export TF generate with a TF tokenizer
* remove unused lines
-
- 21 Mar, 2023 1 commit
-
-
Yih-Dar authored
* time to say goodbye, torch 1.7 and 1.8
* clean up torch_int_div
* clean up is_torch_less_than_1_8-9
* update
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
- 16 Mar, 2023 1 commit
-
-
Yih-Dar authored
* py38 + torch 2
* increment cache versions
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
- 09 Mar, 2023 1 commit
-
-
Lucain authored
* Remove set_access_token usage + fail tests if FutureWarning
* do not fail on FutureWarning in CI
---------
Co-authored-by: testbot <lucainp@hf.co>
-