  1. 14 Dec, 2023 1 commit
    • Refactor `hf` modeling code (#1096) · e0eda4d3
      Hailey Schoelkopf authored
      * modularize HFLM code
      
      * pass through extra kwargs to the AutoModel.from_pretrained call (see the sketch after this entry)
      
      * remove explicit model_kwargs
      
      * rename gptq -> autogptq
      
      * fix tokenizer pad token errors
      
      * ensure model always respects device_map and autogptq's selected devices
      
      * add a _get_config helper fn
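      A minimal sketch of the pattern these bullets describe: a `_get_config` helper plus extra kwargs forwarded directly to the Hugging Face `from_pretrained` call. The `HFLMSketch` class and its arguments are hypothetical illustrations, not the harness's actual implementation; only the `transformers` calls are real.

```python
# Hypothetical sketch of the refactor described above -- not the harness's code.
import transformers


class HFLMSketch:
    """Stand-in for the refactored HFLM class (name is illustrative only)."""

    def __init__(self, pretrained: str, revision: str = "main", **kwargs):
        # _get_config helper: load the model config once up front.
        self._get_config(pretrained, revision=revision)
        # Extra kwargs (e.g. trust_remote_code, torch_dtype) pass straight
        # through to from_pretrained instead of an explicit model_kwargs dict.
        self._model = transformers.AutoModelForCausalLM.from_pretrained(
            pretrained, revision=revision, **kwargs
        )

    def _get_config(self, pretrained: str, revision: str = "main") -> None:
        # Load the config separately so later steps can inspect it.
        self._config = transformers.AutoConfig.from_pretrained(
            pretrained, revision=revision
        )
```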
  2. 29 Nov, 2023 1 commit
  3. 26 Nov, 2023 1 commit
  4. 21 Nov, 2023 1 commit
  5. 20 Nov, 2023 1 commit
  6. 17 Nov, 2023 1 commit
  7. 10 Nov, 2023 1 commit
  8. 02 Nov, 2023 1 commit
  9. 01 Nov, 2023 1 commit
  10. 19 Oct, 2023 1 commit
  11. 17 Oct, 2023 1 commit
  12. 13 Oct, 2023 1 commit
  13. 11 Oct, 2023 3 commits
  14. 21 Sep, 2023 1 commit
  15. 13 Sep, 2023 1 commit
  16. 05 Sep, 2023 2 commits
  17. 04 Sep, 2023 2 commits
  18. 26 Aug, 2023 1 commit
  19. 25 Aug, 2023 2 commits
  20. 22 Aug, 2023 1 commit
  21. 11 Aug, 2023 1 commit
  22. 10 Aug, 2023 1 commit
  23. 07 Aug, 2023 1 commit
  24. 04 Aug, 2023 1 commit
  25. 03 Aug, 2023 1 commit
  26. 02 Aug, 2023 1 commit
  27. 27 Jul, 2023 2 commits
  28. 26 Jul, 2023 1 commit
  29. 24 Jul, 2023 1 commit
    • Fix early-stop bug in greedy_until (primary_until should be a list of str) · 984f8793
      ZZR0 authored
      I discovered that the accuracy of all models (e.g., llama7b, llama13b, starcoder) on the 'gsm8k-cot' task was 0%. After a thorough investigation, I realized that the generated text for each question was being cut off early, which prevented the 'regex_pattern' from finding any answers. The cause was an incorrect assignment of the 'primary_until' variable in the 'greedy_until' function: 'primary_until' should be a list of strings rather than a single string, because the 'stop_sequences' parameter of the 'stop_sequences_criteria' function requires a List[str]. Once I assigned 'primary_until' to '[until[0]]', the accuracy of llama7b on the 'gsm8k-cot' task increased to 1.67%.
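      A minimal sketch of why the type of 'primary_until' matters, assuming the stop-sequence helper simply iterates over its 'stop_sequences' argument; 'build_stop_sequences' below is a hypothetical stand-in for 'stop_sequences_criteria', not the harness's code.

```python
# Hypothetical illustration of the List[str] vs. str bug -- not the harness's code.
from typing import List


def build_stop_sequences(stop_sequences: List[str]) -> List[str]:
    # Iterating over a bare str yields single characters, so every character
    # of the intended stop string becomes its own stop sequence.
    return list(stop_sequences)


until = ["\n\n", "Question:"]

buggy = build_stop_sequences(until[0])    # ['\n', '\n']  -> stops at every newline
fixed = build_stop_sequences([until[0]])  # ['\n\n']      -> stops only at a blank line
print(buggy, fixed)
```

      Under the buggy call, a chain-of-thought generation that contains any newline would be cut off after its first line, which is consistent with the early stop and 0% gsm8k-cot accuracy described above.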
  30. 22 Jul, 2023 1 commit
  31. 21 Jul, 2023 1 commit
  32. 17 Jul, 2023 1 commit
  33. 16 Jul, 2023 1 commit
  34. 15 Jul, 2023 1 commit