- 03 Jul, 2024 1 commit
-
-
lintangsutawika authored
-
- 02 Jul, 2024 2 commits
-
-
haileyschoelkopf authored
-
haileyschoelkopf authored
-
- 25 Jun, 2024 2 commits
-
-
Baber Abbasi authored
* refactored `lm.apply_chat_template` * nit * fix weird type error * fixed! * skip failing test * pre-commit run all * add type hints * nit * nit * fixup
-
haileyschoelkopf authored
-
- 24 Jun, 2024 2 commits
-
-
Stella Biderman authored
-
achervyakov authored
* add tokenizer logs info * add no tokenizer case * Update lm_eval/logging_utils.py Co-authored-by:
Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com> * Update lm_eval/logging_utils.py Co-authored-by:
Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com> * add updates * fix conflict --------- Co-authored-by:
Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
-
- 19 Jun, 2024 1 commit
-
-
Hailey Schoelkopf authored
* log fewshot_as_multiturn in general tracker args * Update evaluator.py --------- Co-authored-by:Lintang Sutawika <lintang@eleuther.ai>
-
- 10 Jun, 2024 3 commits
-
-
lintangsutawika authored
-
lintangsutawika authored
-
lintangsutawika authored
-
- 07 Jun, 2024 1 commit
-
-
lintangsutawika authored
-
- 04 Jun, 2024 1 commit
-
-
lintangsutawika authored
-
- 03 Jun, 2024 2 commits
-
-
KonradSzafer authored
* initial chat template * tokenizer attribute check * variable rename * interface update * system instruction * system inst default update * fewshot as multiturn * typing update * indent update * added comments * Adding a fewshot in a more readable way * linting * Moved apply chat template to LM * multiturn alternation fix * cache key update * apply chat template method fix * add system prompt hash to cache_key * tokenizer name property for cache_key * property name fix * linting backward compatibility fix * docs and errors update * add documentation on adding chat template compatibility to model_guide * fewshot as multiturn check fix * saving system inst and chat template in results * eval tracker update * docs update * Apply suggestions from code review Co-authored-by:
Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com> --------- Co-authored-by:
haileyschoelkopf <hailey@eleuther.ai> Co-authored-by:
Clémentine Fourrier <22726840+clefourrier@users.noreply.github.com> Co-authored-by:
Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com>
-
LSinev authored
Fix #1906
-
- 30 May, 2024 1 commit
-
-
Zafir Stojanovski authored
* Higher is better tickers in output table * add extra check for `higher_is_better` not being None already * Update lm_eval/evaluator.py * fixup format I messed up * add comment (and retrigger tests) --------- Co-authored-by:
Hailey Schoelkopf <65563625+haileyschoelkopf@users.noreply.github.com> Co-authored-by:
haileyschoelkopf <hailey@eleuther.ai>
-
- 26 May, 2024 1 commit
-
-
Hailey Schoelkopf authored
* rename lm_eval.logging module * fix evaluation tracker args
-
- 24 May, 2024 1 commit
-
-
Hailey Schoelkopf authored
* add handling for bootstrap_iters=0 case * add more detail to docstring * run precommit
-
- 16 May, 2024 1 commit
-
-
lintangsutawika authored
-
- 10 May, 2024 2 commits
-
-
lintangsutawika authored
-
lintangsutawika authored
-
- 08 May, 2024 2 commits
-
-
lintangsutawika authored
-
lintangsutawika authored
-
- 07 May, 2024 8 commits
-
-
lintangsutawika authored
-
lintangsutawika authored
-
lintangsutawika authored
-
lintangsutawika authored
-
lintangsutawika authored
-
Hailey Schoelkopf authored
-
lintangsutawika authored
-
lintangsutawika authored
-
- 06 May, 2024 1 commit
-
-
LSinev authored
* Added fewshot sampling seeds to evaluator.simple_evaluate signature Way to control seed of fewshot sampling may help with #1591 * Added ability for custom sampler for ConfigurableTask May be set in config like ``` fewshot_config: sampler: !function utils.MyFewshotSampler ``` * explicitly set fewshot random generator seed for HFLM generate_until_task test * add backward compatibility for three args seed setup * save seeds info to logs/reports
-
- 05 May, 2024 1 commit
-
-
KonradSzafer authored
-
- 03 May, 2024 1 commit
-
-
KonradSzafer authored
* evaluation tracker implementation * OVModelForCausalLM test fix * typo fix * moved methods args * multiple args in one flag * loggers moved to dedicated dir * improved filename sanitization
-
- 25 Apr, 2024 1 commit
-
-
lintangsutawika authored
-
- 24 Apr, 2024 1 commit
-
-
lintangsutawika authored
-
- 23 Apr, 2024 1 commit
-
-
lintangsutawika authored
-
- 22 Mar, 2024 1 commit
-
-
Baber Abbasi authored
* add logging of model args * nit * Add warnings. * nit * add warning * nit
-
- 18 Mar, 2024 1 commit
-
-
Hailey Schoelkopf authored
* Update interface.md * fix: make caching reqs always work with accelerate launch * remove stale task migration checklist * remove deprecation warnings * make informative TypeErrors for get_task_dict * bump version metadata * fix num_fewshot printing bug * add fewshot value to cache key
-
- 17 Mar, 2024 1 commit
-
-
kwrobel.eth authored
-