Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
gaoqiong
lm-evaluation-harness
Commits
b5111e31
Commit
b5111e31
authored
Jun 27, 2024
by
Nathan Habib
Browse files
Merge remote-tracking branch 'origin/main' into adding_all_changess
parents
50cc356c
e5e5ee0c
Changes
8
Hide whitespace changes
Inline
Side-by-side
Showing
8 changed files
with
14 additions
and
1 deletion
+14
-1
.github/workflows/unit_tests.yml
.github/workflows/unit_tests.yml
+0
-1
lm_eval/tasks/mmlu/continuation/_continuation_template_yaml
lm_eval/tasks/mmlu/continuation/_continuation_template_yaml
+2
-0
lm_eval/tasks/mmlu/default/_default_template_yaml
lm_eval/tasks/mmlu/default/_default_template_yaml
+2
-0
lm_eval/tasks/mmlu/flan_cot_fewshot/_mmlu_flan_cot_fewshot_template_yaml
...mlu/flan_cot_fewshot/_mmlu_flan_cot_fewshot_template_yaml
+2
-0
lm_eval/tasks/mmlu/flan_cot_zeroshot/_mmlu_flan_cot_zeroshot_template_yaml
...u/flan_cot_zeroshot/_mmlu_flan_cot_zeroshot_template_yaml
+2
-0
lm_eval/tasks/mmlu/flan_n_shot/generative/_mmlu_flan_generative_template_yaml
...lan_n_shot/generative/_mmlu_flan_generative_template_yaml
+2
-0
lm_eval/tasks/mmlu/flan_n_shot/loglikelihood/_mmlu_flan_loglikelihood_template_yaml
...shot/loglikelihood/_mmlu_flan_loglikelihood_template_yaml
+2
-0
lm_eval/tasks/mmlu/generative/_default_template_yaml
lm_eval/tasks/mmlu/generative/_default_template_yaml
+2
-0
No files found.
.github/workflows/unit_tests.yml
View file @
b5111e31
...
...
@@ -55,7 +55,6 @@ jobs:
cache-dependency-path
:
pyproject.toml
-
name
:
Install dependencies
run
:
|
export HF_DATASETS_TRUST_REMOTE_CODE=1
python -m pip install --upgrade pip
pip install -e '.[dev,anthropic,sentencepiece]' --extra-index-url https://download.pytorch.org/whl/cpu
# Install optional git dependencies
...
...
lm_eval/tasks/mmlu/continuation/_continuation_template_yaml
View file @
b5111e31
...
...
@@ -9,3 +9,5 @@ doc_to_choice: "{{choices}}"
doc_to_target: "{{answer}}"
metadata:
version: 0.0
dataset_kwargs:
trust_remote_code: true
lm_eval/tasks/mmlu/default/_default_template_yaml
View file @
b5111e31
...
...
@@ -13,3 +13,5 @@ metric_list:
higher_is_better: true
metadata:
version: 0.0
dataset_kwargs:
trust_remote_code: true
lm_eval/tasks/mmlu/flan_cot_fewshot/_mmlu_flan_cot_fewshot_template_yaml
View file @
b5111e31
...
...
@@ -27,3 +27,5 @@ metric_list:
ignore_punctuation: true
metadata:
version: 1.0
dataset_kwargs:
trust_remote_code: true
lm_eval/tasks/mmlu/flan_cot_zeroshot/_mmlu_flan_cot_zeroshot_template_yaml
View file @
b5111e31
...
...
@@ -34,3 +34,5 @@ metric_list:
ignore_punctuation: true
metadata:
version: 2.0
dataset_kwargs:
trust_remote_code: true
lm_eval/tasks/mmlu/flan_n_shot/generative/_mmlu_flan_generative_template_yaml
View file @
b5111e31
...
...
@@ -31,3 +31,5 @@ metric_list:
higher_is_better: true
metadata:
version: 2.0
dataset_kwargs:
trust_remote_code: true
lm_eval/tasks/mmlu/flan_n_shot/loglikelihood/_mmlu_flan_loglikelihood_template_yaml
View file @
b5111e31
...
...
@@ -13,3 +13,5 @@ metric_list:
higher_is_better: true
metadata:
version: 1.0
dataset_kwargs:
trust_remote_code: true
lm_eval/tasks/mmlu/generative/_default_template_yaml
View file @
b5111e31
...
...
@@ -16,3 +16,5 @@ metric_list:
higher_is_better: true
metadata:
version: 1.0
dataset_kwargs:
trust_remote_code: true
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment