Commit f38c7469 authored by lintangsutawika

split to easy and challenge

parent 56abc3a1
include: ../_arc_challenge_alt_yaml
group: arc_challenge_alt_ov_06
task: arc_challenge_alt_ov_06c
doc_to_text: !function ../styles.template_06
doc_to_choice: !function ../styles.choice_06c
doc_to_decontamination_query: !function ../styles.template_06
include: ../_arc_challenge_alt_yaml
group: arc_challenge_alt_ov_07
task: arc_challenge_alt_ov_07a
doc_to_text: !function ../styles.template_07
doc_to_choice: !function ../styles.choice_07a
doc_to_decontamination_query: !function ../styles.template_07
include: ../_arc_challenge_alt_yaml
group: arc_challenge_alt_ov_07
task: arc_challenge_alt_ov_07b
doc_to_text: !function ../styles.template_07
doc_to_choice: !function ../styles.choice_07b
doc_to_decontamination_query: !function ../styles.template_07
include: ../_arc_challenge_alt_yaml
group: arc_challenge_alt_ov_07
task: arc_challenge_alt_ov_07c
doc_to_text: !function ../styles.template_07
doc_to_choice: !function ../styles.choice_07c
doc_to_decontamination_query: !function ../styles.template_07
include: ../_arc_challenge_alt_yaml
group: arc_challenge_alt_ov_08
task: arc_challenge_alt_ov_08a
doc_to_text: !function ../styles.template_08
doc_to_choice: !function ../styles.choice_08a
doc_to_decontamination_query: !function ../styles.template_08
include: ../_arc_challenge_alt_yaml
group: arc_challenge_alt_ov_08
task: arc_challenge_alt_ov_08b
doc_to_text: !function ../styles.template_08
doc_to_choice: !function ../styles.choice_08b
doc_to_decontamination_query: !function ../styles.template_08
include: ../_arc_challenge_alt_yaml
group: arc_challenge_alt_ov_08
task: arc_challenge_alt_ov_08c
doc_to_text: !function ../styles.template_08
doc_to_choice: !function ../styles.choice_08c
doc_to_decontamination_query: !function ../styles.template_08
dataset_path: ai2_arc
dataset_name: ARC-Challenge
output_type: multiple_choice
training_split: train
validation_split: validation
test_split: test
doc_to_text: "Question: {{question}}\nAnswer:"
doc_to_target: "{{choices.label.index(answerKey)}}"
doc_to_choice: "{{choices.text}}"
should_decontaminate: true
doc_to_decontamination_query: "Question: {{question}}\nAnswer:"
metric_list:
  - metric: acc
    aggregation: mean
    higher_is_better: true
  - metric: acc_norm
    aggregation: mean
    higher_is_better: true
  - metric: brier_score
    aggregation: brier_score
    higher_is_better: false
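
To make the base config easier to read, here is a minimal Python sketch of what its three templates resolve to for a single record, plus the usual multi-class Brier score. The sample record, the function names, and the `brier_score` helper are illustrative assumptions and are not part of this commit; the harness renders the templates with Jinja rather than with these functions, and its `brier_score` aggregation may differ in detail.

```python
# Hypothetical ARC-style record; the field names match the templates above.
doc = {
    "question": "Which gas do plants absorb from the air?",
    "choices": {
        "text": ["Oxygen", "Carbon dioxide", "Nitrogen", "Helium"],
        "label": ["A", "B", "C", "D"],
    },
    "answerKey": "B",
}


def doc_to_text(doc):
    # Plain-Python equivalent of "Question: {{question}}\nAnswer:"
    return f"Question: {doc['question']}\nAnswer:"


def doc_to_target(doc):
    # Equivalent of "{{choices.label.index(answerKey)}}":
    # the position of the gold label in the label list.
    return doc["choices"]["label"].index(doc["answerKey"])


def doc_to_choice(doc):
    # Equivalent of "{{choices.text}}": the answer strings to be scored.
    return doc["choices"]["text"]


def brier_score(probs, gold_index):
    # Standard multi-class Brier score for one record: squared distance
    # between the predicted distribution and the one-hot gold vector.
    # Lower is better, matching higher_is_better: false.
    return sum(
        (p - (1.0 if i == gold_index else 0.0)) ** 2
        for i, p in enumerate(probs)
    )


print(repr(doc_to_text(doc)))
print(doc_to_target(doc), doc_to_choice(doc))
print(brier_score([0.0, 1.0, 0.0, 0.0], doc_to_target(doc)))
```

A perfectly confident correct prediction scores 0.0, and a perfectly confident wrong prediction scores 2.0 under this definition.
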
include: _arc_challenge_alt_yaml
group: arc_challenge_alt_pv
task: arc_challenge_alt_pv_01
doc_to_text: "{{question}}"
doc_to_decontamination_query: "{{question}}"
include: _arc_challenge_alt_yaml
group: arc_challenge_alt_pv
task: arc_challenge_alt_pv_02
doc_to_text: "Q: {{question}}\nA:"
doc_to_decontamination_query: "Q: {{question}}\nA:"
include: _arc_challenge_alt_yaml
group: arc_challenge_alt_pv
task: arc_challenge_alt_pv_03
doc_to_text: "Question: {{question}}\nAnswer:"
doc_to_decontamination_query: "Question: {{question}}\nAnswer:"
Investigate the effect of letter options
- (A)
- A)
- A.
- A\t
- (a)
- a)
- a.
- a\t
Answer types:
- letters only
  - original option
  - just letter
- letters + continuation
  - original option
  - just letter
- continuation
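
The letter formats and answer types listed above can be sketched as a small set of Python helpers. This is only an illustration under stated assumptions: the `styles` module referenced by the configs is not shown in this commit, so `format_labels`, `build_prompt`, and `build_choices` are hypothetical names, and the mapping of "original option" and "just letter" onto these answer types is not specified in the source and is not modeled here.

```python
import string

# The four label shapes from the list above; "{}" is replaced by the letter.
# Each shape is used once with uppercase and once with lowercase letters.
LABEL_STYLES = ["({})", "{})", "{}.", "{}\t"]


def format_labels(n_choices, style, lowercase=False):
    """Return labels such as ["(A)", "(B)"] or ["a.", "b."]."""
    letters = string.ascii_lowercase if lowercase else string.ascii_uppercase
    return [style.format(letter) for letter in letters[:n_choices]]


def _join(label, option):
    # A tab-terminated label already separates itself from the option text.
    separator = "" if label.endswith("\t") else " "
    return f"{label}{separator}{option}"


def build_prompt(question, options, style, lowercase=False):
    """Hypothetical prompt layout: question, labeled options, answer cue."""
    labels = format_labels(len(options), style, lowercase)
    lines = [_join(label, option) for label, option in zip(labels, options)]
    return "Question: " + question + "\n" + "\n".join(lines) + "\nAnswer:"


def build_choices(options, style, answer_type, lowercase=False):
    """Return the strings the model would be scored on for one answer type."""
    labels = format_labels(len(options), style, lowercase)
    if answer_type == "letters":
        return labels
    if answer_type == "letters+continuation":
        return [_join(label, option) for label, option in zip(labels, options)]
    if answer_type == "continuation":
        return list(options)
    raise ValueError(f"unknown answer type: {answer_type}")


options = ["Oxygen", "Carbon dioxide"]
print(build_prompt("Which gas do plants absorb?", options, "({})"))
for answer_type in ("letters", "letters+continuation", "continuation"):
    print(answer_type, build_choices(options, "{}.", answer_type, lowercase=True))
```

Keeping the label shape and the answer type as separate parameters means each combination can be evaluated independently, which is the comparison these notes set out to make.
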