gaoqiong / lm-evaluation-harness
Commit e58b8182, authored Aug 08, 2024 by lintangsutawika
resolved merge conflict
Parents: d213a533, 0571eeb1
Showing 20 changed files with 105 additions and 0 deletions
lm_eval/tasks/med_concepts_qa/_med_concepts_qa_atc.yaml  +6 -0
lm_eval/tasks/med_concepts_qa/_med_concepts_qa_icd10cm.yaml  +6 -0
lm_eval/tasks/med_concepts_qa/_med_concepts_qa_icd10proc.yaml  +6 -0
lm_eval/tasks/med_concepts_qa/_med_concepts_qa_icd9cm.yaml  +6 -0
lm_eval/tasks/med_concepts_qa/_med_concepts_qa_icd9proc.yaml  +6 -0
lm_eval/tasks/med_concepts_qa/med_concepts_qa_atc_easy.yaml  +5 -0
lm_eval/tasks/med_concepts_qa/med_concepts_qa_atc_hard.yaml  +5 -0
lm_eval/tasks/med_concepts_qa/med_concepts_qa_atc_medium.yaml  +5 -0
lm_eval/tasks/med_concepts_qa/med_concepts_qa_icd10cm_easy.yaml  +5 -0
lm_eval/tasks/med_concepts_qa/med_concepts_qa_icd10cm_hard.yaml  +5 -0
lm_eval/tasks/med_concepts_qa/med_concepts_qa_icd10cm_medium.yaml  +5 -0
lm_eval/tasks/med_concepts_qa/med_concepts_qa_icd10proc_easy.yaml  +5 -0
lm_eval/tasks/med_concepts_qa/med_concepts_qa_icd10proc_hard.yaml  +5 -0
lm_eval/tasks/med_concepts_qa/med_concepts_qa_icd10proc_medium.yaml  +5 -0
lm_eval/tasks/med_concepts_qa/med_concepts_qa_icd9cm_easy.yaml  +5 -0
lm_eval/tasks/med_concepts_qa/med_concepts_qa_icd9cm_hard.yaml  +5 -0
lm_eval/tasks/med_concepts_qa/med_concepts_qa_icd9cm_medium.yaml  +5 -0
lm_eval/tasks/med_concepts_qa/med_concepts_qa_icd9proc_easy.yaml  +5 -0
lm_eval/tasks/med_concepts_qa/med_concepts_qa_icd9proc_hard.yaml  +5 -0
lm_eval/tasks/med_concepts_qa/med_concepts_qa_icd9proc_medium.yaml  +5 -0
lm_eval/tasks/med_concepts_qa/_med_concepts_qa_atc.yaml
0 → 100644
group: med_concepts_qa_atc
task:
  - med_concepts_qa_atc_tasks
aggregate_metric_list:
  - metric: acc
    aggregation: mean
lm_eval/tasks/med_concepts_qa/_med_concepts_qa_icd10cm.yaml
0 → 100644
group: med_concepts_qa_icd10cm
task:
  - med_concepts_qa_icd10cm_tasks
aggregate_metric_list:
  - metric: acc
    aggregation: mean
lm_eval/tasks/med_concepts_qa/_med_concepts_qa_icd10proc.yaml
0 → 100644
group: med_concepts_qa_icd10proc
task:
  - med_concepts_qa_icd10proc_tasks
aggregate_metric_list:
  - metric: acc
    aggregation: mean
lm_eval/tasks/med_concepts_qa/_med_concepts_qa_icd9cm.yaml
0 → 100644
group: med_concepts_qa_icd9cm
task:
  - med_concepts_qa_icd9cm_tasks
aggregate_metric_list:
  - metric: acc
    aggregation: mean
lm_eval/tasks/med_concepts_qa/_med_concepts_qa_icd9proc.yaml
0 → 100644
group: med_concepts_qa_icd9proc
task:
  - med_concepts_qa_icd9proc_tasks
aggregate_metric_list:
  - metric: acc
    aggregation: mean
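The five group files above each request a `mean` aggregation of `acc` over the tasks carrying the group's tag. As a rough illustration of what that could compute, here is a minimal Python sketch. It does not call the harness; `aggregate_mean`, its parameters, and the numbers are hypothetical, and whether the real mean is weighted by subtask size depends on harness settings that these files do not set.

```python
from typing import Mapping


def aggregate_mean(
    acc_by_task: Mapping[str, float],
    size_by_task: Mapping[str, int],
    weight_by_size: bool = False,
) -> float:
    """Combine per-task accuracies into a single group score.

    Hypothetical helper for illustration; not part of lm-evaluation-harness.
    """
    if not acc_by_task:
        raise ValueError("no tasks to aggregate")
    if weight_by_size:
        # Micro-average: each task counts in proportion to its number of examples.
        total = sum(size_by_task[name] for name in acc_by_task)
        weighted = sum(acc_by_task[name] * size_by_task[name] for name in acc_by_task)
        return weighted / total
    # Macro-average: every task counts equally.
    return sum(acc_by_task.values()) / len(acc_by_task)


# Made-up accuracies and sizes, for illustration only (not real results).
example_acc = {
    "med_concepts_qa_atc_easy": 0.75,
    "med_concepts_qa_atc_medium": 0.5,
    "med_concepts_qa_atc_hard": 0.25,
}
example_sizes = {
    "med_concepts_qa_atc_easy": 100,
    "med_concepts_qa_atc_medium": 100,
    "med_concepts_qa_atc_hard": 200,
}

macro = aggregate_mean(example_acc, example_sizes)
micro = aggregate_mean(example_acc, example_sizes, weight_by_size=True)
```

The two averages differ whenever the subtasks have different sizes, so it is worth checking which one a reported group score uses.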
lm_eval/tasks/med_concepts_qa/med_concepts_qa_atc_easy.yaml
0 → 100644
dataset_name: atc_easy
include: _default_template_yaml
tag: med_concepts_qa_atc_tasks
task: med_concepts_qa_atc_easy
task_alias: atc_easy
lm_eval/tasks/med_concepts_qa/med_concepts_qa_atc_hard.yaml
0 → 100644
dataset_name: atc_hard
include: _default_template_yaml
tag: med_concepts_qa_atc_tasks
task: med_concepts_qa_atc_hard
task_alias: atc_hard
lm_eval/tasks/med_concepts_qa/med_concepts_qa_atc_medium.yaml
0 → 100644
dataset_name: atc_medium
include: _default_template_yaml
tag: med_concepts_qa_atc_tasks
task: med_concepts_qa_atc_medium
task_alias: atc_medium
lm_eval/tasks/med_concepts_qa/med_concepts_qa_icd10cm_easy.yaml
0 → 100644
dataset_name: icd10cm_easy
include: _default_template_yaml
tag: med_concepts_qa_icd10cm_tasks
task: med_concepts_qa_icd10cm_easy
task_alias: icd10cm_easy
lm_eval/tasks/med_concepts_qa/med_concepts_qa_icd10cm_hard.yaml
0 → 100644
dataset_name: icd10cm_hard
include: _default_template_yaml
tag: med_concepts_qa_icd10cm_tasks
task: med_concepts_qa_icd10cm_hard
task_alias: icd10cm_hard
lm_eval/tasks/med_concepts_qa/med_concepts_qa_icd10cm_medium.yaml
0 → 100644
dataset_name: icd10cm_medium
include: _default_template_yaml
tag: med_concepts_qa_icd10cm_tasks
task: med_concepts_qa_icd10cm_medium
task_alias: icd10cm_medium
lm_eval/tasks/med_concepts_qa/med_concepts_qa_icd10proc_easy.yaml
0 → 100644
dataset_name: icd10proc_easy
include: _default_template_yaml
tag: med_concepts_qa_icd10proc_tasks
task: med_concepts_qa_icd10proc_easy
task_alias: icd10proc_easy
lm_eval/tasks/med_concepts_qa/med_concepts_qa_icd10proc_hard.yaml
0 → 100644
dataset_name: icd10proc_hard
include: _default_template_yaml
tag: med_concepts_qa_icd10proc_tasks
task: med_concepts_qa_icd10proc_hard
task_alias: icd10proc_hard
lm_eval/tasks/med_concepts_qa/med_concepts_qa_icd10proc_medium.yaml
0 → 100644
dataset_name: icd10proc_medium
include: _default_template_yaml
tag: med_concepts_qa_icd10proc_tasks
task: med_concepts_qa_icd10proc_medium
task_alias: icd10proc_medium
lm_eval/tasks/med_concepts_qa/med_concepts_qa_icd9cm_easy.yaml
0 → 100644
dataset_name: icd9cm_easy
include: _default_template_yaml
tag: med_concepts_qa_icd9cm_tasks
task: med_concepts_qa_icd9cm_easy
task_alias: icd9cm_easy
lm_eval/tasks/med_concepts_qa/med_concepts_qa_icd9cm_hard.yaml
0 → 100644
dataset_name: icd9cm_hard
include: _default_template_yaml
tag: med_concepts_qa_icd9cm_tasks
task: med_concepts_qa_icd9cm_hard
task_alias: icd9cm_hard
lm_eval/tasks/med_concepts_qa/med_concepts_qa_icd9cm_medium.yaml
0 → 100644
dataset_name: icd9cm_medium
include: _default_template_yaml
tag: med_concepts_qa_icd9cm_tasks
task: med_concepts_qa_icd9cm_medium
task_alias: icd9cm_medium
lm_eval/tasks/med_concepts_qa/med_concepts_qa_icd9proc_easy.yaml
0 → 100644
dataset_name: icd9proc_easy
include: _default_template_yaml
tag: med_concepts_qa_icd9proc_tasks
task: med_concepts_qa_icd9proc_easy
task_alias: icd9proc_easy
lm_eval/tasks/med_concepts_qa/med_concepts_qa_icd9proc_hard.yaml
0 → 100644
dataset_name: icd9proc_hard
include: _default_template_yaml
tag: med_concepts_qa_icd9proc_tasks
task: med_concepts_qa_icd9proc_hard
task_alias: icd9proc_hard
lm_eval/tasks/med_concepts_qa/med_concepts_qa_icd9proc_medium.yaml
0 → 100644
dataset_name: icd9proc_medium
include: _default_template_yaml
tag: med_concepts_qa_icd9proc_tasks
task: med_concepts_qa_icd9proc_medium
task_alias: icd9proc_medium
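The fifteen per-task files in this commit follow one pattern: a vocabulary (`atc`, `icd10cm`, `icd10proc`, `icd9cm`, `icd9proc`) crossed with a difficulty (`easy`, `hard`, `medium`). The sketch below shows how such files could be produced from that pattern. It is an illustration only; the commit does not say how the files were actually created, and `task_config` and `all_task_configs` are hypothetical names.

```python
from typing import Dict

# Vocabularies and difficulty levels as they appear in the file names of this commit.
VOCABULARIES = ["atc", "icd10cm", "icd10proc", "icd9cm", "icd9proc"]
DIFFICULTIES = ["easy", "hard", "medium"]


def task_config(vocab: str, level: str) -> str:
    """Return the YAML text for one task file (hypothetical helper)."""
    subset = f"{vocab}_{level}"
    return (
        f"dataset_name: {subset}\n"
        "include: _default_template_yaml\n"
        f"tag: med_concepts_qa_{vocab}_tasks\n"
        f"task: med_concepts_qa_{subset}\n"
        f"task_alias: {subset}\n"
    )


def all_task_configs() -> Dict[str, str]:
    """Map each task file name to its YAML text."""
    return {
        f"med_concepts_qa_{vocab}_{level}.yaml": task_config(vocab, level)
        for vocab in VOCABULARIES
        for level in DIFFICULTIES
    }


configs = all_task_configs()
```

Each generated file shares its `tag` with the other difficulties of the same vocabulary, which is what lets the matching group file collect them under one name.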