Commit 8a875e9a authored by Baber Abbasi, committed by GitHub

Openllm benchmark (#1526)

parent 292e5814
group: openllm
group_alias: Open LLM Leaderboard
task:
  - task: arc_challenge
    fewshot_split: validation
    num_fewshot: 25
  - task: hellaswag
    fewshot_split: train
    num_fewshot: 10
  - task: truthfulqa
    num_fewshot: 0
  - task: mmlu
    num_fewshot: 5
  - task: winogrande
    fewshot_split: train
    num_fewshot: 5
  - task: gsm8k
    num_fewshot: 5
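
To make the per-task settings easier to review, the sketch below mirrors the group config as plain Python data and lists each task's few-shot count and split override. The names `OPENLLM_GROUP` and `summarize_fewshot` are hypothetical and not part of the commit. The sketch also assumes that an entry without `fewshot_split` leaves that choice to the task's own defaults, which this diff does not confirm.

```python
# Hypothetical mirror of the openllm group config as plain Python data,
# so the example runs with the standard library only (no YAML parser).
OPENLLM_GROUP = {
    "group": "openllm",
    "group_alias": "Open LLM Leaderboard",
    "task": [
        {"task": "arc_challenge", "fewshot_split": "validation", "num_fewshot": 25},
        {"task": "hellaswag", "fewshot_split": "train", "num_fewshot": 10},
        {"task": "truthfulqa", "num_fewshot": 0},
        {"task": "mmlu", "num_fewshot": 5},
        {"task": "winogrande", "fewshot_split": "train", "num_fewshot": 5},
        {"task": "gsm8k", "num_fewshot": 5},
    ],
}


def summarize_fewshot(group):
    """Return (task, num_fewshot, fewshot_split) for each entry.

    fewshot_split is None when the entry does not override it; the
    assumption here is that the task's own default then applies.
    """
    return [
        (entry["task"], entry["num_fewshot"], entry.get("fewshot_split"))
        for entry in group["task"]
    ]


if __name__ == "__main__":
    for name, shots, split in summarize_fewshot(OPENLLM_GROUP):
        print(f"{name:<14} shots={shots:<3} split={split or '(not set here)'}")
```

Only three of the six entries (`arc_challenge`, `hellaswag`, `winogrande`) set `fewshot_split` explicitly; the other three set only `num_fewshot`.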