# xglm-1.7B ## xglm-1.7B_common_sense_reasoning_0-shot.json | Task |Version| Metric |Value| |Stderr| |-------------|------:|--------|----:|---|-----:| |arc_challenge| 0|acc |20.99|± | 1.19| | | |acc_norm|24.32|± | 1.25| |arc_easy | 0|acc |53.62|± | 1.02| | | |acc_norm|47.90|± | 1.03| |boolq | 1|acc |58.56|± | 0.86| |copa | 0|acc |68.00|± | 4.69| |hellaswag | 0|acc |36.18|± | 0.48| | | |acc_norm|45.80|± | 0.50| |mc_taco | 0|em |12.91| | | | | |f1 |34.52| | | |openbookqa | 0|acc |17.00|± | 1.68| | | |acc_norm|29.80|± | 2.05| |piqa | 0|acc |69.70|± | 1.07| | | |acc_norm|70.35|± | 1.07| |prost | 0|acc |22.69|± | 0.31| | | |acc_norm|27.21|± | 0.33| |swag | 0|acc |45.97|± | 0.35| | | |acc_norm|62.19|± | 0.34| |winogrande | 0|acc |54.93|± | 1.40| |wsc273 | 0|acc |68.13|± | 2.83| ## xglm-1.7B_gsm8k_8-shot.json |Task |Version|Metric|Value| |Stderr| |-----|------:|------|----:|---|-----:| |gsm8k| 0|acc | 0.99|± | 0.27| ## xglm-1.7B_mathematical_reasoning_few_shot_5-shot.json | Task |Version| Metric |Value| |Stderr| |-------------------------|------:|--------|----:|---|-----:| |drop | 1|em | 0.67|± | 0.08| | | |f1 | 3.44|± | 0.13| |gsm8k | 0|acc | 0.83|± | 0.25| |math_algebra | 1|acc | 0.00|± | 0.00| |math_counting_and_prob | 1|acc | 0.00|± | 0.00| |math_geometry | 1|acc | 0.00|± | 0.00| |math_intermediate_algebra| 1|acc | 0.00|± | 0.00| |math_num_theory | 1|acc | 0.00|± | 0.00| |math_prealgebra | 1|acc | 0.00|± | 0.00| |math_precalc | 1|acc | 0.00|± | 0.00| |mathqa | 0|acc |22.91|± | 0.77| | | |acc_norm|21.44|± | 0.75| ## xglm-1.7B_pawsx_0-shot.json | Task |Version|Metric|Value| |Stderr| |--------|------:|------|----:|---|-----:| |pawsx_de| 0|acc |57.55|± | 1.11| |pawsx_en| 0|acc |52.65|± | 1.12| |pawsx_es| 0|acc |53.80|± | 1.12| |pawsx_fr| 0|acc |47.35|± | 1.12| |pawsx_ja| 0|acc |46.10|± | 1.11| |pawsx_ko| 0|acc |51.40|± | 1.12| |pawsx_zh| 0|acc |48.10|± | 1.12| ## xglm-1.7B_xcopa_0-shot.json | Task |Version|Metric|Value| |Stderr| |--------|------:|------|----:|---|-----:| |xcopa_et| 0|acc | 56.8|± | 2.22| |xcopa_ht| 0|acc | 55.8|± | 2.22| |xcopa_id| 0|acc | 64.6|± | 2.14| |xcopa_it| 0|acc | 54.0|± | 2.23| |xcopa_qu| 0|acc | 52.2|± | 2.24| |xcopa_sw| 0|acc | 56.6|± | 2.22| |xcopa_ta| 0|acc | 55.2|± | 2.23| |xcopa_th| 0|acc | 58.2|± | 2.21| |xcopa_tr| 0|acc | 53.4|± | 2.23| |xcopa_vi| 0|acc | 63.0|± | 2.16| |xcopa_zh| 0|acc | 58.0|± | 2.21| ## xglm-1.7B_xnli_0-shot.json | Task |Version|Metric|Value| |Stderr| |-------|------:|------|----:|---|-----:| |xnli_ar| 0|acc |33.51|± | 0.67| |xnli_bg| 0|acc |44.73|± | 0.70| |xnli_de| 0|acc |45.33|± | 0.70| |xnli_el| 0|acc |40.10|± | 0.69| |xnli_en| 0|acc |49.68|± | 0.71| |xnli_es| 0|acc |43.61|± | 0.70| |xnli_fr| 0|acc |45.73|± | 0.70| |xnli_hi| 0|acc |42.61|± | 0.70| |xnli_ru| 0|acc |45.97|± | 0.70| |xnli_sw| 0|acc |42.00|± | 0.70| |xnli_th| 0|acc |41.70|± | 0.70| |xnli_tr| 0|acc |42.95|± | 0.70| |xnli_ur| 0|acc |39.50|± | 0.69| |xnli_vi| 0|acc |45.03|± | 0.70| |xnli_zh| 0|acc |33.77|± | 0.67| ## xglm-1.7B_xstory_cloze_0-shot.json | Task |Version|Metric|Value| |Stderr| |---------------|------:|------|----:|---|-----:| |xstory_cloze_ar| 0|acc |52.48|± | 1.29| |xstory_cloze_en| 0|acc |64.33|± | 1.23| |xstory_cloze_es| 0|acc |59.23|± | 1.26| |xstory_cloze_eu| 0|acc |56.12|± | 1.28| |xstory_cloze_hi| 0|acc |55.79|± | 1.28| |xstory_cloze_id| 0|acc |57.97|± | 1.27| |xstory_cloze_my| 0|acc |53.81|± | 1.28| |xstory_cloze_ru| 0|acc |59.83|± | 1.26| |xstory_cloze_sw| 0|acc |55.99|± | 1.28| |xstory_cloze_te| 0|acc |58.04|± | 1.27| |xstory_cloze_zh| 0|acc |56.19|± | 1.28| ## xglm-1.7B_xwinograd_0-shot.json | Task |Version|Metric|Value| |Stderr| |------------|------:|------|----:|---|-----:| |xwinograd_en| 0|acc |71.05|± | 0.94| |xwinograd_fr| 0|acc |60.24|± | 5.40| |xwinograd_jp| 0|acc |60.58|± | 1.58| |xwinograd_pt| 0|acc |63.88|± | 2.97| |xwinograd_ru| 0|acc |59.68|± | 2.77| |xwinograd_zh| 0|acc |69.84|± | 2.05|