Commit e3c70cad authored by lintangsutawika's avatar lintangsutawika
Browse files

fix bleu

parent 26196917
...@@ -15,5 +15,6 @@ metric_list: ...@@ -15,5 +15,6 @@ metric_list:
higher_is_better: true higher_is_better: true
ignore_case: true ignore_case: true
ignore_punctuation: true ignore_punctuation: true
- metric: f1 - metric: !function "t5_utils.mean_3class_f1"
aggregation: !function "aggregate.cb_multi_fi" aggregation: !function "t5_utils.agg_mean_3class_f1"
higher_is_better: true
import evaluate import evaluate
def bleu(references, predictions):
bleu_fn = evaluate.load('bleu') def bleu(predictions, references):
return bleu_fn.compute(predictions=predictions, references=references) return (predictions[0], references[0])
def agg_bleu(items):
bleu_fn = evaluate.load("bleu")
predictions, references = zip(*items)
return bleu_fn.compute(predictions=predictions, references=references)["bleu"]
...@@ -13,5 +13,5 @@ metric_list: ...@@ -13,5 +13,5 @@ metric_list:
aggregation: mean aggregation: mean
higher_is_better: false higher_is_better: false
- metric: !function metrics.bleu - metric: !function metrics.bleu
aggregation: mean aggregation: !function metrics.agg_bleu
higher_is_better: true higher_is_better: true
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment