Optimization for evalita-llm rouge computation (#2878)
* feat: initial commit with templates for evalita evaluation
* fix: change rule for generate_until
* feat: modified yaml to use reduced version of NER test datasets
* feat: added templates to use reduced dataset for summarization (fanpage and ilpost)
* Add Six Prompts for Each Multiple-Choice Task
* fix: fastest eval for summarization
* chore: linted with ruff
* chore: linted with ruff
---------
Co-authored-by: rzanoli <zanoli@fbk.eu>