{"es":{"description":"HEAD-QA is a multi-choice HEAlthcare Dataset. The questions come from exams to access a specialized position in the\nSpanish healthcare system, and are challenging even for highly specialized humans. They are designed by the Ministerio\nde Sanidad, Consumo y Bienestar Social.\nThe dataset contains questions about the following topics: medicine, nursing, psychology, chemistry, pharmacology and biology.\n","citation":"@inproceedings{vilares-gomez-rodriguez-2019-head,\n title = \"{HEAD}-{QA}: A Healthcare Dataset for Complex Reasoning\",\n author = \"Vilares, David and\n G{'o}mez-Rodr{'i}guez, Carlos\",\n booktitle = \"Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics\",\n month = jul,\n year = \"2019\",\n address = \"Florence, Italy\",\n publisher = \"Association for Computational Linguistics\",\n url = \"https://www.aclweb.org/anthology/P19-1092\",\n doi = \"10.18653/v1/P19-1092\",\n pages = \"960--966\",\n abstract = \"We present HEAD-QA, a multi-choice question answering testbed to encourage research on complex reasoning. The questions come from exams to access a specialized position in the Spanish healthcare system, and are challenging even for highly specialized humans. We then consider monolingual (Spanish) and cross-lingual (to English) experiments with information retrieval and neural techniques. We show that: (i) HEAD-QA challenges current methods, and (ii) the results lag well behind human performance, demonstrating its usefulness as a benchmark for future work.\",\n}\n","homepage":"https://aghie.github.io/head-qa/","license":"MIT License","features":{"name":{"dtype":"string","id":null,"_type":"Value"},"year":{"dtype":"string","id":null,"_type":"Value"},"category":{"dtype":"string","id":null,"_type":"Value"},"qid":{"dtype":"int32","id":null,"_type":"Value"},"qtext":{"dtype":"string","id":null,"_type":"Value"},"ra":{"dtype":"int32","id":null,"_type":"Value"},"answers":[{"aid":{"dtype":"int32","id":null,"_type":"Value"},"atext":{"dtype":"string","id":null,"_type":"Value"}}]},"post_processed":null,"supervised_keys":null,"task_templates":null,"builder_name":"head_qa","config_name":"es","version":{"version_str":"1.1.0","description":null,"major":1,"minor":1,"patch":0},"splits":{"train":{"name":"train","num_bytes":1196021,"num_examples":2657,"dataset_name":"head_qa"},"test":{"name":"test","num_bytes":1169819,"num_examples":2742,"dataset_name":"head_qa"},"validation":{"name":"validation","num_bytes":556924,"num_examples":1366,"dataset_name":"head_qa"}},"download_checksums":{"https://drive.google.com/uc?export=download&confirm=t&id=1a_95N5zQQoUCq8IBNVZgziHbeM-QxG2t":{"num_bytes":79365502,"checksum":"6ec29a3f55153d167f0bdf05395558919ba0b1df9c63e79ffceda2a09884ad8b"}},"download_size":79365502,"post_processing_size":null,"dataset_size":2922764,"size_in_bytes":82288266},"en":{"description":"HEAD-QA is a multi-choice HEAlthcare Dataset. The questions come from exams to access a specialized position in the\nSpanish healthcare system, and are challenging even for highly specialized humans. They are designed by the Ministerio\nde Sanidad, Consumo y Bienestar Social.\nThe dataset contains questions about the following topics: medicine, nursing, psychology, chemistry, pharmacology and biology.\n","citation":"@inproceedings{vilares-gomez-rodriguez-2019-head,\n title = \"{HEAD}-{QA}: A Healthcare Dataset for Complex Reasoning\",\n author = \"Vilares, David and\n G{'o}mez-Rodr{'i}guez, Carlos\",\n booktitle = \"Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics\",\n month = jul,\n year = \"2019\",\n address = \"Florence, Italy\",\n publisher = \"Association for Computational Linguistics\",\n url = \"https://www.aclweb.org/anthology/P19-1092\",\n doi = \"10.18653/v1/P19-1092\",\n pages = \"960--966\",\n abstract = \"We present HEAD-QA, a multi-choice question answering testbed to encourage research on complex reasoning. The questions come from exams to access a specialized position in the Spanish healthcare system, and are challenging even for highly specialized humans. We then consider monolingual (Spanish) and cross-lingual (to English) experiments with information retrieval and neural techniques. We show that: (i) HEAD-QA challenges current methods, and (ii) the results lag well behind human performance, demonstrating its usefulness as a benchmark for future work.\",\n}\n","homepage":"https://aghie.github.io/head-qa/","license":"MIT License","features":{"name":{"dtype":"string","id":null,"_type":"Value"},"year":{"dtype":"string","id":null,"_type":"Value"},"category":{"dtype":"string","id":null,"_type":"Value"},"qid":{"dtype":"int32","id":null,"_type":"Value"},"qtext":{"dtype":"string","id":null,"_type":"Value"},"ra":{"dtype":"int32","id":null,"_type":"Value"},"answers":[{"aid":{"dtype":"int32","id":null,"_type":"Value"},"atext":{"dtype":"string","id":null,"_type":"Value"}}]},"post_processed":null,"supervised_keys":null,"task_templates":null,"builder_name":"head_qa","config_name":"en","version":{"version_str":"1.1.0","description":null,"major":1,"minor":1,"patch":0},"splits":{"train":{"name":"train","num_bytes":1123151,"num_examples":2657,"dataset_name":"head_qa"},"test":{"name":"test","num_bytes":1097349,"num_examples":2742,"dataset_name":"head_qa"},"validation":{"name":"validation","num_bytes":523462,"num_examples":1366,"dataset_name":"head_qa"}},"download_checksums":{"https://drive.google.com/uc?export=download&confirm=t&id=1a_95N5zQQoUCq8IBNVZgziHbeM-QxG2t":{"num_bytes":79365502,"checksum":"6ec29a3f55153d167f0bdf05395558919ba0b1df9c63e79ffceda2a09884ad8b"}},"download_size":79365502,"post_processing_size":null,"dataset_size":2743962,"size_in_bytes":82109464}}
abstract = "We present HEAD-QA, a multi-choice question answering testbed to encourage research on complex reasoning. The questions come from exams to access a specialized position in the Spanish healthcare system, and are challenging even for highly specialized humans. We then consider monolingual (Spanish) and cross-lingual (to English) experiments with information retrieval and neural techniques. We show that: (i) HEAD-QA challenges current methods, and (ii) the results lag well behind human performance, demonstrating its usefulness as a benchmark for future work.",
}
"""
_DESCRIPTION="""\
HEAD-QA is a multi-choice HEAlthcare Dataset. The questions come from exams to access a specialized position in the
Spanish healthcare system, and are challenging even for highly specialized humans. They are designed by the Ministerio
de Sanidad, Consumo y Bienestar Social.
The dataset contains questions about the following topics: medicine, nursing, psychology, chemistry, pharmacology and biology.