README.md 2.25 KB
Newer Older
Lintang Sutawika's avatar
Lintang Sutawika committed
1
2
3
4
5
6
# bAbI

### Paper

Title: Towards ai-complete question answering: A set of prerequisite toy tasks
Abstract: https://arxiv.org/abs/1502.05698
Lintang Sutawika's avatar
Lintang Sutawika committed
7
8
9

One long-term goal of machine learning research is to produce methods that are applicable to reasoning and natural language, in particular building an intelligent dialogue agent. To measure progress towards that goal, we argue for the usefulness of a set of proxy tasks that evaluate reading comprehension via question answering. Our tasks measure understanding in several ways: whether a system is able to answer questions via chaining facts, simple induction, deduction and many more. The tasks are designed to be prerequisites for any system that aims to be capable of conversing with a human. We believe many existing learning systems can currently not solve them, and hence our aim is to classify these tasks into skill sets, so that researchers can identify (and then rectify) the failings of their systems. We also extend and improve the recently introduced Memory Networks model, and show it is able to solve some, but not all, of the tasks.

Lintang Sutawika's avatar
Lintang Sutawika committed
10
11
12
13
Homepage: https://github.com/facebookarchive/bAbI-tasks


### Citation
Lintang Sutawika's avatar
Lintang Sutawika committed
14

Lintang Sutawika's avatar
Lintang Sutawika committed
15
```
Lintang Sutawika's avatar
Lintang Sutawika committed
16
17
18
19
20
21
@article{weston2015towards,
  title={Towards ai-complete question answering: A set of prerequisite toy tasks},
  author={Weston, Jason and Bordes, Antoine and Chopra, Sumit and Rush, Alexander M and Van Merri{\"e}nboer, Bart and Joulin, Armand and Mikolov, Tomas},
  journal={arXiv preprint arXiv:1502.05698},
  year={2015}
}
Lintang Sutawika's avatar
Lintang Sutawika committed
22
```
Lintang Sutawika's avatar
Lintang Sutawika committed
23

24
### Groups, Tags, and Tasks
Lintang Sutawika's avatar
Lintang Sutawika committed
25

lintangsutawika's avatar
lintangsutawika committed
26
27
28
#### Groups

* Not part of a group yet
Lintang Sutawika's avatar
Lintang Sutawika committed
29

30
31
32
33
#### Tags

* No tags applied.

lintangsutawika's avatar
lintangsutawika committed
34
#### Tasks
Lintang Sutawika's avatar
Lintang Sutawika committed
35

lintangsutawika's avatar
lintangsutawika committed
36
* `babi`
Lintang Sutawika's avatar
Lintang Sutawika committed
37
38
39
40
41
42
43
44

### Checklist

For adding novel benchmarks/datasets to the library:
* [ ] Is the task an existing benchmark in the literature?
  * [ ] Have you referenced the original paper that introduced the task?
  * [ ] If yes, does the original paper provide a reference implementation? If so, have you checked against the reference implementation and documented how to run such a test?

Lintang Sutawika's avatar
Lintang Sutawika committed
45

Lintang Sutawika's avatar
Lintang Sutawika committed
46
47
48
49
If other tasks on this dataset are already supported:
* [ ] Is the "Main" variant of this task clearly denoted?
* [ ] Have you provided a short sentence in a README on what each new variant adds / evaluates?
* [ ] Have you noted which, if any, published evaluation setups are matched by this variant?