# Tree-LSTM
This is a re-implementation of the following paper:

> [**Improved Semantic Representations From Tree-Structured Long Short-Term Memory Networks**](http://arxiv.org/abs/1503.00075) 
> *Kai Sheng Tai, Richard Socher, and Christopher Manning*. 

The provided implementation achieves a test accuracy of 51.72%, which is comparable to the 51.0% (±0.5) reported in the original paper.

## Data
The script downloads the [SST dataset](http://nlp.stanford.edu/sentiment/index.html) automatically, but you need to download the GloVe word vectors yourself. From the command line, you can run:
```
wget http://nlp.stanford.edu/data/glove.840B.300d.zip
unzip glove.840B.300d.zip
```
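
Once the archive is unpacked, the vectors are stored as plain text, one token per line followed by its values. The following is a minimal sketch of how such a file could be read, not the loader used by `train.py`. The helper names `parse_glove_line` and `load_glove` are hypothetical. Splitting from the right is a precaution, since the 840B file is reported to contain a few tokens with embedded spaces.

```python
import io


def parse_glove_line(line, dim=300):
    """Split one line of a GloVe text file into (token, vector)."""
    # Split from the right so that a token containing spaces stays in
    # one piece and the last `dim` fields are always the numbers.
    parts = line.rstrip("\n").rsplit(" ", dim)
    if len(parts) != dim + 1:
        raise ValueError("expected a token followed by %d values" % dim)
    token = parts[0]
    vector = [float(x) for x in parts[1:]]
    return token, vector


def load_glove(lines, vocab=None, dim=300):
    """Build a {token: vector} dict, optionally restricted to `vocab`."""
    embeddings = {}
    for line in lines:
        token, vector = parse_glove_line(line, dim)
        if vocab is None or token in vocab:
            embeddings[token] = vector
    return embeddings


# Tiny in-memory stand-in for the real file, using 3 dimensions.
sample = io.StringIO("the 0.1 0.2 0.3\n. . . 0.4 0.5 0.6\ncat 0.7 0.8 0.9\n")
vectors = load_glove(sample, vocab={"the", ". . ."}, dim=3)
print(sorted(vectors))
```

For the real data, pass an open file handle instead of `sample` (assuming the archive extracts to `glove.840B.300d.txt`) and keep the default `dim=300`. Restricting the load to the dataset vocabulary keeps memory use far below that of the full file.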

## Dependencies
* PyTorch 0.4.1+
* requests
* nltk

```
pip install torch requests nltk
```
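
To confirm that the environment satisfies the list above, a short check along these lines may help. It is a sketch that relies only on the standard library (`importlib.metadata`, Python 3.8+). The distribution names are copied from the `pip` command, and the helpers `version_tuple`, `meets_minimum`, and `report` are hypothetical. The comparison looks at numeric components only and ignores pre-release tags.

```python
import re
from importlib import metadata


def version_tuple(version):
    """Turn '0.4.1' or '1.13.1+cu117' into a comparable tuple of ints."""
    public = version.split("+", 1)[0]  # drop local build tags such as +cu117
    numbers = []
    for part in public.split("."):
        match = re.match(r"\d+", part)
        if match is None:
            break
        numbers.append(int(match.group()))
    return tuple(numbers)


def meets_minimum(installed, minimum):
    """Return True if `installed` is at least `minimum`."""
    return version_tuple(installed) >= version_tuple(minimum)


def report(requirements):
    """Print the status of each (distribution name, minimum or None) pair."""
    for name, minimum in requirements:
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            print("%s: not installed" % name)
            continue
        ok = minimum is None or meets_minimum(installed, minimum)
        status = "ok" if ok else "older than " + minimum
        print("%s %s: %s" % (name, installed, status))


report([("torch", "0.4.1"), ("requests", None), ("nltk", None)])
```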

## Usage
```
python3 train.py --gpu 0
```

## Speed

On an AWS p3.2xlarge instance, training takes 3.18 s per epoch with a batch size of 256.
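
To put the per-epoch figure in perspective, it can be converted into per-batch time and throughput. The sketch below assumes that one epoch covers the 8,544 sentences of the standard SST training split, which this README does not state. Under that assumption the figure works out to 34 batches per epoch, about 94 ms per batch, and roughly 2,700 sentences per second.

```python
import math

# Figures taken from the text above.
SECONDS_PER_EPOCH = 3.18
BATCH_SIZE = 256

# Assumption (not stated in this README): one epoch covers the
# 8,544 sentences of the standard SST training split.
TRAIN_SENTENCES = 8544

# The final batch is partial, so round up.
batches_per_epoch = math.ceil(TRAIN_SENTENCES / BATCH_SIZE)
seconds_per_batch = SECONDS_PER_EPOCH / batches_per_epoch
sentences_per_second = TRAIN_SENTENCES / SECONDS_PER_EPOCH

print("batches per epoch:    %d" % batches_per_epoch)
print("time per batch:       %.1f ms" % (seconds_per_batch * 1000))
print("sentences per second: %.0f" % sentences_per_second)
```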