installation.rst 2.9 KB
Newer Older
1
2
3
Installation
================================================

thomwolf's avatar
thomwolf committed
4
PyTorch-Transformers is tested on Python 2.7 and 3.5+ (examples are tested only on python 3.5+) and PyTorch 1.1.0
5
6
7
8

With pip
^^^^^^^^

thomwolf's avatar
thomwolf committed
9
PyTorch Transformers can be installed using pip as follows:
10
11
12

.. code-block:: bash

thomwolf's avatar
thomwolf committed
13
   pip install pytorch-transformers
14

thomwolf's avatar
thomwolf committed
15
16
From source
^^^^^^^^^^^
17

thomwolf's avatar
thomwolf committed
18
To install from source, clone the repository and install with:
19

thomwolf's avatar
thomwolf committed
20
.. code-block:: bash
21

thomwolf's avatar
thomwolf committed
22
23
24
    git clone https://github.com/huggingface/pytorch-transformers.git
    cd pytorch-transformers
    pip install [--editable] .
25
26


thomwolf's avatar
thomwolf committed
27
28
Tests
^^^^^
29

thomwolf's avatar
thomwolf committed
30
An extensive test suite is included to test the library behavior and several examples. Library tests can be found in the `tests folder <https://github.com/huggingface/pytorch-transformers/tree/master/pytorch_transformers/tests>`_ and examples tests in the `examples folder <https://github.com/huggingface/pytorch-transformers/tree/master/examples>`_.
31

thomwolf's avatar
thomwolf committed
32
Tests can be run using `pytest` (install pytest if needed with `pip install pytest`).
33

thomwolf's avatar
thomwolf committed
34
Run all the tests from the root of the cloned repository with the commands:
35
36
37

.. code-block:: bash

thomwolf's avatar
thomwolf committed
38
39
    python -m pytest -sv ./pytorch_transformers/tests/
    python -m pytest -sv ./examples/
40
41


thomwolf's avatar
thomwolf committed
42
43
OpenAI GPT original tokenization workflow
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
44

thomwolf's avatar
thomwolf committed
45
If you want to reproduce the original tokenization process of the ``OpenAI GPT`` paper, you will need to install ``ftfy`` (use version 4.4.3 if you are using Python 2) and ``SpaCy`` :
46
47
48

.. code-block:: bash

thomwolf's avatar
thomwolf committed
49
50
51
52
   pip install spacy ftfy==4.4.3
   python -m spacy download en

If you don't install ``ftfy`` and ``SpaCy``\ , the ``OpenAI GPT`` tokenizer will default to tokenize using BERT's ``BasicTokenizer`` followed by Byte-Pair Encoding (which should be fine for most usage, don't worry).
53
54


55
56
57
58
59
60
Note on model downloads (Continuous Integration or large-scale deployments)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

If you expect to be downloading large volumes of models (more than 1,000) from our hosted bucket (for instance through your CI setup, or a large-scale production deployment), please cache the model files on your end. It will be way faster, and cheaper. Feel free to contact us privately if you need any help.


61
62
63
64
65
66
67
68
69
70
Do you want to run a Transformer model on a mobile device?
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

You should check out our `swift-coreml-transformers <https://github.com/huggingface/swift-coreml-transformers>`_ repo.

It contains an example of a conversion script from a Pytorch trained Transformer model (here, ``GPT-2``) to a CoreML model that runs on iOS devices.

It also contains an implementation of BERT for Question answering.

At some point in the future, you'll be able to seamlessly move from pre-training or fine-tuning models in PyTorch to productizing them in CoreML,
71
or prototype a model or an app in CoreML then research its hyperparameters or architecture from PyTorch. Super exciting!