# How to contribute to transformers?

Everyone is welcome to contribute, and we value everybody's contribution. Code
is thus not the only way to help the community. Answering questions, helping
others, reaching out and improving the documentation are immensely valuable to
the community.

It also helps us if you spread the word: reference the library from blog posts
on the awesome projects it made possible, shout out on Twitter every time it has
helped you, or simply star the repo to say "thank you".

## You can contribute in so many ways!

There are 4 ways you can contribute to transformers:
* Fixing outstanding issues with the existing code;
* Implementing new models;
* Contributing to the examples or to the documentation;
* Submitting issues related to bugs or desired new features.

*All are equally valuable to the community.*

## Submitting a new issue or feature request

Do your best to follow these guidelines when submitting an issue or a feature
request. It will make it easier for us to come back to you quickly and with good
feedback.

### Did you find a bug?

The `transformers` library is robust and reliable thanks to the users who
notify us of the problems they encounter. So thank you for reporting an issue.

First, we would really appreciate it if you could **make sure the bug was not
already reported** (use the search bar on GitHub under Issues).

Did not find it? :( So we can act quickly on it, please follow these steps:

* Include your **OS type and version**, the versions of **Python**, **PyTorch** and
  **TensorFlow** when applicable;
* Include a short, self-contained code snippet that allows us to reproduce the bug in
  less than 30s;
* Provide the *full* traceback if an exception is raised.
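
A report that covers the last two points could be built around a script like the one below. It is only a sketch: `reproduce_bug` is a hypothetical placeholder for the few lines that trigger your problem, and the `ValueError` it raises stands in for the real failure.

```python
import traceback


def reproduce_bug():
    # Hypothetical placeholder: replace the body with the shortest code that
    # triggers the problem you want to report.
    raise ValueError("stand-in for the real failure")


def run_and_capture(func):
    """Run `func` and return the full traceback as text, or None on success."""
    try:
        func()
    except Exception:
        # format_exc() returns the complete traceback of the exception being
        # handled, which is what the issue should include verbatim.
        return traceback.format_exc()
    return None


if __name__ == "__main__":
    report = run_and_capture(reproduce_bug)
    print(report if report is not None else "No exception was raised.")
```

Pasting the printed text into the issue unchanged keeps every frame of the traceback, which is usually more helpful than the last line alone.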

To get the OS and software versions automatically, you can run the following command:

```bash
transformers-cli env
```

or from the root of the repository the following command:

```bash
python src/transformers/commands/transformers_cli.py env
```
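
If neither command works, for example because the installation itself is broken, similar information can be gathered by hand. The snippet below is a minimal sketch and not what `transformers-cli env` actually prints: `collect_env_info` is a hypothetical helper, and the list of fields and package names is an assumption.

```python
import platform
from importlib import metadata


def collect_env_info(packages=("transformers", "torch", "tensorflow")):
    """Return a dict with the OS, the Python version and package versions."""
    info = {
        "platform": platform.platform(),
        "python": platform.python_version(),
    }
    for name in packages:
        try:
            # Reads the installed distribution's metadata without importing
            # the package, so a broken install does not crash the script.
            info[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            # Assumption: the distribution is published under this exact
            # name; variants with a different name will show up as missing.
            info[name] = "not installed"
    return info


if __name__ == "__main__":
    for key, value in collect_env_info().items():
        print(f"- {key}: {value}")
```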


### Do you want to implement a new model?

Awesome! Please provide the following information:

* Short description of the model and link to the paper;
* Link to the implementation if it is open-source;
* Link to the model weights if they are available.

If you are willing to contribute the model yourself, let us know so we can best
guide you.

We have added a **detailed guide and templates** to help you through the process of adding a new model. You can find them in the [`templates`](./templates) folder.

### Do you want a new feature (that is not a model)?

A world-class feature request addresses the following points:

1. Motivation first:
  * Is it related to a problem/frustration with the library? If so, please explain
    why. Providing a code snippet that demonstrates the problem is best.
  * Is it related to something you would need for a project? We'd love to hear
    about it!
  * Is it something you worked on and think could benefit the community?
    Awesome! Tell us what problem it solved for you.
2. Write a *full paragraph* describing the feature;
3. Provide a **code snippet** that demonstrates its future use;
4. In case this is related to a paper, please attach a link;
5. Attach any additional information (drawings, screenshots, etc.) you think may help.

If your issue is well written, we're already 80% of the way there by the time you
post it.

We have added **templates** to guide you in the process of adding a new example script for training or testing the models in the library. You can find them in the [`templates`](./templates) folder.

## Start contributing! (Pull Requests)

Before writing code, we strongly advise you to search through the existing PRs or
issues to make sure that nobody is already working on the same thing. If you are
unsure, it is always a good idea to open an issue to get some feedback.

You will need basic `git` proficiency to be able to contribute to
`transformers`. `git` is not the easiest tool to use but it has the greatest
manual. Type `git --help` in a shell and enjoy. If you prefer books, [Pro
Git](https://git-scm.com/book/en/v2) is a very good reference.

Follow these steps to start contributing:

1. Fork the [repository](https://github.com/huggingface/transformers) by
   clicking on the 'Fork' button on the repository's page. This creates a copy of the code
   under your GitHub user account.

2. Clone your fork to your local disk, and add the base repository as a remote:

   ```bash
   $ git clone git@github.com:<your Github handle>/transformers.git
   $ cd transformers
   $ git remote add upstream https://github.com/huggingface/transformers.git
   ```

3. Create a new branch to hold your development changes:

   ```bash
   $ git checkout -b a-descriptive-name-for-my-changes
   ```

   **Do not** work on the `master` branch.

4. Set up a development environment by running the following command in a virtual environment:

   ```bash
   $ pip install -e ".[dev]"
   ```

   (If transformers was already installed in the virtual environment, remove
   it with `pip uninstall transformers` before reinstalling it in editable
   mode with the `-e` flag.)

   Right now, we need an unreleased version of `isort` to avoid a
   [bug](https://github.com/timothycrosley/isort/pull/1000):

   ```bash
   $ pip install -U git+https://github.com/timothycrosley/isort.git@e63ae06ec7d70b06df9e528357650281a3d3ec22#egg=isort
   ```

5. Develop the features on your branch.

   As you work on the features, you should make sure that the test suite
   passes:

   ```bash
   $ make test
   ```

   `transformers` relies on `black` and `isort` to format its source code
   consistently. After you make changes, format them with:

   ```bash
   $ make style
   ```

   `transformers` also uses `flake8` to check for coding mistakes. Quality
   control runs in CI, but you can also run the same checks with:

   ```bash
   $ make quality
   ```

   Once you're happy with your changes, add changed files using `git add` and
   make a commit with `git commit` to record your changes locally:

   ```bash
   $ git add modified_file.py
   $ git commit
   ```

   Please write [good commit
   messages](https://chris.beams.io/posts/git-commit/).

   It is a good idea to sync your copy of the code with the original
   repository regularly. This way you can quickly account for changes:

   ```bash
   $ git fetch upstream
   $ git rebase upstream/master
   ```

   Push the changes to your account using:

   ```bash
   $ git push -u origin a-descriptive-name-for-my-changes
   ```

6. Once you are satisfied (**and the checklist below is happy too**), go to the
   webpage of your fork on GitHub. Click on 'Pull request' to send your changes
   to the project maintainers for review.

7. It's ok if maintainers ask you for changes. It happens to core contributors
   too! So everyone can see the changes in the Pull request, work in your local
   branch and push the changes to your fork. They will automatically appear in
   the pull request.


### Checklist

1. The title of your pull request should be a summary of its contribution;
2. If your pull request addresses an issue, please mention the issue number in
   the pull request description to make sure they are linked (and people
   consulting the issue know you are working on it);
3. To indicate a work in progress, please prefix the title with `[WIP]`. This
   helps avoid duplicated work and differentiates it from PRs ready
   to be merged;
4. Make sure existing tests pass;
5. Add high-coverage tests. No quality testing = no merge.
   - If you are adding a new model, make sure that you use `ModelTester.all_model_classes = (MyModel, MyModelWithLMHead,...)`, which triggers the common tests.
   - If you are adding new `@slow` tests, make sure they pass using `RUN_SLOW=1 python -m pytest tests/test_my_new_model.py`. CircleCI does not run them.
   - If you are adding a new tokenizer, write tests, and make sure `RUN_SLOW=1 python -m pytest tests/test_tokenization_{your_model_name}.py` passes.
6. All public methods must have informative docstrings that work nicely with sphinx. See `modeling_ctrl.py` for an example.
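
To illustrate the first sub-point of item 5, here is a toy version of the pattern: a mixin holds the shared tests and loops over whatever classes the concrete test case lists in `all_model_classes`. Everything in it (`CommonTestsMixin`, `MyModel`, `MyModelWithLMHead`, the `forward` check, `run_common_tests`) is hypothetical and much simpler than the library's real test utilities.

```python
import unittest


class MyModel:
    """Hypothetical stand-in for a real model class."""

    def forward(self, inputs):
        return [2 * x for x in inputs]


class MyModelWithLMHead(MyModel):
    """Hypothetical variant with an extra head on top of the base model."""

    def forward(self, inputs):
        return [x + 1 for x in super().forward(inputs)]


class CommonTestsMixin:
    # Concrete test cases override this with the classes they want covered.
    all_model_classes = ()

    def test_forward_preserves_length(self):
        for model_class in self.all_model_classes:
            # subTest reports which class failed without stopping the loop.
            with self.subTest(model_class=model_class.__name__):
                outputs = model_class().forward([1, 2, 3])
                self.assertEqual(len(outputs), 3)


class MyModelTest(CommonTestsMixin, unittest.TestCase):
    # Listing the classes here is what makes the shared tests run on them.
    all_model_classes = (MyModel, MyModelWithLMHead)


def run_common_tests():
    """Run the shared tests for MyModelTest and return the result object."""
    suite = unittest.defaultTestLoader.loadTestsFromTestCase(MyModelTest)
    return unittest.TextTestRunner(verbosity=0).run(suite)
```

Because the mixin itself does not inherit from `unittest.TestCase`, test discovery only picks up the concrete `MyModelTest`, so the shared tests never run against an empty class list.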

### Tests

You can run 🤗 Transformers tests with `unittest` or `pytest`.

We like `pytest` and `pytest-xdist` because it's faster. From the root of the
repository, here's how to run tests with `pytest` for the library:

```bash
$ python -m pytest -n auto --dist=loadfile -s -v ./tests/
```

and for the examples:

```bash
$ pip install -r examples/requirements.txt  # only needed the first time
$ python -m pytest -n auto --dist=loadfile -s -v ./examples/
```

In fact, that's how `make test` and `make test-examples` are implemented!

You can specify a smaller set of tests in order to test only the feature
you're working on.

By default, slow tests are skipped. Set the `RUN_SLOW` environment variable to
`yes` to run them. This will download many gigabytes of models — make sure you
have enough disk space and a good Internet connection, or a lot of patience!

```bash
$ RUN_SLOW=yes python -m pytest -n auto --dist=loadfile -s -v ./tests/
$ RUN_SLOW=yes python -m pytest -n auto --dist=loadfile -s -v ./examples/
```

Likewise, set the `RUN_CUSTOM_TOKENIZERS` environment variable to `yes` to run
tests for custom tokenizers, which don't run by default either.
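
The opt-in behaviour described above can be pictured with a small decorator that skips a test unless an environment variable is set. This is a sketch of the general mechanism, not the library's actual implementation: `env_flag` and the set of accepted values are assumptions, and the `slow` defined here is a local stand-in.

```python
import os
import unittest


def env_flag(name, default=False):
    """Interpret an environment variable as a boolean switch."""
    value = os.environ.get(name)
    if value is None:
        return default
    # Assumption: these spellings count as "on"; anything else is "off".
    return value.strip().lower() in {"1", "true", "yes", "y", "on"}


def slow(test_item):
    """Skip the decorated test unless RUN_SLOW is enabled."""
    return unittest.skipUnless(env_flag("RUN_SLOW"), "test is slow")(test_item)


class ExampleTest(unittest.TestCase):
    def test_fast(self):
        self.assertEqual(1 + 1, 2)

    @slow
    def test_slow(self):
        # Imagine a multi-gigabyte model download happening here.
        self.assertTrue(True)
```

In this sketch the flag is read when the decorator is applied, that is, when the test module is imported, so the variable has to be set before the test run starts.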

🤗 Transformers uses `pytest` as a test runner only. It doesn't use any
`pytest`-specific features in the test suite itself.

This means `unittest` is fully supported. Here's how to run tests with
`unittest`:

```bash
$ python -m unittest discover -s tests -t . -v
$ python -m unittest discover -s examples -t examples -v
```


### Style guide

For documentation strings, `transformers` follows the [Google
style](https://google.github.io/styleguide/pyguide.html).
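
As an illustration of that style, a documented function could look like the sketch below. `truncate_tokens` is a hypothetical helper written for this example, not a function of the library.

```python
def truncate_tokens(tokens, max_length):
    """Truncate a list of tokens to a maximum length.

    Args:
        tokens (list of str): The tokens to truncate.
        max_length (int): The maximum number of tokens to keep. Must be
            non-negative.

    Returns:
        list of str: A new list containing at most ``max_length`` tokens.

    Raises:
        ValueError: If ``max_length`` is negative.
    """
    if max_length < 0:
        raise ValueError("max_length must be non-negative")
    return list(tokens[:max_length])
```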

#### This guide was heavily inspired by the awesome [scikit-learn guide to contributing](https://github.com/scikit-learn/scikit-learn/blob/master/CONTRIBUTING.md)