Commit d1a3b52f authored by Tri Dao

Add instruction about limiting number of ninja jobs

parent b4cc152e
@@ -54,6 +54,14 @@ Alternatively you can compile from source:
```
python setup.py install
```
If your machine has less than 96GB of RAM and many CPU cores, `ninja` might
run too many parallel compilation jobs and exhaust the available RAM. To
limit the number of parallel compilation jobs, set the environment
variable `MAX_JOBS`:
```
MAX_JOBS=4 pip install flash-attn --no-build-isolation
```
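As a rough illustration of how a value such as `4` might be chosen, the sketch below derives a job count from total RAM and core count. The helper name `pick_max_jobs` and the 8 GB-per-job budget are assumptions made for this example, not figures from the project; the actual memory used per compilation job depends on the compiler and build configuration.

```shell
# Hypothetical helper (not part of flash-attn): suggest a MAX_JOBS value.
# Arguments: total RAM in GB, number of CPU cores, optional GB budget per job.
# The default of 8 GB per job is an assumed figure, not a measured one.
pick_max_jobs() {
    pmj_ram_gb=$1
    pmj_cores=$2
    pmj_gb_per_job=${3:-8}

    # Integer division: how many jobs fit into the RAM budget.
    pmj_jobs=$(( pmj_ram_gb / pmj_gb_per_job ))

    # Always allow at least one job.
    if [ "$pmj_jobs" -lt 1 ]; then
        pmj_jobs=1
    fi

    # Never suggest more jobs than there are cores.
    if [ "$pmj_jobs" -gt "$pmj_cores" ]; then
        pmj_jobs=$pmj_cores
    fi

    echo "$pmj_jobs"
}

# Example: a machine with 32 GB of RAM and 16 cores.
echo "MAX_JOBS=$(pick_max_jobs 32 16)"
```

Under these assumptions the example prints `MAX_JOBS=4`, which can then be passed to the install command in the same way as above: `MAX_JOBS=$(pick_max_jobs 32 16) pip install flash-attn --no-build-isolation`.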
Interface: `src/flash_attention_interface.py`
FlashAttention-2 currently supports:
...