-`NVSHMEM_DIR`: the path to the NVSHMEM directory; if not specified, all internode and low-latency features are disabled
-`DISABLE_SM90_FEATURES`: 0 or 1, whether to disable SM90 features; setting it to 1 is required for non-SM90 devices or CUDA 11
-`TORCH_CUDA_ARCH_LIST`: the list of target architectures, e.g. `TORCH_CUDA_ARCH_LIST="9.0"`
-`DISABLE_AGGRESSIVE_PTX_INSTRS`: 0 or 1, whether to disable aggressive load/store instructions; see [Undefined-behavior PTX usage](#undefined-behavior-ptx-usage) for more details
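
As a minimal sketch of how the variables above might be combined for an SM90 build, the snippet below exports them in the shell before building. The NVSHMEM path is a hypothetical placeholder, and the choice of values is an assumption for illustration, not a recommended configuration.

```shell
# Hypothetical install location; replace with your actual NVSHMEM directory.
export NVSHMEM_DIR="/path/to/nvshmem"

# Keep SM90 features enabled (assumes an SM90 device and a CUDA version newer than 11).
export DISABLE_SM90_FEATURES=0

# Target architecture list, matching the example given in the list above.
export TORCH_CUDA_ARCH_LIST="9.0"

# Illustrative choice: opt out of the aggressive load/store PTX instructions.
export DISABLE_AGGRESSIVE_PTX_INSTRS=1

# Run the project's build or install command afterwards in this same shell
# so that it inherits the variables.
```

Leaving `NVSHMEM_DIR` unset would instead produce a build without internode and low-latency features.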
...
...
@@ -137,7 +137,7 @@ Buffer.set_num_sms(24)
# You may call this function at the framework initialization