Commit 3c4b7890 authored by Michael Carilli

Merge branch 'api_refactor' of https://github.com/NVIDIA/apex into api_refactor

parents 254cad2d a128fa54
@@ -103,7 +103,7 @@ python setup.py install [--cuda_ext] [--cpp_ext]
Currently, `--cuda_ext` enables
- Fused kernels that improve the performance and numerical stability of `apex.parallel.SyncBatchNorm`.
- Fused kernels required to use `apex.optimizers.FusedAdam`.
-- Fused kernels required to use 'apex.normalization.FusedLayerNorm'.
+- Fused kernels required to use `apex.normalization.FusedLayerNorm`.
`--cpp_ext` enables
- C++-side flattening and unflattening utilities that reduce the CPU overhead of `apex.parallel.DistributedDataParallel`.