"vscode:/vscode.git/clone" did not exist on "1b5b7de5daef4fbc93934e525ab3c0e8c7d029d1"
- 10 Apr, 2019 3 commits
-
-
Lam Dang authored
-
Lam Dang authored
-
Michael Carilli authored
-
- 04 Apr, 2019 1 commit
-
-
mcarilli authored
* Refactor to allow more flexible treatment of multiple optimizers/models/losses * Adding _process_optimizers.py * Created L0 tests (now passing). * fix: minor print typo (#234) * make L1 results easier to read * L0 multiple model/optimizer/loss test fleshed out * Adding test that master params remain synced across distributed processes * Docstring updates * Docstring updates
-
- 22 Mar, 2019 1 commit
-
-
mcarilli authored
* Adding Torch + bare-metal nvcc version check and container build tests * Putting a canary in the coalmine * canary proved elusive * Trying direct setup.py install * this should work * Removing canary * hopefully this works
-
- 19 Mar, 2019 1 commit
-
-
Michael Carilli authored
-
- 13 Mar, 2019 1 commit
-
-
Michael Carilli authored
-
- 12 Mar, 2019 1 commit
-
-
Michael Carilli authored
-
- 10 Mar, 2019 1 commit
-
-
Michael Carilli authored
-
- 08 Mar, 2019 3 commits
-
-
Michael Carilli authored
-
Michael Carilli authored
-
Michael Carilli authored
-
- 07 Mar, 2019 1 commit
-
-
Michael Carilli authored
-
- 02 Mar, 2019 1 commit
-
-
Michael Carilli authored
-
- 01 Mar, 2019 4 commits
-
-
Michael Carilli authored
-
Michael Carilli authored
-
Michael Carilli authored
-
Michael Carilli authored
-
- 28 Feb, 2019 1 commit
-
-
Michael Carilli authored
-
- 26 Feb, 2019 1 commit
-
-
Michael Carilli authored
-
- 24 Feb, 2019 1 commit
-
-
Michael Carilli authored
-
- 22 Feb, 2019 1 commit
-
-
Michael Carilli authored
Allow multi-tensor unscale to handle FP16 output, so it can also be used for copy-scatter. Rename some options.
-
- 19 Feb, 2019 1 commit
-
-
Michael Carilli authored
-
- 16 Feb, 2019 3 commits
- 13 Feb, 2019 1 commit
-
-
Michael Carilli authored
-
- 08 Feb, 2019 2 commits
-
-
Evgeni Krimer authored
-
Evgeni Krimer authored
-
- 06 Feb, 2019 1 commit
-
-
Michael Carilli authored
-
- 05 Feb, 2019 1 commit
-
-
Jerry Ma authored
This commit adds an FP16Model class as a successor to network_to_half. The benefits of this class are: - Preservation of single-precision for BatchNorm layers. The models generated by network_to_half() convert BatchNorm moment tensors to half-precision, then back to single-precision, which hurts the accuracy of the moment estimators and occasionally results in NaNs. - Support for multi-argument nn.Modules (self-explanatory from code).
-
- 03 Feb, 2019 1 commit
-
-
Michael Carilli authored
-
- 01 Feb, 2019 1 commit
-
-
Michael Carilli authored
-
- 29 Jan, 2019 3 commits
- 28 Jan, 2019 1 commit
-
-
jiej authored
test update to resolve https://github.com/NVIDIA/apex/issues/134#issue-403525480 Using identical learning rate for both DDP with sync BN and single process BN. The previous configure leaves the impression that sync BN requires adjusting lr in the script, which is not true.
-
- 25 Jan, 2019 1 commit
-
-
Michael Carilli authored
-
- 15 Jan, 2019 1 commit
-
-
Jie authored
Added kernel to support sync BN for channel last tensor
-
- 15 Dec, 2018 1 commit
-
-
Deyu Fu authored
-