"megatron/vscode:/vscode.git/clone" did not exist on "c9a595549c6dbf8338d4b13dab4a758ccdcb110c"
- 28 Mar, 2019 4 commits
-
-
Shining Sun authored
* initial commit * bug fix * Move build_stats from common to keras main, because it is only applicable in keras * remove trailing blank line * add test for synth data * add kwargs to init * add kwargs to function invocation * correctly pass kwargs * debug * debug * debug * fix super init * bug fix * fix local_flags * fix import * bug fix * fix log_steps flag * bug fix * bug fix: add missing return value * resolve double-defined flags * lint fix * move log_steps flag to benchmark flag * fix lint * lint fix * lint fix * try flag core default values * bug fix * bug fix * bug fix * debug * debug * remove debug prints * rename benchmark methods * flag bug fix for synth benchmark
-
Ayush Dubey authored
-
Shining Sun authored
* Initial commit * Finished * bug fix * bug fix * bug fix * Resolve review comments * Typo fix * resolve comments * fix number error * Resolve comments
-
Haoyu Zhang authored
-
- 27 Mar, 2019 1 commit
-
-
Haoyu Zhang authored
-
- 26 Mar, 2019 1 commit
-
-
Yuefeng Zhou authored
required by multi-node collective ops in eager mode.
-
- 25 Mar, 2019 1 commit
-
-
Haoyu Zhang authored
-
- 22 Mar, 2019 1 commit
-
-
Haoyu Zhang authored
-
- 20 Mar, 2019 2 commits
-
-
Ayush Dubey authored
* Add `input_context` to `input_fn` in cifar10_main. * Change sharding log message to be consistent with `dataset.shard` params. * Lint
-
Haoyu Zhang authored
-
- 19 Mar, 2019 3 commits
-
-
Haoyu Zhang authored
-
Ayush Dubey authored
* Shard input for distribution strategy. * Pass in input_context from real input_fn. * Pass in input_context from real input_fn. * Make pipeline id base 1 for better readability.
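A minimal sketch of what input sharding with a 1-based pipeline id in the log might look like. It is written in plain Python rather than TensorFlow: `shard` mimics the documented behaviour of `dataset.shard(num_shards, index)` (keep every `num_shards`-th element starting at `index`), while `describe_pipeline` and its message text are hypothetical and not taken from the repository.

```python
def shard(elements, num_shards, index):
    """Keep the elements whose position modulo num_shards equals index."""
    return [x for i, x in enumerate(elements) if i % num_shards == index]


def describe_pipeline(input_pipeline_id, num_input_pipelines):
    """Hypothetical log message: shows the 0-based pipeline id as 1-based."""
    return "Sharding input for pipeline %d of %d" % (
        input_pipeline_id + 1, num_input_pipelines)


records = list(range(6))
# Two input pipelines, each reading a disjoint half of the records.
first_half = shard(records, num_shards=2, index=0)
second_half = shard(records, num_shards=2, index=1)
print(describe_pipeline(0, 2), first_half)
print(describe_pipeline(1, 2), second_half)
```

The shard index itself stays 0-based, as `dataset.shard` expects; only the human-readable message is shifted by one.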
-
Soroush Radpour authored
-
- 13 Mar, 2019 1 commit
-
-
Toby Boyd authored
-
- 12 Mar, 2019 2 commits
- 11 Mar, 2019 1 commit
-
-
pkanwar23 authored
* Adding LARS to ResNet * Fixes for the LARS patch * Fixes for the LARS patch * more fixes * 1 more fix
-
- 07 Mar, 2019 3 commits
-
-
Ayush Dubey authored
-
Ayush Dubey authored
* s/CollectiveAllReduceStrategy/MultiWorkerMirroredStrategy * More s/contrib.distribute/distribute.experimental * Collective communication options in MultiWorkerMirroredStrategy. * Minor fixes * No checkpointing if multi worker. * turn off checkpointing * fix lint
-
Reed authored
-
- 06 Mar, 2019 1 commit
-
-
Reed authored
* Mixed precision support * Add TODOs
-
- 02 Mar, 2019 1 commit
-
-
Taylor Robie authored
* fix resnet breakage and add keras end-to-end tests * delint * address PR comments
-
- 01 Mar, 2019 3 commits
-
-
Shining Sun authored
* tmp commit * tmp commit * first attempt (without eval) * Bug fixes * bug fixes * training done * Loss NAN, no eval * Loss weight problem solved * resolve the NAN loss problem * Problem solved. Clean up needed * Added a todo * Remove debug prints * Extract get_optimizer to ncf_common * Move metrics computation back to neumf; use DS.scope api * Extract DS.scope code to utils * lint fixes * Move obtaining DS above producer.start to avoid race condition * move pt 1 * move pt 2 * Update the run script * Wrap keras_model related code into functions * Update the doc for softmax_logitfy and change the method name * Resolve PR comments * working version with: eager, DS, batch and no masks * Remove git conflict indicator * move reshape to neumf_model * working version, not converge * converged * fix a test * more lint fix * more lint fix * more lint fixes * more lint fix * Removed unused i...
-
Haoyu Zhang authored
* Added XLA test with a monkey-patched op to avoid OOM * Added doc strings in Keras benchmarks to avoid Lint error
-
Yash Katariya authored
-
- 28 Feb, 2019 1 commit
-
-
Yuefeng Zhou authored
* Add benchmarks for thread tuning. * Address comment. * Add a comment.
-
- 25 Feb, 2019 1 commit
-
-
Dong Lin authored
Add root_data_dir to constructor of Resnet50KerasBenchmarkSynth and Resnet50KerasBenchmarkReal (#6259)
-
- 22 Feb, 2019 4 commits
-
-
Dong Lin authored
-
guptapriya authored
* Remove isinstance change for contrib strategy. Replace it with a class name check instead, which should work regardless * Add quotes for string * fix quote type
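A rough illustration of the class name check idea, assuming the goal is to recognise a strategy object without importing its class. The `MirroredStrategy` class below is a local stand-in and `is_strategy_named` is a hypothetical helper; neither is the repository's actual code.

```python
class MirroredStrategy:
    """Stand-in class for illustration only (not the TensorFlow class)."""


def is_strategy_named(strategy, name):
    """Return True if the object's class is called `name`.

    Unlike isinstance, this needs no reference to the class itself, so it
    works whichever module the class was imported from. It does not match
    subclasses that have a different name.
    """
    return type(strategy).__name__ == name


print(is_strategy_named(MirroredStrategy(), "MirroredStrategy"))
print(is_strategy_named(object(), "MirroredStrategy"))
```

The trade-off is that a name comparison is looser than `isinstance`: any class with the same name matches, and subclasses with other names do not.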
-
Dong Lin authored
This is needed to avoid breaking benchmark execution if PerfZero provides more named arguments before the benchmark class constructor is updated.
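The reasoning above can be sketched as follows, assuming a constructor that accepts `**kwargs`. `ExampleBenchmark`, `output_dir`, and `new_harness_option` are hypothetical names chosen for illustration; only `root_data_dir` appears in this log.

```python
class ExampleBenchmark:
    """Hypothetical benchmark class, not the repository's actual code."""

    def __init__(self, output_dir=None, root_data_dir=None, **kwargs):
        # Named arguments the harness may start passing in the future are
        # collected in kwargs instead of raising TypeError.
        self.output_dir = output_dir
        self.root_data_dir = root_data_dir
        self.ignored_kwargs = dict(kwargs)


benchmark = ExampleBenchmark(
    output_dir="/tmp/out",
    root_data_dir="/data",
    new_harness_option=True,  # hypothetical argument the class does not know yet
)
print(benchmark.ignored_kwargs)
```

Without `**kwargs`, the same call would fail with a `TypeError` for the unexpected keyword argument until the constructor was updated.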
-
Dong Lin authored
* Allow user to specify root_data_dir in the benchmark class constructor * Address comments
-
- 21 Feb, 2019 2 commits
-
-
Ayush Dubey authored
* Update official resnet for multi worker training with distribution strategies. * Fixes for multi worker training. * Fix call to `get_distribution_strategy`. * Undo test change. * Fix spacing. * Move cluster configuration to distribution_utils. * Move train_and_evaluate out of loop. Also, update docstrings for multi-worker flags and add use_train_and_evaluate flag. * Update distribution_strategy flag to match exported name for collective strategy.
-
Haoyu Zhang authored
* Add flag to enable XLA in Keras models * Fix lint errors (some of them are old errors)
-
- 19 Feb, 2019 1 commit
-
-
Yuefeng Zhou authored
-
- 15 Feb, 2019 1 commit
-
-
Toby Boyd authored
-
- 14 Feb, 2019 5 commits
-
-
Toby Boyd authored
-
Toby Boyd authored
-
Toby Boyd authored
-
Yuefeng Zhou authored
* Workaround for memory issue in eager mode. * Add a TODO * Fix typo * Address comments * remove patch which appears hacky. * fix typo
-
Toby Boyd authored
* Add exp_per_second history to hook. - Add tracking exp_per_second to benchmark tests. * remove turn off dist strat. * Average all results.
-