- 28 Mar, 2019 3 commits
-
-
Shining Sun authored
* initial commit * bug fix * Move build_stats from common to keras main, because it is only applicable in keras * remove trailing blank line * add test for synth data * add kwargs to init * add kwargs to function invocation * correctly pass kwargs * debug * debug * debug * fix super init * bug fix * fix local_flags * fix import * bug fix * fix log_steps flag * bug fix * bug fix: add missing return value * resolve double-defined flags * lint fix * move log_steps flag to benchmark flag * fix lint * lint fix * lint fix * try flag core default values * bug fix * bug fix * bug fix * debug * debug * remove debug prints * rename benchmark methods * flag bug fix for synth benchmark
-
Shining Sun authored
* Initial commit * Finished * bug fix * bug fix * bug fix * Resolve review comments * Typo fix * resolve comments * fix number error * Resolve comments
-
Haoyu Zhang authored
-
- 27 Mar, 2019 1 commit
-
-
Haoyu Zhang authored
-
- 26 Mar, 2019 1 commit
-
-
Yuefeng Zhou authored
required by multi-node collective ops in eager mode.
-
- 25 Mar, 2019 1 commit
-
-
Haoyu Zhang authored
-
- 22 Mar, 2019 1 commit
-
-
Haoyu Zhang authored
-
- 20 Mar, 2019 1 commit
-
-
Haoyu Zhang authored
-
- 19 Mar, 2019 2 commits
-
-
Haoyu Zhang authored
-
Soroush Radpour authored
-
- 13 Mar, 2019 1 commit
-
-
Toby Boyd authored
-
- 12 Mar, 2019 1 commit
-
-
Toby Boyd authored
* xla to bs=128 for num_gpu=8 * remove todo
-
- 07 Mar, 2019 1 commit
-
-
Reed authored
-
- 06 Mar, 2019 1 commit
-
-
Reed authored
* Mixed precision support * Add TODOs
-
- 02 Mar, 2019 1 commit
-
-
Taylor Robie authored
* fix resnet breakage and add keras end-to-end tests * delint * address PR comments
-
- 01 Mar, 2019 2 commits
-
-
Shining Sun authored
* tmp commit * tmp commit * first attempt (without eval) * Bug fixes * bug fixes * training done * Loss NAN, no eval * Loss weight problem solved * resolve the NAN loss problem * Problem solved. Clean up needed * Added a todo * Remove debug prints * Extract get_optimizer to ncf_common * Move metrics computation back to neumf; use DS.scope api * Extract DS.scope code to utils * lint fixes * Move obtaining DS above producer.start to avoid race condition * move pt 1 * move pt 2 * Update the run script * Wrap keras_model related code into functions * Update the doc for softmax_logitfy and change the method name * Resolve PR comments * working version with: eager, DS, batch and no masks * Remove git conflict indicator * move reshape to neumf_model * working version, not converge * converged * fix a test * more lint fix * more lint fix * more lint fixes * more lint fix * Removed unused imports * fix test * dummy commit for kicking off checks * fix lint issue * dummy input to kick off checks * dummy input to kick off checks * add collective to dist strat * addressed review comments * add a doc string
-
Haoyu Zhang authored
* Added XLA test with a monkey-patched op to avoid OOM * Added doc strings in Keras benchmarks to avoid Lint error
-
- 28 Feb, 2019 1 commit
-
-
Yuefeng Zhou authored
* Add benchmarks for thread tuning. * Address comment. * Add a comment.
-
- 25 Feb, 2019 1 commit
-
-
Dong Lin authored
Add root_data_dir to constructor of Resnet50KerasBenchmarkSynth and Resnet50KerasBenchmarkReal (#6259)
-
- 22 Feb, 2019 3 commits
- 21 Feb, 2019 1 commit
-
-
Haoyu Zhang authored
* Add flag to enable XLA in Keras models * Fix lint errors (some of them are old errors)
-
- 19 Feb, 2019 1 commit
-
-
Yuefeng Zhou authored
-
- 15 Feb, 2019 1 commit
-
-
Toby Boyd authored
-
- 14 Feb, 2019 3 commits
-
-
Toby Boyd authored
-
Toby Boyd authored
-
Yuefeng Zhou authored
* Workaround for memory issue in eager mode. * Add a TODO * Fix typo * Address comments * remove patch which appears hacky. * fix typo
-
- 13 Feb, 2019 2 commits
-
-
Toby Boyd authored
-
Yuefeng Zhou authored
* Add a flag to specify distribution strategies. * Fix a small error. * Address comments. * Address comments. * Fix typos.
-
- 12 Feb, 2019 2 commits
-
-
Toby Boyd authored
* fix test benchmark_graph_1_gpu_no_dist_strat failing - Failure only occurs when all 1_gpu tests are run together with the error: tensorflow.python.framework.errors_impl.NotFoundError: Resource localhost/logdir:/tmp/cifar10_model/ N10tensorflow22SummaryWriterInterfaceE does not exist. [Op:WriteScalarSummary] name: epoch_loss/ Another fix might be to generate a different model_dir in the core code, but that has other drawbacks such as restarting from the checkpoint. * Model_dir for all tests.
-
Toby Boyd authored
- Modest speedup for CIFAR-10 - Slightly greater speedup expected for ImageNet ResNet50.
-
- 11 Feb, 2019 1 commit
-
-
Toby Boyd authored
-
- 09 Feb, 2019 1 commit
-
-
Yuefeng Zhou authored
* Add pure synthetic data to keras resnet mode. * Add imports. * Address comments. * update comment * Undo set up synthetic data for real data path. * update comment * Address comment * Remove trailing whitespaces. * s/make_data_set_iterator/make_dataset_iterator/
-
- 08 Feb, 2019 1 commit
-
-
Goldie Gadde authored
This reverts commit 57e07520.
-
- 06 Feb, 2019 1 commit
-
-
Goldie Gadde authored
This reverts commit d6b2b83c.
-
- 05 Feb, 2019 1 commit
-
-
Goldie Gadde authored
* Add resnet56 short tests. (#6101) * Add resnet56 short tests. - created base benchmark module - renamed accuracy test class to contain the word Accuracy which will result in a need to update all the jobs and a loss of history but is worth it. - short tests are mostly copied from shining with oss refactor * Address feedback. * Move flag_methods to init - Address setting default flags repeatedly. * Rename accuracy tests. * Lint errors resolved. * fix model_dir set to flags.data_dir. * fixed not fully pulling out flag_methods. * Use core mirrored strategy in official models (#6126) * Imagenet short tests (#6132) * Add short imagenet tests (taken from seemuch) - also rename to match go forward naming * fix method name * Update doc strings. * Fix gpu number. * points default data_dir to child folder. (#6131) Failed test is python2 and was a kokoro failure * Imagenet short tests (#6136) * Add short imagenet tests (taken from seemuch) - also rename to match go forward naming * fix method name * Update doc strings. * Fix gpu number. * Add fill_objects * fixed calling wrong class in super. * fix lint issue. * Flag (#6121) * Fix the turn_off_ds flag problem * add param names to all args * Export benchmark stats using tf.test.Benchmark.report_benchmark() (#6103) * Export benchmark stats using tf.test.Benchmark.report_benchmark() * Fix python style using pyformat * Typos. (#6120) * log verbosity=2 logs every epoch no progress bars (#6142) * tf_upgrade_v2 on resnet and utils folder. * tf_upgrade_v2 on resnet and utils folder.
-
- 03 Feb, 2019 1 commit
-
-
Toby Boyd authored
-
- 01 Feb, 2019 2 commits
-
-
Dong Lin authored
* Export benchmark stats using tf.test.Benchmark.report_benchmark() * Fix python style using pyformat
-
Shining Sun authored
* Fix the turn_off_ds flag problem * add param names to all args
-