"git@developer.sourcefind.cn:zhaoyu6/sglang.git" did not exist on "c882b5ae7555b24d40b61df2de61b3441c02fbc8"
- 01 Mar, 2019 3 commits
-
-
Shining Sun authored
* tmp commit * tmp commit * first attempt (without eval) * Bug fixes * bug fixes * training done * Loss NAN, no eval * Loss weight problem solved * resolve the NAN loss problem * Problem solved. Clean up needed * Added a todo * Remove debug prints * Extract get_optimizer to ncf_common * Move metrics computation back to neumf; use DS.scope api * Extract DS.scope code to utils * lint fixes * Move obtaining DS above producer.start to avoid race condition * move pt 1 * move pt 2 * Update the run script * Wrap keras_model related code into functions * Update the doc for softmax_logitfy and change the method name * Resolve PR comments * working version with: eager, DS, batch and no masks * Remove git conflict indicator * move reshape to neumf_model * working version, not converge * converged * fix a test * more lint fix * more lint fix * more lint fixes * more lint fix * Removed unused imports * fix test * dummy commit for kicking of checks * fix lint issue * dummy input to kick off checks * dummy input to kick off checks * add collective to dist strat * addressed review comments * add a doc string
-
Haoyu Zhang authored
* Added XLA test with a monkey-patched op to avoid OOM * Added doc strings in Keras benchmarks to avoid Lint error
-
Yash Katariya authored
-
- 28 Feb, 2019 4 commits
-
-
Ayush Dubey authored
* s/CollectiveAllReduceStrategy/MultiWorkerMirroredStrategy * More s/contrib.distribute/distribute.experimental
-
Yuefeng Zhou authored
* Add benchmarks for thread tuning. * Address comment/ * Add a comment.
-
aquariusjay authored
-
Tayo Oguntebi authored
-
- 27 Feb, 2019 2 commits
-
-
aquariusjay authored
Open-source FEELVOS model, which was developed by Paul Voigtlaender during his 2018 summer internship at Google. The work has been accepted to CVPR 2019. (#6274)
-
Adrian Boguszewski authored
* Fixed incorrect tensor * Replaced xrange with range for python3 compatibility
-
- 25 Feb, 2019 1 commit
-
-
Dong Lin authored
Add root_data_dir to constructor of Resnet50KerasBenchmarkSynth and Resnet50KerasBenchmarkReal (#6259)
-
- 24 Feb, 2019 1 commit
-
-
huihui-personal authored
* \nRefactor deeplab to use MonitoredTrainingSession\n PiperOrigin-RevId: 234237190 * Update export_model.py * Update nas_cell.py * Update nas_network.py * Update train.py * Update deeplab_demo.ipynb * Update nas_cell.py
-
- 23 Feb, 2019 1 commit
-
-
Haichao Yu authored
-
- 22 Feb, 2019 4 commits
-
-
Dong Lin authored
-
guptapriya authored
* Remove isintance change for contrib strategy Replace it with class name check instead which should work regardless * Add quotes for string * fix quote type
-
Dong Lin authored
This is needed to avoid breaking benchmark execution if PerfZero provides more Named arguments before the benchmark class constructor is updated.
-
Dong Lin authored
* Allow user to specify root_data_dir in the benchmark class constructor * Address comments
-
- 21 Feb, 2019 5 commits
-
-
Ayush Dubey authored
* Update official resnet for multi worker training with distribution strategies. * Fixes for multi worker training. * Fix call to `get_distribution_strategy`. * Undo test change. * Fix spacing. * Move cluster configuration to distribution_utils. * Move train_and_evaluate out of loop. Also, update docstrings for multi-worker flags and add use_train_and_evaluate flag. * Update distribution_strategy flag to match exported name for collective strategy.
-
chardch authored
This import from google3.third_party doesn't work.
-
Haoyu Zhang authored
* Add flag to enable XLA in Keras models * Fix lint errors (some of them are old errors)
-
Mike Arpaia authored
-
Chris Tessum authored
This file was inadvertantly renamed in #6071.
-
- 19 Feb, 2019 1 commit
-
-
Yuefeng Zhou authored
-
- 15 Feb, 2019 1 commit
-
-
Toby Boyd authored
-
- 14 Feb, 2019 7 commits
-
-
Toby Boyd authored
-
Toby Boyd authored
-
Toby Boyd authored
-
Yuefeng Zhou authored
* Workarond for memory issue in eager mode. * Add a TODO * Fix typo * Address comments * remove patch which appear hacky. * fix typo
-
Toby Boyd authored
* Add exp_per_second history to hook. - Add tracking exp_per_second to benchmark tests. * remove turn off dist strat. * Average all results.
-
Toby Boyd authored
* One device from contrib to core. * remove test code.
-
rxsang authored
* Manually scale the loss in Resnet DS model * Update resnet_run_loop.py
-
- 13 Feb, 2019 2 commits
-
-
Toby Boyd authored
-
Yuefeng Zhou authored
* Add a flag to specify distribution strategies. * Fix a small error. * Address comments. * Address comments. * Fix typos.
-
- 12 Feb, 2019 4 commits
-
-
Toby Boyd authored
* fix test benchmark_graph_1_gpu_no_dist_strat failing - Failure only occurs when all 1_gpu tests are run together with the error: tensorflow.python.framework.errors_impl.NotFoundError: Resource localhost/logdir:/tmp/cifar10_model/ N10tensorflow22SummaryWriterInterfaceE does not exist. [Op:WriteScalarSummary] name: epoch_loss/ Another fix might be to generate a different model_dir in the core code, but that has other draw backs such as restarting from the checkpoint. * Model_dir for all tests.
-
Toby Boyd authored
-
Toby Boyd authored
* Remove contrib thread pool. * Remove commented out contrib import. * Fix lint issues. * move tf.data.options higher. Tweak line breaks. * do not monkey patch on or off if dist_strat is off * Do not monkey patch if no_dist_strat. * Fix file permissions. * fix file permissions. * Revert change to main. Add hasattr(tf, 'contrib') to utils * compat.v1.logging * tf.compat.v1.get_local_variables.
-
Toby Boyd authored
- Modest speedup for CIFAR-10 - Slightly greater speedup expected for ImageNet ResNet50.
-
- 11 Feb, 2019 3 commits
-
-
Toby Boyd authored
* Remove contrib thread pool. * Remove commented out contrib import. * Fix lint issues. * move tf.data.options higher. Tweak line breaks.
-
Toby Boyd authored
-
Chris Tessum authored
As discussed here: https://github.com/tensorflow/tensorflow/issues/11312, this change allows this script to work in Python 3.
-
- 10 Feb, 2019 1 commit
-
-
Zack Hsiao authored
-