1. 05 Mar, 2020 1 commit
  2. 25 Feb, 2020 1 commit
  3. 28 Oct, 2019 1 commit
  4. 16 Oct, 2019 1 commit
    • Add support for the tf.keras.mixed_precision API in NCF · cb913691
      Reed Wanderman-Milne authored
      To test, I did 50 fp32 runs and 50 fp16 runs. I used the following command:
      
      python ncf_keras_main.py --dataset=ml-20m --num_gpus=1 --train_epochs=10 --clean --batch_size=99000 --learning_rate=0.00382059 --beta1=0.783529 --beta2=0.909003 --epsilon=1.45439e-7 --layers=256,256,128,64 --num_factors=64 --hr_threshold=0.635 --ml_perf --nouse_synthetic_data --data_dir ~/ncf_data_dir_python3 --model_dir ~/tmp_model_dir --keras_use_ctl
      
      For the fp16 runs, I added --dtype=fp16. The average hit-rate for both fp16 and fp32 was 0.6365. I also did 50 runs with the mixed precision graph rewrite, and the average hit-rate was 0.6363. The difference is likely due to noise.
      
      PiperOrigin-RevId: 275059871
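      For reference, a minimal sketch of the tf.keras.mixed_precision API this
      commit adopts, in the modern TF 2.x spelling (the tiny model below is an
      illustrative stand-in, not NCF):

      import tensorflow as tf

      # Run compute in float16 while keeping variables in float32.
      tf.keras.mixed_precision.set_global_policy('mixed_float16')

      model = tf.keras.Sequential([
          tf.keras.layers.Dense(64, activation='relu'),
          # Keep the output layer in float32 for numerically stable results.
          tf.keras.layers.Dense(1, dtype='float32'),
      ])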
  5. 07 Oct, 2019 1 commit
  6. 30 Aug, 2019 1 commit
  7. 26 Aug, 2019 1 commit
  8. 23 Aug, 2019 1 commit
  9. 20 Aug, 2019 2 commits
  10. 19 Aug, 2019 1 commit
    • Do not expose --max_train_steps in models that do not use it. · 824ff2d6
      Reed Wanderman-Milne authored
      Only the V1 ResNet model uses --max_train_steps. This change stops exposing the flag in the keras_application_models, MNIST, Keras ResNet, and CTL ResNet models, which previously accepted the flag but ignored it.
      
      I also removed the "max_train" argument from the run_synthetic function, since it was only meaningful for the V1 ResNet model. Instead, the V1 ResNet model now passes --max_train_steps=1 directly to run_synthetic.
      
      PiperOrigin-RevId: 264269836
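      A hypothetical sketch of the resulting pattern with absl flags (the
      helper name and parameter are illustrative, not the repo's exact code):
      models that do not opt in never define the flag, so --max_train_steps
      now fails fast instead of being silently ignored.

      from absl import flags

      def define_base_flags(max_train_steps=False):
        # Only the V1 ResNet model opts in; everywhere else the flag
        # simply does not exist.
        if max_train_steps:
          flags.DEFINE_integer(
              'max_train_steps', None,
              'Maximum number of training steps (V1 ResNet only).')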
  11. 06 Aug, 2019 1 commit
  12. 23 Jul, 2019 1 commit
  13. 19 Jun, 2019 1 commit
    • Add XLA to transformer (#7048) · 269581dc
      Toby Boyd authored
      * set default steps to 300K.
      
      * Log flags to perfzero.
      
      * Add XLA support to transformer
      
      - Moved config logic to keras_utils
      - Added enable_xla flag to the _performance flags (see the sketch
        after this entry)
      - Did not refactor the enable_xla flag out of Keras ResNet, because
        the Estimator Keras path still reads FLAGS directly; that refactor
        is left for another time.
      
      * fix g3 lint complaint.
      
      * Refactor set config into keras_utils.
      
      * Move flags out of main.
      
      * pipe through enable_xla
      
      * Update official/transformer/v2/misc.py
      Co-Authored-By: Reed <reedwm@google.com>
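      A minimal sketch of what the enable_xla flag typically toggles, assuming
      it maps onto TensorFlow's JIT switch (the helper name is illustrative,
      not the commit's exact code):

      import tensorflow as tf

      def set_config(enable_xla=False):
        if enable_xla:
          # Compile eligible TF ops with the XLA JIT compiler.
          tf.config.optimizer.set_jit(True)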
  14. 06 Jun, 2019 1 commit
  15. 15 May, 2019 1 commit
  16. 11 May, 2019 1 commit
  17. 01 May, 2019 1 commit
    • Add --fp16_implementation option. (#6703) · b691578c
      Reed authored
      This option allows the new tf.train.experimental.enable_mixed_precision_graph_rewrite() function to be used for fp16, instead of manual casts.
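      A short sketch of the graph-rewrite path this option selects; the flag
      handling is a simplified assumption, but the rewrite function is the
      one named above (available in TF 1.14 through early TF 2.x):

      import tensorflow as tf

      fp16_implementation = 'graph_rewrite'  # illustrative stand-in for the flag

      opt = tf.keras.optimizers.SGD(learning_rate=0.01)
      if fp16_implementation == 'graph_rewrite':
        # Rewrite eligible ops to fp16 and add automatic loss scaling,
        # rather than inserting manual casts in the model code.
        opt = tf.train.experimental.enable_mixed_precision_graph_rewrite(opt)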
  18. 26 Apr, 2019 1 commit
  19. 03 Apr, 2019 1 commit
  20. 20 Mar, 2019 1 commit
  21. 07 Mar, 2019 1 commit
  22. 13 Oct, 2018 1 commit
  23. 12 Oct, 2018 1 commit
  24. 12 Jun, 2018 1 commit
    • Transformer multi-GPU, remove multi_gpu flag, distribution helper functions (#4457) · 29c9f985
      Katherine Wu authored
      * Add DistributionStrategy to the transformer model (see the sketch after this list)
      
      * add num_gpu flag
      
      * Calculate per-device batch size for transformer
      
      * remove reference to flags_core
      
      * Add synthetic data option to transformer
      
      * fix typo
      
      * add import back in
      
      * Use hierarchical copy
      
      * address PR comments
      
      * lint
      
      * fix spaces
      
      * group train op together to fix single GPU error
      
      * Fix translate bug (sorted_keys is a dict, not a list)
      
      * Change params to a default dict (translate.py was throwing errors because params didn't have the TPU parameters).
      
      * Address PR comments. Removed multi gpu flag + more
      
      * fix lint
      
      * fix more lints
      
      * add todo for Synthetic dataset
      
      * Update docs
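      A rough sketch of the distribution setup the bullets above describe:
      MirroredStrategy with hierarchical copy and a per-device batch size.
      The numbers and the tf.distribute spelling are assumptions (the PR
      itself predates TF 2.x):

      import tensorflow as tf

      num_gpus = 8
      global_batch_size = 4096
      # Each replica processes an equal slice of the global batch.
      per_device_batch_size = global_batch_size // num_gpus

      # Hierarchical copy speeds up all-reduce on a single multi-GPU host.
      strategy = tf.distribute.MirroredStrategy(
          cross_device_ops=tf.distribute.HierarchicalCopyAllReduce())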
  25. 03 May, 2018 1 commit