Commits · b7e97bec293f668238aba8467286f3321b6ebd83 · ModelZoo / ResNet50_tensorflow

11 May, 2019 1 commit

Add FP16 to transformer with benchmark tests. (#6756) · b7e97bec

Toby Boyd authored May 10, 2019

* Add FP16 and benchmarks.

* add missing run and report.

* Add loss_scale as option not included with dtype.

* move loss_scale validation under dtype conditional.

* add loss_scale to flags tested.

b7e97bec

10 May, 2019 5 commits
- Fix trivial model to work properly with fp16 (#6760) · c0b31c51
  Haoyu Zhang authored May 10, 2019
```
* Fix trivial model to work properly with fp16

* Add comment on manual casting
```
  c0b31c51
- Minimize variables and computation in trivial model (#6759) · 5e876e6e
  Haoyu Zhang authored May 10, 2019
```
Previously we had one dense layer in trivial model. The weight was [224*224*3, num_classes].
Using two dense layers, the weights are [224*224*3, 1] and [1, num_classes].
```
  5e876e6e
- Do not report accuracy metrics for benchmark tests (#6757) · cfa37aab
  Haoyu Zhang authored May 10, 2019
```
* Do not report metrics in performance benchmarks

* Rename flag
```
  cfa37aab
- Fix broken test in V2 (#6755) · bae940dc
  Haoyu Zhang authored May 10, 2019
  
  bae940dc
- Use LR schedule ops instead of LR callback for tweaked tests (#6745) · 4b4dbad1
  Haoyu Zhang authored May 09, 2019
```
* Modified tweaked tests to use tensor learning rate
```
  4b4dbad1
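The weight shapes quoted in 5e876e6e above can be checked with a little arithmetic. This is a minimal sketch, not code from the repository: `num_classes = 1000` is an illustrative assumption (the commit message gives no value), biases are ignored, and the helper names are hypothetical.

```python
# Weight counts for the trivial model before and after 5e876e6e.
# Assumption: num_classes = 1000 is illustrative; biases are ignored.
INPUT_SIZE = 224 * 224 * 3  # flattened input image: 150528 values


def one_dense_weights(num_classes):
    """Single dense layer with weight shape [224*224*3, num_classes]."""
    return INPUT_SIZE * num_classes


def two_dense_weights(num_classes):
    """Two dense layers with shapes [224*224*3, 1] and [1, num_classes]."""
    return INPUT_SIZE * 1 + 1 * num_classes


num_classes = 1000  # hypothetical value for illustration
before = one_dense_weights(num_classes)
after = two_dense_weights(num_classes)
print(before, after)
```

Under these assumptions the weight count falls by roughly three orders of magnitude, which is consistent with the commit's stated goal of minimizing variables and computation.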
09 May, 2019 2 commits

Transformer instrumented for benchmarking (#6734) · 40543869

Toby Boyd authored May 09, 2019

* Add first benchmark and return stats.

* Remove print statements; update training steps.

* Revert print T: in print statement.

* Remove print(stats)

* add 2 gpu accuracy test for base.

* Fixed total_batch_size when using gpu + gFile deprecations.

* 8 GPU test name fix

* Add 4 and 8 GPU tests.

* typo fixes.

* Clean up test names and methods.

* BLEU uncased. Docstring format fix.

40543869

Use TensorFlow ops for Keras LearningRateSchedule (#6739) · 9d38e894

Haoyu Zhang authored May 09, 2019

* Add learning rate tensor. This makes training slower

* Improve LearningRateSchedule with better efficiency

* Fix lint error

* Replace constant definition with existing one

9d38e894

08 May, 2019 1 commit
- r/tf.random_uniform/tf.random.uniform (#6735) · 9c5253f1
  Toby Boyd authored May 08, 2019
  
  9c5253f1
07 May, 2019 2 commits
- Use flags to define collective ops when initializing MirroredStrategy (#6724) · f5073f49
  Haoyu Zhang authored May 07, 2019
  
  f5073f49
- Move tests_data to gcs and upgrade data_download. (#6722) · 0f76239b
  Toby Boyd authored May 06, 2019
  
  0f76239b
06 May, 2019 1 commit
- Fix ResNet model convergence problem (#6721) · a182abc1
  Haoyu Zhang authored May 06, 2019
  
  a182abc1
04 May, 2019 1 commit

Enable CuDNN BatchNorm spatial persistent by default (#6710) · 58deb059

Haoyu Zhang authored May 03, 2019

* Enable CuDNN BatchNorm spatial persistent by default; Remove 2nd zero padding layer

* Apply scale=False and fused=True consistently to BatchNorm layers

* Undo remove padding layer

* Replace zero padding with padding attribute in max pooling for better performance

* Resolve comments

* Revert "Replace zero padding with padding attribute in max pooling for better performance"

This reverts commit ad49db057c800ecac008eec1057005bd2c08ac73.

58deb059

03 May, 2019 1 commit
- Add graph rewrite convergence benchmark (#6712) · 0a96c7b4
  Reed authored May 02, 2019
  
  0a96c7b4
02 May, 2019 1 commit
- Add graph rewrite benchmarks (#6708) · e172ac82
  Reed authored May 02, 2019
  
  e172ac82
01 May, 2019 1 commit

Add --fp16_implementation option. (#6703) · b691578c

Reed authored May 01, 2019

This option allows the new tf.train.experimental.enable_mixed_precision_graph_rewrite() function to be used for fp16, instead of manual casts.

b691578c
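One way to picture the option added in b691578c is a flag that decides whether the optimizer is passed through the graph rewrite. The sketch below is hedged: the choice names `casting` and `graph_rewrite`, the `--dtype` values, and the helper `wrap_optimizer` are assumptions for illustration, and the rewrite function is injected as a parameter so the example runs without TensorFlow. In real use the injected function would presumably be `tf.train.experimental.enable_mixed_precision_graph_rewrite`.

```python
import argparse

# Hypothetical choice names; the commit message only says the option selects
# the graph rewrite instead of manual casts.
FP16_IMPLEMENTATIONS = ("casting", "graph_rewrite")


def build_parser():
    parser = argparse.ArgumentParser()
    parser.add_argument("--dtype", choices=("fp16", "fp32"), default="fp32")
    parser.add_argument("--fp16_implementation",
                        choices=FP16_IMPLEMENTATIONS, default="casting")
    return parser


def wrap_optimizer(optimizer, args, graph_rewrite_fn):
    """Apply graph_rewrite_fn only when fp16 and graph_rewrite are both set.

    graph_rewrite_fn stands in for a function that takes an optimizer and
    returns a wrapped optimizer.
    """
    if args.dtype == "fp16" and args.fp16_implementation == "graph_rewrite":
        return graph_rewrite_fn(optimizer)
    # Manual-casting path (or fp32): the optimizer is left untouched here.
    return optimizer


# Usage with a stand-in rewrite function instead of the TensorFlow call.
args = build_parser().parse_args(
    ["--dtype", "fp16", "--fp16_implementation", "graph_rewrite"])
wrapped = wrap_optimizer("sgd", args, lambda opt: ("rewritten", opt))
print(wrapped)
```

Injecting the rewrite function keeps the flag logic testable on its own. The manual-cast path would still need its casts inside the model code, which this sketch does not show.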

30 Apr, 2019 1 commit
- Eval every 10 epochs to better match estimator tests. (#6696) · 3ee027fb
  Toby Boyd authored Apr 30, 2019
  
  3ee027fb
29 Apr, 2019 5 commits

Replace per_device with per_replica and PerDevice with PerReplica, because the... · b00783d7

Igor authored Apr 29, 2019

Replace per_device with per_replica and PerDevice with PerReplica, because the PerDevice concept was renamed and doesn't exist anymore. (#6693)

* Replace per_device with per_replica and PerDevice with PerReplica, because the PerDevice concept was renamed and doesn't exist anymore.

b00783d7

Add accuracy check. (#6694) · 294660bd

Toby Boyd authored Apr 29, 2019

* Add accuracy check.

* Avoid double flag init, move data_dir to real data.

* Comment on lower accuracy target.

294660bd

bug fix (#6695) · 0f6f656f
Shining Sun authored Apr 29, 2019
```
* bug fix

* bug fix
```
0f6f656f

Add benchmarks with the --cloning flag to Resnet and NCF. (#6675) · af47736d

Igor authored Apr 29, 2019

* Add benchmarks with the --cloning flag to Resnet and NCF.

* Renamed cloning to clone_model_in_keras_dist_strat. Dropped a few tests that aren't essential.

* Fixed up the formatting after renaming the flag to a much longer name. Thanks, lint.

* Fixed the lint error in nfc_common.py

af47736d

fixed simple typo (#6686) · d087c89b
Songyi Blair Han authored Apr 30, 2019

d087c89b

26 Apr, 2019 3 commits

Combined imagenet and cifar-10 estimator tests (#6672) · acc6f6d7

Toby Boyd authored Apr 26, 2019

* Combined imagenet and cifar-10 benchmarks

* Comments and epochs_between_evals.

* Added tuned tests and cleaned up benchmark flags

* Fix names.

* Return results and add images/sec hook.

* updated doc strings for return values.

* 128 to 256 batch for FP16 test

* added more doc strings to fix lint.

acc6f6d7

Add num_packs flag for MirroredStrategy's cross device ops. (#6676) · 4a1fba0b

Ayush Dubey authored Apr 26, 2019

* Add num_packs flag for MirroredStrategy's cross device ops.

* fix parens

* Fix lint errors and make all_reduce_alg more robust.

* Set default num_packs to 1

4a1fba0b

Do not query GPU compatibility before app main (#6679) · 9b17d796

Gaurav Jain authored Apr 25, 2019

tf.test.is_gpu_available() should not be called in flags since this is
called before app.main() and the runtime has not yet been initialized.

9b17d796
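The problem described in 9b17d796 is a flag default that queries the runtime while flags are still being defined. A common fix is to use a sentinel default and resolve it inside main. This is a minimal sketch under stated assumptions: `gpu_available` is a stand-in for a call such as `tf.test.is_gpu_available()`, and the `--data_format` flag and its values are hypothetical examples, not taken from the commit.

```python
import argparse


def gpu_available():
    """Stand-in for a runtime query such as tf.test.is_gpu_available()."""
    gpu_available.calls += 1
    return False  # pretend no GPU is present


gpu_available.calls = 0  # counts how often the runtime is queried


def build_parser():
    parser = argparse.ArgumentParser()
    # The default is a sentinel, so defining the flag does not touch the
    # runtime. (Hypothetical flag, for illustration only.)
    parser.add_argument("--data_format", default=None,
                        choices=("channels_first", "channels_last"))
    return parser


def main(argv=None):
    args = build_parser().parse_args(argv)
    # Resolve the default only here, after startup, when the runtime is ready.
    if args.data_format is None:
        args.data_format = (
            "channels_first" if gpu_available() else "channels_last")
    return args.data_format


build_parser()  # defining flags triggers no hardware query
print(gpu_available.calls, main([]), gpu_available.calls)
```

Passing the flag explicitly skips the query altogether, so the runtime is only touched when the default actually has to be computed.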

25 Apr, 2019 2 commits
- Remove contrib cross device ops and update all_reduce_alg options. (#6673) · ece99414
  Ayush Dubey authored Apr 25, 2019
```
* Remove contrib AllReduceCrossDeviceOps and update all_reduce_alg options with MirroredStrategy.

* cleanup
```
  ece99414
- Revert "Specify NCCL as the all reduce algorithm (#6662)" (#6671) · a7338771
  Haoyu Zhang authored Apr 25, 2019
```
Reason: test failures because contrib is not available in V2

This reverts commit 325dd761.
```
  a7338771
24 Apr, 2019 6 commits
- Add top_1 accuracy check. (#6663) · ff5cef9a
  Toby Boyd authored Apr 24, 2019
  
  ff5cef9a
- Added none check for output_dir (#6664) · 05b9122f
  Shining Sun authored Apr 24, 2019
```
* Added none check for output_dir

* Change double quote to single
```
  05b9122f
- Specify NCCL as the all reduce algorithm (#6662) · 325dd761
  Haoyu Zhang authored Apr 24, 2019
  
  325dd761
- Update distribution_utils.py (#6615) · 98672351
  Yuefeng Zhou authored Apr 24, 2019
  
  98672351
- Add tests to track 8 GPU fp16 performance in legacy graph mode (#6653) · 4ad73a1c
  Haoyu Zhang authored Apr 23, 2019
  
  4ad73a1c
- Add experimental tf.data sleep tuning for better performance (#6634) · 50dfb31d
  Haoyu Zhang authored Apr 23, 2019
```
* Introduce a short sleep before ds.prefetch in tf.data.

* Further limit dataset threads to reduce CPU contention

* Tuned dataset sleep time

* Rename dataset sleep flag; enable it only for Keras Graph mode
```
  50dfb31d
23 Apr, 2019 2 commits
- Small word tweak (#6650) · 4698a41e
  Toby Boyd authored Apr 23, 2019
```
* Small word tweak

* Few more tweaks
```
  4698a41e
- Update README.md (#6612) · 9d299984
  Usama Muneeb authored Apr 23, 2019
```
Added additional information on using the `SavedModel` for prediction purposes.
```
  9d299984
22 Apr, 2019 3 commits
- Ncf metric tweaks (#6633) · 042c9aaa
  Toby Boyd authored Apr 22, 2019
```
* Use tf.image.resize_with_crop_or_pad

* exp_per_second and hr_at_10
```
  042c9aaa
- Add usernames to TODOs (#6619) · 5c37e69c
  Shining Sun authored Apr 22, 2019
  
  5c37e69c
- Use tf.image.resize_with_crop_or_pad (#6632) · 7772cb1d
  Toby Boyd authored Apr 22, 2019
  
  7772cb1d
20 Apr, 2019 2 commits

Add 2-GPU benchmark for NCF (#6589) · d11aa330
Shining Sun authored Apr 19, 2019

d11aa330

Remove contrib imports, or move them inline (#6591) · 8ff9eb54

Shining Sun authored Apr 19, 2019

* Remove contrib imports, or move them inline

* Use exposed API for FixedLenFeature

* Replace tf.logging with absl logging

* Change GFile to v2 APIs

* Replace tf.logging with absl logging in movielens

* Fixing an import bug

* Change gfile to v2 APIs in code

* Swap to keras optimizer v2

* Bug fix for optimizer

* Change tf.log to tf.keras.backend.log

* Change the loss function to keras loss

* convert another loss to keras loss

* Resolve comments and fix lint

* Add a doc string

* Fix existing tests and add new tests for DS

* Added tests for multi-replica

* Fix lint

* resolve comments

* make estimator run in tf2.0

* use compat v1 loss

* fix lint issue

8ff9eb54