- 17 Mar, 2020 1 commit
ayushmankumar7 authored
- 13 Feb, 2020 1 commit
Hongkun Yu authored
PiperOrigin-RevId: 294997928
- 27 Jan, 2020 1 commit
Yanhui Liang authored
PiperOrigin-RevId: 291810091
- 15 Dec, 2019 1 commit
Hongkun Yu authored
PiperOrigin-RevId: 285618209
- 14 Dec, 2019 2 commits
Hongkun Yu authored
PiperOrigin-RevId: 285533511
A. Unique TensorFlower authored
PiperOrigin-RevId: 285503670
- 27 Nov, 2019 1 commit
Hongkun Yu authored
PiperOrigin-RevId: 282669615
- 24 Sep, 2019 1 commit
Bruce Fontaine authored
PiperOrigin-RevId: 270926016
- 19 Aug, 2019 1 commit
Ayush Dubey authored
PiperOrigin-RevId: 264244022
- 16 Aug, 2019 2 commits
Hongkun Yu authored
PiperOrigin-RevId: 263863438
Priya Gupta authored
PiperOrigin-RevId: 263854996
- 12 Aug, 2019 1 commit
Hongjun Choi authored
262988559 by A. Unique TensorFlower <gardener@tensorflow.org>: Enable NCF TF 2.0 model to run on TPUStrategy.
262971756 by A. Unique TensorFlower <gardener@tensorflow.org>: Internal change
262967691 by hongkuny <hongkuny@google.com>: Internal
PiperOrigin-RevId: 262988559
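Editor's note: the change above moves the NCF Keras model onto TPUStrategy. A minimal sketch of how a TF 2.x TPUStrategy is typically constructed; the TPU address and the toy model are assumptions for illustration, not code from this commit.

```python
import tensorflow as tf

# Hypothetical TPU endpoint; on Cloud TPU this is usually the TPU name or a
# grpc:// address supplied by the environment.
resolver = tf.distribute.cluster_resolver.TPUClusterResolver(tpu="grpc://10.0.0.1:8470")
tf.config.experimental_connect_to_cluster(resolver)
tf.tpu.experimental.initialize_tpu_system(resolver)
strategy = tf.distribute.experimental.TPUStrategy(resolver)

with strategy.scope():
    # Toy model standing in for the NCF network built by the real code.
    model = tf.keras.Sequential([tf.keras.layers.Dense(1)])
```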
- 02 Jul, 2019 1 commit
Yuefeng Zhou authored
when there are multiple workers.
- 29 Apr, 2019 1 commit
Igor authored
Replace per_device with per_replica and PerDevice with PerReplica, because the PerDevice concept was renamed and no longer exists. (#6693)
- 26 Apr, 2019 1 commit
Ayush Dubey authored
* Add num_packs flag for MirroredStrategy's cross device ops.
* fix parens
* Fix lint errors and make all_reduce_alg more robust.
* Set default num_packs to 1
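Editor's note: a minimal sketch of passing a pack count to MirroredStrategy's cross-device ops, assuming the TF 2.x public API rather than the exact helper touched by this commit.

```python
import tensorflow as tf

# Gradients are concatenated into `num_packs` packs before the all-reduce;
# the commit sets the default to 1.
cross_ops = tf.distribute.HierarchicalCopyAllReduce(num_packs=1)
strategy = tf.distribute.MirroredStrategy(cross_device_ops=cross_ops)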
- 25 Apr, 2019 1 commit
Ayush Dubey authored
* Remove contrib AllReduceCrossDeviceOps and update all_reduce_alg options with MirroredStrategy.
* cleanup
- 24 Apr, 2019 1 commit
Yuefeng Zhou authored
- 08 Apr, 2019 1 commit
Shining Sun authored
* add ds support for ncf
* remove comments for in_top_k
* avoid expanding the input layers
* resolve comments and fix lint
* Added some comments in code and fix lint
* fix lint
* add some documentation
* add tensorflow imports
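Editor's note: the distribution-strategy ("ds") support above builds the model inside a strategy scope. A minimal sketch of that pattern under the TF 2.x public API; the toy model is illustrative, not the NCF network.

```python
import tensorflow as tf

strategy = tf.distribute.MirroredStrategy()
with strategy.scope():
    # Variables created inside the scope (weights, optimizer slots) are
    # mirrored across all replicas.
    model = tf.keras.Sequential([
        tf.keras.layers.Dense(8, activation="relu"),
        tf.keras.layers.Dense(1),
    ])
    model.compile(optimizer="adam", loss="mse")
```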
- 01 Apr, 2019 1 commit
Haoyu Zhang authored
- 19 Mar, 2019 1 commit
Soroush Radpour authored
- 07 Mar, 2019 1 commit
Ayush Dubey authored
* s/CollectiveAllReduceStrategy/MultiWorkerMirroredStrategy
* More s/contrib.distribute/distribute.experimental
* Collective communication options in MultiWorkerMirroredStrategy.
* Minor fixes
* No checkpointing if multi worker.
* turn off checkpointing
* fix lint
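Editor's note: a minimal sketch of the renamed strategy and its collective-communication option, assuming the TF 2.x experimental API (not code taken from this commit).

```python
import tensorflow as tf

# CollectiveCommunication selects the collective implementation: AUTO, RING, or NCCL.
strategy = tf.distribute.experimental.MultiWorkerMirroredStrategy(
    communication=tf.distribute.experimental.CollectiveCommunication.RING)
```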
- 02 Mar, 2019 1 commit
Taylor Robie authored
* fix resnet breakage and add keras end-to-end tests
* delint
* address PR comments
- 01 Mar, 2019 1 commit
Shining Sun authored
* tmp commit
* tmp commit
* first attempt (without eval)
* Bug fixes
* bug fixes
* training done
* Loss NAN, no eval
* Loss weight problem solved
* resolve the NAN loss problem
* Problem solved. Clean up needed
* Added a todo
* Remove debug prints
* Extract get_optimizer to ncf_common
* Move metrics computation back to neumf; use DS.scope api
* Extract DS.scope code to utils
* lint fixes
* Move obtaining DS above producer.start to avoid race condition
* move pt 1
* move pt 2
* Update the run script
* Wrap keras_model related code into functions
* Update the doc for softmax_logitfy and change the method name
* Resolve PR comments
* working version with: eager, DS, batch and no masks
* Remove git conflict indicator
* move reshape to neumf_model
* working version, not converge
* converged
* fix a test
* more lint fix
* more lint fix
* more lint fixes
* more lint fix
* Removed unused imports
* fix test
* dummy commit for kicking of checks
* fix lint issue
* dummy input to kick off checks
* dummy input to kick off checks
* add collective to dist strat
* addressed review comments
* add a doc string
- 28 Feb, 2019 1 commit
Ayush Dubey authored
* s/CollectiveAllReduceStrategy/MultiWorkerMirroredStrategy
* More s/contrib.distribute/distribute.experimental
- 21 Feb, 2019 1 commit
Ayush Dubey authored
* Update official resnet for multi worker training with distribution strategies.
* Fixes for multi worker training.
* Fix call to `get_distribution_strategy`.
* Undo test change.
* Fix spacing.
* Move cluster configuration to distribution_utils.
* Move train_and_evaluate out of loop. Also, update docstrings for multi-worker flags and add use_train_and_evaluate flag.
* Update distribution_strategy flag to match exported name for collective strategy.
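Editor's note: multi-worker training of this kind is usually configured through the TF_CONFIG environment variable. A hedged sketch of that cluster configuration; the host addresses are made up and the exact helper in distribution_utils may differ.

```python
import json
import os

# Two-worker cluster; worker 0 acts as chief by convention. Each worker sets
# its own task index. Addresses are hypothetical.
os.environ["TF_CONFIG"] = json.dumps({
    "cluster": {"worker": ["host1:12345", "host2:12345"]},
    "task": {"type": "worker", "index": 0},
})
```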
- 14 Feb, 2019 1 commit
Toby Boyd authored
* One device from contrib to core.
* remove test code.
- 13 Feb, 2019 1 commit
Yuefeng Zhou authored
* Add a flag to specify distribution strategies.
* Fix a small error.
* Address comments.
* Address comments.
* Fix typos.
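Editor's note: the official models define such flags with absl-style flags. A hedged sketch of what a strategy-selection flag can look like; the flag name, default, and choices here are assumptions, not copied from the commit.

```python
from absl import flags

# Hypothetical flag illustrating the idea of the commit: pick a strategy by name.
flags.DEFINE_string(
    "distribution_strategy", "mirrored",
    "Which tf.distribute strategy to use, e.g. 'off', 'one_device', 'mirrored'.")

FLAGS = flags.FLAGS
```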
- 12 Feb, 2019 1 commit
Toby Boyd authored
* Remove contrib thread pool.
* Remove commented out contrib import.
* Fix lint issues.
* move tf.data.options higher. Tweak line breaks.
* do not monkey patch on or off if dist_strat is off
* Do not monkey patch if no_dist_strat.
* Fix file permissions.
* fix file permissions.
* Revert change to main. Add hasattr(tf, 'contrib') to utils
* compat.v1.logging
* tf.compat.v1.get_local_variables.
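Editor's note: the "tf.data.options" item refers to attaching pipeline options to a dataset. A minimal sketch assuming the TF 2.x tf.data.Options API; the specific threading setting is illustrative, not the exact replacement made by this commit.

```python
import tensorflow as tf

dataset = tf.data.Dataset.range(1000).batch(32)

options = tf.data.Options()
# Give this pipeline its own thread pool rather than the removed contrib thread pool.
options.experimental_threading.private_threadpool_size = 8
dataset = dataset.with_options(options)
```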
- 09 Feb, 2019 1 commit
Yuefeng Zhou authored
* Add pure synthetic data to keras resnet model.
* Add imports.
* Address comments.
* update comment
* Undo set up synthetic data for real data path.
* update comment
* Address comment
* Remove trailing whitespaces.
* s/make_data_set_iterator/make_dataset_iterator/
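Editor's note: "pure synthetic data" means feeding constant tensors of the right shape instead of reading real images. A hedged sketch of the usual pattern; the ImageNet-like shapes and batch size are illustrative, not taken from the commit.

```python
import tensorflow as tf

# One constant image/label pair repeated forever; removes all input-pipeline cost.
images = tf.zeros([224, 224, 3], tf.float32)
labels = tf.zeros([], tf.int32)
synthetic_ds = tf.data.Dataset.from_tensors((images, labels)).repeat().batch(32)
```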
- 01 Feb, 2019 1 commit
guptapriya authored
- 27 Dec, 2018 1 commit
Shining Sun authored
- 24 Dec, 2018 1 commit
Toby Boyd authored
- 21 Dec, 2018 1 commit
Shining Sun authored
- 20 Dec, 2018 2 commits
Shining Sun authored
Shining Sun authored
- 21 Nov, 2018 1 commit
josh11b authored
We've deprecated the "tower" terminology in DistributionStrategy, so the "cross_tower_ops" argument is now "cross_device_ops", matching the current name of "AllReduceCrossDeviceOps".
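Editor's note: for reference, a minimal sketch of the renamed argument as it appears in the current public API (assuming TF 2.x; the original change applied to the then-contrib MirroredStrategy).

```python
import tensorflow as tf

# Formerly `cross_tower_ops`; the argument is now `cross_device_ops`.
strategy = tf.distribute.MirroredStrategy(
    cross_device_ops=tf.distribute.NcclAllReduce())
```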
- 25 Oct, 2018 1 commit
josh11b authored
- 24 Oct, 2018 1 commit
josh11b authored
- 12 Oct, 2018 1 commit
Toby Boyd authored
- 12 Jun, 2018 1 commit
Katherine Wu authored
* Add DistributionStrategy to transformer model
* add num_gpu flag
* Calculate per device batch size for transformer
* remove reference to flags_core
* Add synthetic data option to transformer
* fix typo
* add import back in
* Use hierarchical copy
* address PR comments
* lint
* fix spaces
* group train op together to fix single GPU error
* Fix translate bug (sorted_keys is a dict, not a list)
* Change params to a default dict (translate.py was throwing errors because params didn't have the TPU parameters.)
* Address PR comments. Removed multi gpu flag + more
* fix lint
* fix more lints
* add todo for Synthetic dataset
* Update docs
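Editor's note: the "per device batch size" item divides the global batch evenly across GPUs. A minimal sketch of that arithmetic; the function name and error check are illustrative, not the exact helper added by the commit.

```python
def per_device_batch_size(batch_size: int, num_gpus: int) -> int:
    """Split a global batch size evenly across devices (illustrative helper)."""
    num_devices = max(num_gpus, 1)  # CPU-only runs count as one device
    if batch_size % num_devices != 0:
        raise ValueError(
            f"Batch size {batch_size} must be divisible by device count {num_devices}.")
    return batch_size // num_devices
```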