Commits · 33c61588cabda85598fbfbdce0d0329bbe42b4a4 · ModelZoo / ResNet50_tensorflow

13 Oct, 2020 1 commit
- Move core configs to core/ folder. Leave the configs for legacy models. · 33c61588
  Hongkun Yu authored Oct 12, 2020
```
PiperOrigin-RevId: 336795303
```
  33c61588
12 Aug, 2020 2 commits
- Internal change · 999fae62
  Hongkun Yu authored Aug 12, 2020
```
PiperOrigin-RevId: 326286926
```
  999fae62
- Internal change · 88253ce5
  Hongkun Yu authored Aug 12, 2020
```
PiperOrigin-RevId: 326286926
```
  88253ce5
06 Aug, 2020 2 commits
- Move trainers to core/ · cbbba228
  Hongkun Yu authored Aug 06, 2020
```
Move mock_task to utils/testing/

PiperOrigin-RevId: 325275356
```
  cbbba228
- Move trainers to core/ · 45da63a9
  Hongkun Yu authored Aug 06, 2020
```
Move mock_task to utils/testing/

PiperOrigin-RevId: 325275356
```
  45da63a9
29 Apr, 2020 1 commit
- [Clean up] Move utils/logs to r1/utils. · ec7fbf0d
  Hongkun Yu authored Apr 29, 2020
```
PiperOrigin-RevId: 309079916
```
  ec7fbf0d
14 Apr, 2020 1 commit
- Move benchmark_wrappers to benchmark folder. · d70eca30
  Hongkun Yu authored Apr 14, 2020
```
PiperOrigin-RevId: 306521269
```
  d70eca30
09 Apr, 2020 1 commit
- move perfzero benchmark to benchmark/ · 7351c164
  Hongkun Yu authored Apr 09, 2020
```
PiperOrigin-RevId: 305709689
```
  7351c164
17 Mar, 2020 1 commit
- tf.compat.v1.logging implemented with absl · 3043566d
  ayushmankumar7 authored Mar 18, 2020
  
  3043566d
14 Mar, 2020 1 commit
- Add ability to take TPU address and output dir from environment variables. · 9ff8aa21
  Sai Ganesh Bandiatmakuri authored Mar 13, 2020
```
PiperOrigin-RevId: 300858086
```
  9ff8aa21
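The commit above only says that the TPU address and output directory can come from environment variables. A minimal sketch of that kind of fallback, using only the standard library, might look like the following. The variable names `BENCHMARK_TPU` and `BENCHMARK_OUTPUT_DIR` and the helper `resolve_setting` are hypothetical and are not taken from the repository.

```python
import os

# Hypothetical variable names; the commit message does not say which ones it reads.
TPU_ENV_VAR = "BENCHMARK_TPU"
OUTPUT_DIR_ENV_VAR = "BENCHMARK_OUTPUT_DIR"


def resolve_setting(explicit_value, env_var, default=None, environ=None):
    """Return the explicit value if set, else the environment value, else default."""
    # Allow a custom mapping to be passed in so the lookup is easy to test.
    environ = os.environ if environ is None else environ
    if explicit_value:
        return explicit_value
    return environ.get(env_var, default)


if __name__ == "__main__":
    tpu = resolve_setting(None, TPU_ENV_VAR)
    output_dir = resolve_setting(None, OUTPUT_DIR_ENV_VAR, default="/tmp/benchmark")
    print("tpu:", tpu)
    print("output_dir:", output_dir)
```

In this sketch an explicitly passed value takes precedence over the environment, which is one common choice; the actual precedence used by the commit is not stated here.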
29 Jan, 2020 1 commit
- remove reference_data to reduce github files. · d33a0b42
  Hongkun Yu authored Jan 29, 2020
```
PiperOrigin-RevId: 292178802
```
  d33a0b42
11 Dec, 2019 1 commit
- Internal change · 26bbda73
  David Chen authored Dec 10, 2019
```
PiperOrigin-RevId: 284874717
```
  26bbda73
21 Nov, 2019 1 commit
- Internal change · 4c872f63
  Sai Ganesh Bandiatmakuri authored Nov 21, 2019
```
PiperOrigin-RevId: 281846531
```
  4c872f63
19 Nov, 2019 1 commit
- Properly save/restore flags in model tests · 1b8c0ee8
  Jose Baiocchi authored Nov 18, 2019
```
PiperOrigin-RevId: 281192912
```
  1b8c0ee8
10 Oct, 2019 1 commit

change benchmark's log verbosity to logging.INFO. it seems to me that DEBUG... · 1f9efe10

A. Unique TensorFlower authored Oct 10, 2019

change benchmark's log verbosity to logging.INFO. it seems to me that DEBUG maps to ---v=1 internally, which is way too verbose for the purpose of benchmarking.

PiperOrigin-RevId: 274040907

1f9efe10

04 Sep, 2019 1 commit

Unexpose some flags from models which do not use them. · a85c40e3

Reed Wanderman-Milne authored Sep 03, 2019

--clean, --train_epochs, and --epochs_between_evals have been unexposed from models which do not use them

PiperOrigin-RevId: 267065651

a85c40e3

21 Aug, 2019 1 commit
- Mute no-name-in-module · dd03f167
  Hongkun Yu authored Aug 20, 2019
```
PiperOrigin-RevId: 264527204
```
  dd03f167
19 Aug, 2019 1 commit

Do not expose --max_train_steps in models that do not use it. · 824ff2d6

Reed Wanderman-Milne authored Aug 19, 2019

Only the V1 resnet model uses --max_train_steps. This unexposes the flag in the keras_application_models, mnist, keras resnet, and CTL resnet models. Before this change, such models allowed the flag to be specified, but ignored it.

I also removed the "max_train" argument from the run_synthetic function, since this only had any meaning for the V1 resnet model. Instead, the V1 resnet model now directly passes --max_train_steps=1 to run_synthetic.

PiperOrigin-RevId: 264269836

824ff2d6

02 Aug, 2019 2 commits
- Update to use py3 lint (#7367) · a76e250f
  Hongkun Yu authored Aug 02, 2019
```
* Update to use py3 lint

* Update model_saving_utils.py

Testing. To be reverted

* Update model_saving_utils.py
```
  a76e250f
- Update presubmit.sh (#7366) · 126134f2
  Hongkun Yu authored Aug 02, 2019
```
old lint is no longer used.
```
  126134f2
23 Jul, 2019 1 commit
- Update lint presubmit to be consistent with tensorflow (#7278) · 609260cd
  Hongkun Yu authored Jul 22, 2019
```
Only care about errors and output into an error file.
```
  609260cd
22 Jul, 2019 1 commit

Add a new sanity check script that is able to only check incremental changes. (#7265) · 6a6c3616

Hongkun Yu authored Jul 22, 2019

* Update pylint.rcfile

* Update pylint.rcfile

* Update pylint.rcfile

* add new sanity check script for lint to replace current lint script.

* Revert "Update pylint.rcfile"

This reverts commit f6036cd7e7c4b9e3eeb47bb56a63927a040a2761.

* Revert "Update pylint.rcfile"

This reverts commit e3af497342e26bbbbecfc8c8f79cb0e24a2ef960.

* Revert "Update pylint.rcfile"

This reverts commit 6136636eee6e90fd191ebbb4ccaa9fb89c0290f4.

* update scripts

* disable trailing-newlines

6a6c3616

03 Jul, 2019 1 commit

Unit tests pass TF 2.0 GPU and CPU locally. (#7101) · 49097655

Toby Boyd authored Jul 03, 2019

* Fix unit tests failures.

* 96% of TF 2.0 tests on GPU are passing.

* Currently all passing GPU and CPU TF 2.0

* Address code comments.

* use tf 2.0 cast.

* Comment about working on TF 2.0 CPU

* Uses contrib turn off for TF 2.0.

* Fix wide_deep and add keras_common_tests.

* use context to get num_gpus.

* Switch to tf.keras.metrics

49097655

22 Jun, 2019 1 commit
- Fix unit tests failures. (#7086) · 47a59023
  Toby Boyd authored Jun 22, 2019
  
  47a59023
24 May, 2019 1 commit

Transformer v2 benchmark (#6860) · f2ea2f53

Toby Boyd authored May 24, 2019

* Moved common keras code to utils.

* Initial 1 gpu benchmark

- Aligned flags with resnet example
- removed code/features that are not super useful
- eval as part of train if bleu source/ref provided
- add exp_per_second hook

* Rename benchmark classes, pass batch-size and log_steps.

* fix docstring

* Predict done with checkpoints inline

- perfzero baseclass

* steps not epochs with smoother training loop.

* do not initialize history outside loop.

* 5000 between eval not 500

* estimator to keras.

* remove epochs var.

* use range not xrange.

* 200K steps for 1 gpu

* fix global step

f2ea2f53

11 May, 2019 1 commit
- Remove flaky test: test_bad_seed (#6761) · 03242e38
  Toby Boyd authored May 10, 2019
```
- Test passes locally on python3 and is already
    skipped for python2.
```
  03242e38
11 Feb, 2019 1 commit

Remove contrib thread pool. (#6175) · b6c0c7f9

Toby Boyd authored Feb 11, 2019

* Remove contrib thread pool.

* Remove commented out contrib import.

* Fix lint issues.

* move tf.data.options higher. Tweak line breaks.

b6c0c7f9

08 Feb, 2019 1 commit
- Revert "Revert "tf_upgrade_v2 on resnet and utils folders. (#6154)" (#6162)" (#6167) · b2c9e3f5
  Goldie Gadde authored Feb 08, 2019
```
This reverts commit 57e07520.
```
  b2c9e3f5
06 Feb, 2019 1 commit
- Revert "tf_upgrade_v2 on resnet and utils folders. (#6154)" (#6162) · 57e07520
  Goldie Gadde authored Feb 06, 2019
```
This reverts commit d6b2b83c.
```
  57e07520
05 Feb, 2019 1 commit

tf_upgrade_v2 on resnet and utils folders. (#6154) · d6b2b83c

Goldie Gadde authored Feb 05, 2019

* Add resnet56 short tests. (#6101)

* Add resnet56 short tests.
- created base benchmark module
- renamed accuracy test class to contain the word Accuracy
which will result in a need to update all the jobs
and a loss of history but is worth it.
- short tests are mostly copied from shining with oss refactor

* Address feedback.

* Move flag_methods to init
- Address setting default flags repeatedly.

* Rename accuracy tests.

* Lint errors resolved.

* fix model_dir set to flags.data_dir.

* fixed not fully pulling out flag_methods.

* Use core mirrored strategy in official models (#6126)

* Imagenet short tests (#6132)

* Add short imagenet tests (taken from seemuch)
- also rename to match go forward naming

* fix method name

* Update doc strings.

* Fix gpu number.

* points default data_dir to child folder. (#6131)

Failed test is python2 and was a kokoro failure

* Imagenet short tests (#6136)

* Add short imagenet tests (taken from seemuch)
- also rename to match go forward naming

* fix method name

* Update doc strings.

* Fix gpu number.

* Add fill_objects

* fixed calling wrong class in super.

* fix lint issue.

* Flag (#6121)

* Fix the turn_off_ds flag problem

* add param names to all args

* Export benchmark stats using tf.test.Benchmark.report_benchmark() (#6103)

* Export benchmark stats using tf.test.Benchmark.report_benchmark()

* Fix python style using pyformat

* Typos. (#6120)

* log verbosity=2 logs every epoch no progress bars (#6142)

* tf_upgrade_v2 on resnet and utils folder.

* tf_upgrade_v2 on resnet and utils folder.

d6b2b83c

07 Jan, 2019 1 commit

Add bisection based producer for increased scalability, enable fully... · 4fb325da

Taylor Robie authored Dec 27, 2018

Add bisection based producer for increased scalability, enable fully deterministic data production, and use the materialized and bisection producer to check each other (via expected output md5's)

4fb325da

30 Jul, 2018 1 commit

NCF pipeline refactor (take 2) and initial TPU port. (#4935) · 6518c1c7

Taylor Robie authored Jul 30, 2018

* intermediate commit

* ncf now working

* reorder pipeline

* allow batched decode for file backed dataset

* fix bug

* more tweaks

* parallelize false negative generation

* shared pool hack

* workers ignore sigint

* intermediate commit

* simplify buffer backed dataset creation to fixed length record approach only. (more cleanup needed)

* more tweaks

* simplify pipeline

* fix misplaced cleanup() calls. (validation works!)

* more tweaks

* sixify memoryview usage

* more sixification

* fix bug

* add future imports

* break up training input pipeline

* more pipeline tuning

* first pass at moving negative generation to async

* refactor async pipeline to use files instead of ipc

* refactor async pipeline

* move expansion and concatenation from reduce worker to generation workers

* abandon complete async due to interactions with the tensorflow threadpool

* cleanup

* remove performance_comparison.py

* experiment with rough generator + interleave pipeline

* yet more pipeline tuning

* update on-the-fly pipeline

* refactor preprocessing, and move train generation behind a GRPC server

* fix leftover call

* intermediate commit

* intermediate commit

* fix index error in data pipeline, and add logging to train data server

* make sharding more robust to imbalance

* correctly sample with replacement

* file buffers are no longer needed for this branch

* tweak sampling methods

* add README for data pipeline

* fix eval sampling, and vectorize eval metrics

* add spillover and static training batch sizes

* clean up cruft from earlier iterations

* rough delint

* delint 2 / n

* add type annotations

* update run script

* make run.sh a bit nicer

* change embedding initializer to match reference

* rough pass at pure estimator model_fn

* impose static shape hack (revisit later)

* refinements

* fix dir error in run.sh

* add documentation

* add more docs and fix an assert

* old data test is no longer valid. Keeping it around as reference for the new one

* rough draft of data pipeline validation script

* don't rely on shuffle default

* tweaks and documentation

* add separate eval batch size for performance

* initial commit

* terrible hacking

* mini hacks

* missed a bug

* messing about trying to get TPU running

* TFRecords based TPU attempt

* bug fixes

* don't log remotely

* more bug fixes

* TPU tweaks and bug fixes

* more tweaks

* more adjustments

* rework model definition

* tweak data pipeline

* refactor async TFRecords generation

* temp commit to run.sh

* update log behavior

* fix logging bug

* add check for subprocess start to avoid cryptic hangs

* unify deserialize and make it TPU compliant

* delint

* remove gRPC pipeline code

* fix logging bug

* delint and remove old test files

* add unit tests for NCF pipeline

* delint

* clean up run.sh, and add run_tpu.sh

* forgot the most important line

* fix run.sh bugs

* yet more bash debugging

* small tweak to add keras summaries to model_fn

* Clean up sixification issues

* address PR comments

* delinting is never over

6518c1c7

25 May, 2018 1 commit

Fix/log ex per sec (#4360) · d626b908

Karmel Allison authored May 25, 2018

* Using BenchmarkLogger

* Using BenchmarkLogger

* Fixing tests

* Linting fixes.

* Adding comments

* Moving mock logger

* Moving mock logger

* Glinting

* Responding to CR

* Reverting assertEmpty

d626b908

03 May, 2018 1 commit

Move argparsing from builtin argparse to absl (#4099) · 5f9f6b84

Taylor Robie authored May 02, 2018

* squash of modular absl usage commits

* delint

* address PR comments

* change hooks to comma separated list, as absl behavior for space separated lists is not as expected

5f9f6b84

10 Apr, 2018 2 commits
- change reference_data.py to use tf.gfile (#3921) · 2661eb97
  Taylor Robie authored Apr 10, 2018
```
* change reference_data.py to use tf.gfile

* simplify json treatment

* Update reference files to account for a superficial change in batch_norm
```
  2661eb97
- Add copyright header for testing script. (#3943) · c93409cf
  Qianli Scott Zhu authored Apr 10, 2018
  
  c93409cf
03 Apr, 2018 1 commit
- Fix the testing script exit code for python test. (#3858) · c3b26603
  Qianli Scott Zhu authored Apr 03, 2018
  
  c3b26603
02 Apr, 2018 1 commit

Add testing script for local lint and python test. (#3797) · 03bf0d38

Qianli Scott Zhu authored Apr 02, 2018

* Add presubmit testing script for local testing.

* Update the test script to be more modularized.

1. Check the script file location and cd into repo root dir.
2. Allow caller to call different tests.

03bf0d38

29 Mar, 2018 1 commit
- Add End-to-end tests for wide deep, and fix "wide" and "deep" configurations. (#3798) · 9cc7eac1
  Taylor Robie authored Mar 29, 2018
```
* add end-to-end tests for wide_deep

delint

* address PR comments
```
  9cc7eac1
28 Mar, 2018 1 commit

Add benchmark upload util to Bigquery. (#3776) · 932364b6

Qianli Scott Zhu authored Mar 28, 2018

* Add benchmark upload util to bigquery.

Also update the benchmark logger and bigquery schema for the
errors found during the integration test.

* Fix lint error.

* Update test to clear all the env vars during test.

This was causing error since the Kokoro test has TF_PKG=tf-nightly
injected during test.

* Update lintrc to ignore google related package.

* Another attempt to fix lint import error.

* Address the review comment.

* Fix lint error.

* Another fix for lint.

* Update test comment for env var clean up.

932364b6