Commits · 6a6c361642e5fd0864fe87a9dfe77640a7cfcb2f · ModelZoo / ResNet50_tensorflow

22 Jul, 2019 1 commit

Add a new sanity check script that is able to only check incremental changes. (#7265) · 6a6c3616

Hongkun Yu authored Jul 22, 2019

* Update pylint.rcfile

* Update pylint.rcfile

* Update pylint.rcfile

* add new sanity check script for lint to replace current lint script.

* Revert "Update pylint.rcfile"

This reverts commit f6036cd7e7c4b9e3eeb47bb56a63927a040a2761.

* Revert "Update pylint.rcfile"

This reverts commit e3af497342e26bbbbecfc8c8f79cb0e24a2ef960.

* Revert "Update pylint.rcfile"

This reverts commit 6136636eee6e90fd191ebbb4ccaa9fb89c0290f4.

* update scripts

* disable trailing-newlines

6a6c3616

03 Jul, 2019 1 commit

Unit tests pass TF 2.0 GPU and CPU locally. (#7101) · 49097655

Toby Boyd authored Jul 03, 2019

* Fix unit tests failures.

* 96% of TF 2.0 tests on GPU are passing.

* Currently all passing GPU and CPU TF 2.0

* Address code comments.

* use tf 2.0 cast.

* Comment about working on TF 2.0 CPU

* Uses contrib turn off for TF 2.0.

* Fix wide_deep and add keras_common_tests.

* use context to get num_gpus.

* Switch to tf.keras.metrics

49097655

22 Jun, 2019 1 commit
- Fix unit tests failures. (#7086) · 47a59023
  Toby Boyd authored Jun 22, 2019
  
  47a59023
24 May, 2019 1 commit

Transformer v2 benchmark (#6860) · f2ea2f53

Toby Boyd authored May 24, 2019

* Moved common keras code to utils.

* Initial 1 gpu benchmark

- Aligned flags with resnet example
- removed code/features that are not super useful
- eval as part of train if bleu source/ref provided
- add exp_per_second hook

* Rename benchmark classes, pass batch-size and log_steps.

* fix docstring

* Predict done with checkpoints inline

- perfzero baseclass

* steps not epochs with smoother training loop.

* do not initialize history outside loop.

* 5000 between eval not 500

* estimator to keras.

* remove epochs var.

* use range not xrange.

* 200K steps for 1 gpu

* fix global step

f2ea2f53

11 May, 2019 1 commit
- Remove flacky test: test_bad_seed (#6761) · 03242e38
  Toby Boyd authored May 10, 2019
```
- Test passes locally python3 and test is already
    skipped for python2.
```
  03242e38
11 Feb, 2019 1 commit

Remove contrib thread pool. (#6175) · b6c0c7f9

Toby Boyd authored Feb 11, 2019

* Remove contrib thread pool.

* Remove commented out contrib import.

* Fix lint issues.

* move tf.data.options higher. Tweak line breaks.

b6c0c7f9

08 Feb, 2019 1 commit
- Revert "Revert "tf_upgrade_v2 on resnet and utils folders. (#6154)" (#6162)" (#6167) · b2c9e3f5
  Goldie Gadde authored Feb 08, 2019
```
This reverts commit 57e07520.
```
  b2c9e3f5
06 Feb, 2019 1 commit
- Revert "tf_upgrade_v2 on resnet and utils folders. (#6154)" (#6162) · 57e07520
  Goldie Gadde authored Feb 06, 2019
```
This reverts commit d6b2b83c.
```
  57e07520
05 Feb, 2019 1 commit

tf_upgrade_v2 on resnet and utils folders. (#6154) · d6b2b83c

Goldie Gadde authored Feb 05, 2019

* Add resnet56 short tests. (#6101)

* Add resnet56 short tests.
- created base benchmark module
- renamed accuracy test class to contain the word Accuracy
which will result in a need to update all the jobs
and a loss of history but is worth it.
- short tests are mostly copied from shining with oss refactor

* Address feedback.

* Move flag_methods to init
- Address setting default flags repeatedly.

* Rename accuracy tests.

* Lint errors resolved.

* fix model_dir set to flags.data_dir.

* fixed not fulling pulling out flag_methods.

* Use core mirrored strategy in official models (#6126)

* Imagenet short tests (#6132)

* Add short imagenet tests (taken from seemuch)
- also rename to match go forward naming

* fix method name

* Update doc strings.

* Fixe gpu number.

* points default data_dir to child folder. (#6131)

Failed test is python2  and was a kokoro failure

* Imagenet short tests (#6136)

* Add short imagenet tests (taken from seemuch)
- also rename to match go forward naming

* fix method name

* Update doc strings.

* Fixe gpu number.

* Add fill_objects

* fixed calling wrong class in super.

* fix lint issue.

* Flag (#6121)

* Fix the turn_off_ds flag problem

* add param names to all args

* Export benchmark stats using tf.test.Benchmark.report_benchmark() (#6103)

* Export benchmark stats using tf.test.Benchmark.report_benchmark()

* Fix python style using pyformat

* Typos. (#6120)

* log verbosity=2 logs every epoch no progress bars (#6142)

* tf_upgrade_v2 on resnet and utils folder.

* tf_upgrade_v2 on resnet and utils folder.

d6b2b83c

07 Jan, 2019 1 commit

Add bisection based producer for increased scalability, enable fully... · 4fb325da

Taylor Robie authored Dec 27, 2018

Add bisection based producer for increased scalability, enable fully deterministic data production, and use the materialized and bisection producer to check each other (via expected output md5's)

4fb325da

30 Jul, 2018 1 commit

NCF pipeline refactor (take 2) and initial TPU port. (#4935) · 6518c1c7

Taylor Robie authored Jul 30, 2018

* intermediate commit

* ncf now working

* reorder pipeline

* allow batched decode for file backed dataset

* fix bug

* more tweaks

* parallize false negative generation

* shared pool hack

* workers ignore sigint

* intermediate commit

* simplify buffer backed dataset creation to fixed length record approach only. (more cleanup needed)

* more tweaks

* simplify pipeline

* fix misplaced cleanup() calls. (validation works\!)

* more tweaks

* sixify memoryview usage

* more sixification

* fix bug

* add future imports

* break up training input pipeline

* more pipeline tuning

* first pass at moving negative generation to async

* refactor async pipeline to use files instead of ipc

* refactor async pipeline

* move expansion and concatenation from reduce worker to generation workers

* abandon complete async due to interactions with the tensorflow threadpool

* cleanup

* remove performance_comparison.py

* experiment with rough generator + interleave pipeline

* yet more pipeline tuning

* update on-the-fly pipeline

* refactor preprocessing, and move train generation behind a GRPC server

* fix leftover call

* intermediate commit

* intermediate commit

* fix index error in data pipeline, and add logging to train data server

* make sharding more robust to imbalance

* correctly sample with replacement

* file buffers are no longer needed for this branch

* tweak sampling methods

* add README for data pipeline

* fix eval sampling, and vectorize eval metrics

* add spillover and static training batch sizes

* clean up cruft from earlier iterations

* rough delint

* delint 2 / n

* add type annotations

* update run script

* make run.sh a bit nicer

* change embedding initializer to match reference

* rough pass at pure estimator model_fn

* impose static shape hack (revisit later)

* refinements

* fix dir error in run.sh

* add documentation

* add more docs and fix an assert

* old data test is no longer valid. Keeping it around as reference for the new one

* rough draft of data pipeline validation script

* don't rely on shuffle default

* tweaks and documentation

* add separate eval batch size for performance

* initial commit

* terrible hacking

* mini hacks

* missed a bug

* messing about trying to get TPU running

* TFRecords based TPU attempt

* bug fixes

* don't log remotely

* more bug fixes

* TPU tweaks and bug fixes

* more tweaks

* more adjustments

* rework model definition

* tweak data pipeline

* refactor async TFRecords generation

* temp commit to run.sh

* update log behavior

* fix logging bug

* add check for subprocess start to avoid cryptic hangs

* unify deserialize and make it TPU compliant

* delint

* remove gRPC pipeline code

* fix logging bug

* delint and remove old test files

* add unit tests for NCF pipeline

* delint

* clean up run.sh, and add run_tpu.sh

* forgot the most important line

* fix run.sh bugs

* yet more bash debugging

* small tweak to add keras summaries to model_fn

* Clean up sixification issues

* address PR comments

* delinting is never over

6518c1c7

25 May, 2018 1 commit

Fix/log ex per sec (#4360) · d626b908

Karmel Allison authored May 25, 2018

* Using BenchmarkLogger

* Using BenchmarkLogger

* Fixing tests

* Linting fixes.

* Adding comments

* Moving mock logger

* Moving mock logger

* Glinting

* Responding to CR

* Reverting assertEmpty

d626b908

03 May, 2018 1 commit

Move argparsing from builtin argparse to absl (#4099) · 5f9f6b84

Taylor Robie authored May 02, 2018

* squash of modular absl usage commits

* delint

* address PR comments

* change hooks to comma separated list, as absl behavior for space separated lists is not as expected

5f9f6b84

10 Apr, 2018 2 commits
- change reference_data.py to use tf.gfile (#3921) · 2661eb97
  Taylor Robie authored Apr 10, 2018
```
* change reference_data.py to use tf.gfile

* simplify json treatment

* Update reference files to account for a superficial change in batch_norm
```
  2661eb97
- Add copyright header for testing script. (#3943) · c93409cf
  Qianli Scott Zhu authored Apr 10, 2018
  
  c93409cf
03 Apr, 2018 1 commit
- Fix the testing script exit code for python test. (#3858) · c3b26603
  Qianli Scott Zhu authored Apr 03, 2018
  
  c3b26603
02 Apr, 2018 1 commit

Add testing script for local lint and python test. (#3797) · 03bf0d38

Qianli Scott Zhu authored Apr 02, 2018

* Add presubmit testing script for local testing.

* Update the test script to be more modularized.

1. Check the script file location and cd into repo root dir.
2. Allow caller to call differnt tests.

03bf0d38

29 Mar, 2018 1 commit
- Add End-to-end tests for wide deep, and fix "wide" and "deep" configurations. (#3798) · 9cc7eac1
  Taylor Robie authored Mar 29, 2018
```
* add end-to-end tests for wide_deep

delint

* address PR comments
```
  9cc7eac1
28 Mar, 2018 1 commit

Add benchmark upload util to Bigquery. (#3776) · 932364b6

Qianli Scott Zhu authored Mar 28, 2018

* Add benchmark upload util to bigquery.

Also update the benchmark logger and bigquery schema for the
errors found during the integration test.

* Fix lint error.

* Update test to clear all the env vars during test.

This was causing error since the Kokoro test has TF_PKG=tf-nightly
injected during test.

* Update lintrc to ignore google related package.

* Another attempt to fix lint import error.

* Address the review comment.

* Fix lint error.

* Another fix for lint.

* Update test comment for env var clean up.

932364b6

27 Mar, 2018 1 commit

Add reference data tests to official. (#3723) · 587f5792

Taylor Robie authored Mar 27, 2018

* Add golden test util to streamline symbolic and numerical comparison to reference graphs, and apply golden tests to ResNet.

update tests

use more concise logic for path property

delint

add some comments

delint

address PR comments

make resnet tests more concise, and supress warning test in py2

change resnet name template

more shuffling of data dirs

address PR comments and add tensorflow version info

Remove subTest due to py2

switch from tf.__version__ to tf.VERSION, and include tf.GIT_VERSION

supress lint error from json load unpack

* address PR comments

* address PR comments

* delint

587f5792

21 Mar, 2018 1 commit
- Fixing linting for kokoro (#3676) · bea947de
  Karmel Allison authored Mar 20, 2018
  
  bea947de
20 Mar, 2018 2 commits

Glint everything (#3654) · 7cfb6bbd

Karmel Allison authored Mar 20, 2018

* Glint everything

* Adding rcfile and pylinting

* Extra newline

* Few last lints

7cfb6bbd

Use util functions hooks_helper and parser in mnist and wide_deep, and rename... · adfd5a3a
Katherine Wu authored Mar 20, 2018
```
Use util functions hooks_helper and parser in mnist and wide_deep, and rename epochs_between_eval (from epochs_per_eval) (#3650)
```
adfd5a3a

19 Mar, 2018 1 commit
- Improve directory treatment in ResNet end-to-end tests. (#3651) · f85ab4c8
  Taylor Robie authored Mar 19, 2018
```
* use proper temp directory for end to end tests.

* add supers to tearDown
```
  f85ab4c8
16 Mar, 2018 1 commit

Basic end to end test of resnet. (#3598) · dcca6b44

Taylor Robie authored Mar 16, 2018

This commit adds a basic end to end test for resnet cifar10 and imagenet models to check for syntax errors outside of the core neural net code.

dcca6b44