- 24 May, 2019 2 commits
-
-
Priya Gupta authored
Add early stopping logic to ncf keras when desired threshold is met. Also change the default batch size to match the tuned hyperparams
-
Tian Lin authored
* Merged commit includes the following changes: 249776315 by tianlin<tianlin@google.com>: Internal change 249763206 by tianlin<tianlin@google.com>: For TF 2.0 (related to Beam Search), expand cond dims in tf.where(cond, x, y) to make all parameters broadcastable. -- 249392724 by hongkuny<hongkuny@google.com>: Internal change PiperOrigin-RevId: 249776315 * Merged commit includes the following changes: 249823043 by tianlin<tianlin@google.com>: Bring back v2 test for predict and eval. -- PiperOrigin-RevId: 249823043
-
- 23 May, 2019 2 commits
-
-
guptapriya authored
Adding validation every epoch allows us to view the progress during training instead of having to wait until the last eval. Mostly useful for manual runs.
-
guptapriya authored
Current batch size 160000 does not converge to the desired HR. So we decrease to 99k which is known to converge. Tested locally and got to 63.5 at epoch 7. Also decreasing number of epochs as I don't see any improvement after epoch 7-8.
-
- 15 May, 2019 1 commit
-
-
Igor authored
* Set the --clone_model_in_keras_dist_strat to None. Remove the separate no_cloning benchmarks and add a couple of cloning ones. Fixes the learning rate schedule to cache its ops per graph.
-
- 08 May, 2019 1 commit
-
-
Toby Boyd authored
-
- 29 Apr, 2019 3 commits
-
-
Igor authored
Replace per_device with per_replica and PerDevice with PerReplica, because the PerDevice concept was renamed and doesn't exist anymore. (#6693) * Replace per_device with per_replica and PerDevice with PerReplica, because the PerReplica concept was renamed and doesn't exist anymore.
-
Toby Boyd authored
* Add accuracy check. * Avoid double flag init, move data_dir to real data. * Comment on lower accuracy target.
-
Igor authored
* Add benchmarks with the --cloning flag to Resnet and NFC. * Renamed cloning to clone_model_in_keras_dist_strat. Dropped a few tests that aren't essential. * Fixed up the formatting after re-naming the flag to a much longer name. Thanks, lint. * Fixed the lint error in nfc_common.py
-
- 22 Apr, 2019 2 commits
-
-
Toby Boyd authored
* Use tf.image.resize_with_crop_or_pad * exp_per_second and hr_at_10
-
Shining Sun authored
-
- 20 Apr, 2019 2 commits
-
-
Shining Sun authored
-
Shining Sun authored
* Remove contrib imports, or move them inline * Use exposed API for FixedLenFeature * Replace tf.logging with absl logging * Change GFile to v2 APIs * replace tf.logging with absl loggin in movielens * Fixing an import bug * Change gfile to v2 APIs in code * Swap to keras optimizer v2 * Bug fix for optimizer * Change tf.log to tf.keras.backend.log * Change the loss function to keras loss * convert another loss to keras loss * Resolve comments and fix lint * Add a doc string * Fix existing tests and add new tests for DS * Added tests for multi-replica * Fix lint * resolve comments * make estimator run in tf2.0 * use compat v1 loss * fix lint issue
-
- 18 Apr, 2019 1 commit
-
-
Shining Sun authored
-
- 08 Apr, 2019 1 commit
-
-
Shining Sun authored
* add ds support for ncf * remove comments for in_top_k * avoid expanding the input layers * resolve comments and fix lint * Added some comments in code and fix lint * fix lint * add some documentation * add tensorflow imports
-
- 02 Apr, 2019 1 commit
-
-
Shining Sun authored
-
- 28 Mar, 2019 1 commit
-
-
Shining Sun authored
* initial commit * bug fix * Move build_stats from common to keras main, because it is only applicable in keras * remove tailing blank line * add test for synth data * add kwargs to init * add kwargs to function invokation * correctly pass kwargs * debug * debug * debug * fix super init * bug fix * fix local_flags * fix import * bug fix * fix log_steps flag * bug fix * bug fix: add missing return value * resolve double-defined flags * lint fix * move log_steps flag to benchmarK flag * fix lint * lint fix * lint fix * try flag core default values * bug fix * bug fix * bug fix * debug * debug * remove debug prints * rename benchmark methods * flag bug fix for synth benchmark
-
- 27 Mar, 2019 1 commit
-
-
cclauss authored
* from NCF_input import NCFDataset for line 181 The type __NCFDataset__ is used in the type declaration on line 81 but it is never imported. [flake8](http://flake8.pycqa.org) testing of https://github.com/tensorflow/models on Python 3.7.1 $ __flake8 . --count --select=E9,F63,F72,F82 --show-source --statistics__ ``` ./official/recommendation/data_preprocessing.py:180:3: F821 undefined name 'NCFDataset' # type: (str, str, dict, typing.Optional[str], bool, typing.Optional[str]) -> (NCFDataset, typing.Callable) ^ 1 F821 undefined name 'NCFDataset' 1 ``` __E901,E999,F821,F822,F823__ are the "_showstopper_" [flake8](http://flake8.pycqa.org) issues that can halt the runtime with a SyntaxError, NameError, etc. These 5 are different from most other flake8 issues which are merely "style violations" -- useful for readability but they do not effect runtime safety. * F821: undefined name `name` * F822: undefined name `name` in `__all__` * F823: local variable name referenced before assignment * E901: SyntaxError or IndentationError * E999: SyntaxError -- failed to compile a file into an Abstract Syntax Tree * int, int, data_pipeline.BaseDataConstructor
-
- 26 Mar, 2019 1 commit
-
-
cclauss authored
https://mypy.readthedocs.io/en/latest/cheat_sheet.html [flake8](http://flake8.pycqa.org) testing of https://github.com/tensorflow/models on Python 3.7.1 $ __flake8 . --count --select=E9,F63,F72,F82 --show-source --statistics__ ``` ./official/recommendation/data_pipeline.py:346:41: F821 undefined name 'string' epoch_dir=None # type: string ^ ```
-
- 18 Mar, 2019 1 commit
-
-
Bruce Fontaine authored
* Add support for TPUEstimator to data processing pipeline and add the ability to store epochs in user specified location.
-
- 13 Mar, 2019 1 commit
-
-
Shining Sun authored
* Fix ncf test for keras * add a todo for batch_size and eval_batch_size for ncf keras * lint fix * fix typos * Lint fix * fix lint * resolve pr comment * resolve pr comment
-
- 02 Mar, 2019 1 commit
-
-
Taylor Robie authored
* fix resnet breakage and add keras end-to-end tests * delint * address PR comments
-
- 01 Mar, 2019 1 commit
-
-
Shining Sun authored
* tmp commit * tmp commit * first attempt (without eval) * Bug fixes * bug fixes * training done * Loss NAN, no eval * Loss weight problem solved * resolve the NAN loss problem * Problem solved. Clean up needed * Added a todo * Remove debug prints * Extract get_optimizer to ncf_common * Move metrics computation back to neumf; use DS.scope api * Extract DS.scope code to utils * lint fixes * Move obtaining DS above producer.start to avoid race condition * move pt 1 * move pt 2 * Update the run script * Wrap keras_model related code into functions * Update the doc for softmax_logitfy and change the method name * Resolve PR comments * working version with: eager, DS, batch and no masks * Remove git conflict indicator * move reshape to neumf_model * working version, not converge * converged * fix a test * more lint fix * more lint fix * more lint fixes * more lint fix * Removed unused imports * fix test * dummy commit for kicking of checks * fix lint issue * dummy input to kick off checks * dummy input to kick off checks * add collective to dist strat * addressed review comments * add a doc string
-
- 30 Jan, 2019 1 commit
-
-
Tayo Oguntebi authored
Explicitly allow for script execution from any directory. Make env vars visible in python script. (#6105)
-
- 08 Jan, 2019 5 commits
-
-
Taylor Robie authored
-
Taylor Robie authored
-
Taylor Robie authored
-
Taylor Robie authored
-
Taylor Robie authored
-
- 07 Jan, 2019 11 commits
-
-
Taylor Robie authored
-
Taylor Robie authored
-
Taylor Robie authored
-
Taylor Robie authored
-
Taylor Robie authored
This reverts commit 63f5827d.
-
Taylor Robie authored
-
Taylor Robie authored
-
Taylor Robie authored
-
Taylor Robie authored
-
Taylor Robie authored
-
Taylor Robie authored
-