1. 21 Jul, 2019 1 commit
  2. 20 Jul, 2019 1 commit
  3. 19 Jul, 2019 2 commits
  4. 18 Jul, 2019 1 commit
  5. 11 Jul, 2019 1 commit
  6. 03 Jul, 2019 1 commit
    • Toby Boyd's avatar
      Unit tests pass TF 2.0 GPU and CPU locally. (#7101) · 49097655
      Toby Boyd authored
      * Fix unit tests failures.
      
      * 96% of TF 2.0 tests on GPU are passing.
      
      * Currently all passing GPU and CPU TF 2.0
      
      * Address code comments.
      
      * use tf 2.0 cast.
      
      * Comment about working on TF 2.0 CPU
      
      * Uses contrib turn off for TF 2.0.
      
      * Fix wide_deep and add keras_common_tests.
      
      * use context to get num_gpus.
      
      * Switch to tf.keras.metrics
      49097655
  7. 02 Jul, 2019 2 commits
  8. 22 Jun, 2019 1 commit
  9. 21 Jun, 2019 2 commits
  10. 19 Jun, 2019 1 commit
    • Toby Boyd's avatar
      Add XLA to transformer (#7048) · 269581dc
      Toby Boyd authored
      
      
      * set default steps to 300K.
      
      * Log flags to perfzero.
      
      * Add XLA support to transformer
      
      - Moved config logic to keras_utils
      - Added enable_xla flag to _performance flags
      - Did not refactor enable_xla flag from keras resnet due to
        reliance on calling FLAGs in estimator keras and that is
        a needed refactor for another time.
      
      * fix g3 lint complaint.
      
      * Refactor set config into keras_utils.
      
      * Move flags out of main.
      
      * pipe through enable_xla
      
      * Update official/transformer/v2/misc.py
      Co-Authored-By: default avatarReed <reedwm@google.com>
      269581dc
  11. 06 Jun, 2019 1 commit
  12. 24 May, 2019 2 commits
    • Toby Boyd's avatar
      Transformer v2 benchmark (#6860) · f2ea2f53
      Toby Boyd authored
      * Moved common keras code to utils.
      
      * Initial 1 gpu benchmark
      
      - Aligned flags with resnet example
      - removed code/features that are not super useful
      - eval as part of train if bleu source/ref provided
      - add exp_per_second hook
      
      * Rename benchmark classes, pass batch-size and log_steps.
      
      * fix docstring
      
      * Predict done with checkpoints inline
      
      - perfzero baseclass
      
      * steps not epochs with smoother training loop.
      
      * do not initialize history outside loop.
      
      * 5000 between eval not 500
      
      * estimator to keras.
      
      * remove epochs var.
      
      * use range not xrange.
      
      * 200K steps for 1 gpu
      
      * fix global step
      f2ea2f53
    • Toby Boyd's avatar
      Moved common keras code to utils. (#6859) · 3254cabb
      Toby Boyd authored
      3254cabb
  13. 18 May, 2019 1 commit
  14. 15 May, 2019 1 commit
  15. 11 May, 2019 2 commits
  16. 01 May, 2019 1 commit
    • Reed's avatar
      Add --fp16_implementation option. (#6703) · b691578c
      Reed authored
      This options allows the new tf.train.experimental.enable_mixed_precision_graph_rewrite() function to be used for fp16, instead of manual casts.
      b691578c
  17. 29 Apr, 2019 1 commit
  18. 26 Apr, 2019 2 commits
  19. 25 Apr, 2019 1 commit
  20. 24 Apr, 2019 1 commit
  21. 11 Apr, 2019 1 commit
  22. 08 Apr, 2019 1 commit
    • Shining Sun's avatar
      Add DS support for NCF keras (#6447) · 1255d5b9
      Shining Sun authored
      * add ds support for ncf
      
      * remove comments for in_top_k
      
      * avoid expanding the input layers
      
      * resolve comments and fix lint
      
      * Added some comments in code and fix lint
      
      * fix lint
      
      * add some documentation
      
      * add tensorflow imports
      1255d5b9
  23. 03 Apr, 2019 1 commit
  24. 01 Apr, 2019 1 commit
  25. 29 Mar, 2019 1 commit
  26. 28 Mar, 2019 1 commit
    • Shining Sun's avatar
      Added benchmark test and convergence test for the NCF model (#6318) · 4c11b84b
      Shining Sun authored
      * initial commit
      
      * bug fix
      
      * Move build_stats from common to keras main, because it is only applicable in keras
      
      * remove tailing blank line
      
      * add test for synth data
      
      * add kwargs to init
      
      * add kwargs to function invokation
      
      * correctly pass kwargs
      
      * debug
      
      * debug
      
      * debug
      
      * fix super init
      
      * bug fix
      
      * fix local_flags
      
      * fix import
      
      * bug fix
      
      * fix log_steps flag
      
      * bug fix
      
      * bug fix: add missing return value
      
      * resolve double-defined flags
      
      * lint fix
      
      * move log_steps flag to benchmarK flag
      
      * fix lint
      
      * lint fix
      
      * lint fix
      
      * try flag core default values
      
      * bug fix
      
      * bug fix
      
      * bug fix
      
      * debug
      
      * debug
      
      * remove debug prints
      
      * rename benchmark methods
      
      * flag bug fix for synth benchmark
      4c11b84b
  27. 20 Mar, 2019 1 commit
  28. 19 Mar, 2019 1 commit
  29. 07 Mar, 2019 1 commit
  30. 02 Mar, 2019 1 commit
  31. 01 Mar, 2019 1 commit
    • Shining Sun's avatar
      Keras-fy NCF Model (#6092) · 048e5bff
      Shining Sun authored
      * tmp commit
      
      * tmp commit
      
      * first attempt (without eval)
      
      * Bug fixes
      
      * bug fixes
      
      * training done
      
      * Loss NAN, no eval
      
      * Loss weight problem solved
      
      * resolve the NAN loss problem
      
      * Problem solved. Clean up needed
      
      * Added a todo
      
      * Remove debug prints
      
      * Extract get_optimizer to ncf_common
      
      * Move metrics computation back to neumf; use DS.scope api
      
      * Extract DS.scope code to utils
      
      * lint fixes
      
      * Move obtaining DS above producer.start to avoid race condition
      
      * move pt 1
      
      * move pt 2
      
      * Update the run script
      
      * Wrap keras_model related code into functions
      
      * Update the doc for softmax_logitfy and change the method name
      
      * Resolve PR comments
      
      * working version with: eager, DS, batch and no masks
      
      * Remove git conflict indicator
      
      * move reshape to neumf_model
      
      * working version, not converge
      
      * converged
      
      * fix a test
      
      * more lint fix
      
      * more lint fix
      
      * more lint fixes
      
      * more lint fix
      
      * Removed unused imports
      
      * fix test
      
      * dummy commit for kicking of checks
      
      * fix lint issue
      
      * dummy input to kick off checks
      
      * dummy input to kick off checks
      
      * add collective to dist strat
      
      * addressed review comments
      
      * add a doc string
      048e5bff
  32. 28 Feb, 2019 2 commits
  33. 21 Feb, 2019 1 commit
    • Ayush Dubey's avatar
      Multi-worker support for Resnet. (#6206) · f2e90945
      Ayush Dubey authored
      * Update official resnet for multi worker training with distribution strategies.
      
      * Fixes for multi worker training.
      
      * Fix call to `get_distribution_strategy`.
      
      * Undo test change.
      
      * Fix spacing.
      
      * Move cluster configuration to distribution_utils.
      
      * Move train_and_evaluate out of loop.  Also, update docstrings for multi-worker flags and add use_train_and_evaluate flag.
      
      * Update distribution_strategy flag to match exported name for collective strategy.
      f2e90945