1. 18 Jul, 2019 3 commits
    • Merged commit includes the following changes: (#7252) · 1fb34e76
      Hongkun Yu authored
      258597234  by rxsang<rxsang@google.com>:
      
          Update all the TPUStrategy examples to use the new v2 APIs, i.e.
          make_dataset_iterator -> experimental_distribute_dataset,
          make_input_fn_iterator -> experimental_distribute_datasets_from_function,
          unwrap -> experimental_local_results,
          experimental_run -> experimental_run_v2
      
      --
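The four renames listed above can be written down as a lookup table. The following is a minimal sketch, not the tooling used in the commit: `migrate_calls` is a hypothetical helper that rewrites method names only. The v2 methods may take different arguments than their predecessors, so call sites would still need manual review.

```python
import re

# Old -> new method names, copied from the commit message above.
RENAMES = {
    "make_dataset_iterator": "experimental_distribute_dataset",
    "make_input_fn_iterator": "experimental_distribute_datasets_from_function",
    "unwrap": "experimental_local_results",
    "experimental_run": "experimental_run_v2",
}

# Match only method calls such as `.unwrap(`. Requiring the opening
# parenthesis keeps `.experimental_run_v2(` from being rewritten again.
_CALL = re.compile(r"\.(" + "|".join(map(re.escape, RENAMES)) + r")\s*\(")


def migrate_calls(source: str) -> str:
    """Hypothetical helper: rename old strategy method calls to the v2 names."""
    return _CALL.sub(lambda m: "." + RENAMES[m.group(1)] + "(", source)


if __name__ == "__main__":
    print(migrate_calls("results = strategy.unwrap(per_replica)"))
```

Because the pattern never matches an already migrated name, running the helper twice leaves the text unchanged.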
      258581998  by taylorrobie<taylorrobie@google.com>:
      
          Update Keras v2 optimizers to reuse coefficients that are shared across all updates. This reduces the total number of ops created by 5% (for simple optimizers such as SGD and Adagrad) to 25% (for complicated optimizers such as Adam and NAdam). Separate copies are made for each device and dtype.

          The effect of this change on run time is fairly minimal since Grappler is expected to consolidate most of these ops; however, it does improve graph construction time.
      
      --
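The idea of reusing shared coefficients, with one copy per device and dtype, can be illustrated without TensorFlow. This is a rough sketch under stated assumptions, not the Keras implementation: `CoefficientCache` and `adam_coefficients` are hypothetical names, and plain Python floats stand in for graph ops.

```python
from typing import Callable, Dict, Tuple


class CoefficientCache:
    """Hypothetical sketch: build shared coefficients once per (device, dtype)."""

    def __init__(self, build: Callable[[str, str], dict]):
        self._build = build
        self._cache: Dict[Tuple[str, str], dict] = {}
        self.builds = 0  # how many times coefficients were actually constructed

    def get(self, device: str, dtype: str) -> dict:
        key = (device, dtype)
        if key not in self._cache:
            self.builds += 1
            self._cache[key] = self._build(device, dtype)
        return self._cache[key]


def adam_coefficients(device, dtype, lr=0.001, beta_1=0.9, beta_2=0.999, step=1):
    # Values that are identical for every variable updated in the same step.
    return {
        "lr_t": lr * (1 - beta_2 ** step) ** 0.5 / (1 - beta_1 ** step),
        "one_minus_beta_1": 1 - beta_1,
        "one_minus_beta_2": 1 - beta_2,
    }


cache = CoefficientCache(adam_coefficients)
for _variable in range(10):  # ten variables sharing one device and dtype
    cache.get("/gpu:0", "float32")
cache.get("/gpu:0", "float16")  # a different dtype gets its own copy
print(cache.builds)
```

Without the cache, each of the ten variable updates would rebuild the same values; with it, construction happens once per distinct (device, dtype) pair.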
      
      PiperOrigin-RevId: 258597234
    • Refactor and add benchmarks as well as accuracy tests for GPU and CPU (#7248) · e0a2b8c3
      Toby Boyd authored
      * Add benchmarks and common flags.

      * Add CPU tests.

      * Add tracking of epoch times.

      * Fix transformer.

      * Add examples_per_second.

      * Fix pylint.
    • Improve Keras graph performance for ResNet56 (#7241) · dd5a91d3
      Haoyu Zhang authored
      * Configure the thread pool, cuDNN persistent batch normalization, and the Grappler layout optimizer properly for ResNet56

      * Add tweaked tests for ResNet56

      * Avoid the overhead of the last partial batch by explicitly dropping the remainder
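Dropping the remainder means discarding the final batch when it holds fewer than `batch_size` elements, so every step sees the same batch shape. The sketch below shows the behaviour in plain Python. The `batch` function is a hypothetical stand-in. In `tf.data`, the same effect comes from the `drop_remainder` argument of `Dataset.batch`, though whether the commit uses exactly that call is an assumption.

```python
from typing import Iterable, Iterator, List, TypeVar

T = TypeVar("T")


def batch(items: Iterable[T], batch_size: int,
          drop_remainder: bool = False) -> Iterator[List[T]]:
    """Group items into lists of batch_size, optionally dropping a short tail."""
    current: List[T] = []
    for item in items:
        current.append(item)
        if len(current) == batch_size:
            yield current
            current = []
    # A non-empty `current` here is the final partial batch.
    if current and not drop_remainder:
        yield current


print([len(b) for b in batch(range(10), 4)])                       # [4, 4, 2]
print([len(b) for b in batch(range(10), 4, drop_remainder=True)])  # [4, 4]
```

The trade-off is that up to `batch_size - 1` examples per epoch are skipped in exchange for uniform batch shapes.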
  2. 16 Jul, 2019 2 commits
  3. 15 Jul, 2019 2 commits
  4. 11 Jul, 2019 5 commits
  5. 09 Jul, 2019 1 commit
  6. 08 Jul, 2019 2 commits
  7. 03 Jul, 2019 1 commit
    • Unit tests pass TF 2.0 GPU and CPU locally. (#7101) · 49097655
      Toby Boyd authored
      * Fix unit test failures.

      * 96% of TF 2.0 tests on GPU are passing.

      * All TF 2.0 tests now pass on GPU and CPU.

      * Address code comments.

      * Use the TF 2.0 cast.

      * Add a comment about working on TF 2.0 CPU.

      * Turn off contrib usage for TF 2.0.

      * Fix wide_deep and add keras_common_tests.

      * Use context to get num_gpus.

      * Switch to tf.keras.metrics.
  8. 02 Jul, 2019 3 commits
  9. 28 Jun, 2019 4 commits
    • Add FP16 end-to-end tests (#7122) · 58a3de6c
      Toby Boyd authored
    • NCF CTL Perf optimization to convert gradients from sparse to dense (#7102) · 44ff121d
      nnigania authored
      * Borrow a TF 1.x optimization that converts gradients from sparse to dense for better performance

      * Clean up after code review
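A sparse gradient for an embedding table records only the rows that were touched, as a list of row indices plus one update per index. Converting it to dense means scattering those updates into a full-size array and summing duplicates. The following is a minimal sketch of that conversion in plain Python. `sparse_to_dense` is a hypothetical helper, not the code from the commit, which presumably operates on TensorFlow gradient objects.

```python
from typing import List, Sequence


def sparse_to_dense(indices: Sequence[int],
                    values: Sequence[Sequence[float]],
                    num_rows: int) -> List[List[float]]:
    """Scatter-add per-row updates into a dense num_rows x width gradient."""
    width = len(values[0]) if values else 0
    dense = [[0.0] * width for _ in range(num_rows)]
    for row, update in zip(indices, values):
        for col, value in enumerate(update):
            dense[row][col] += value  # repeated indices accumulate
    return dense


# Row 2 appears twice, so its two updates are summed.
dense = sparse_to_dense([2, 0, 2],
                        [[1.0, 1.0], [0.5, 0.5], [2.0, 2.0]],
                        num_rows=4)
print(dense)  # [[0.5, 0.5], [0.0, 0.0], [3.0, 3.0], [0.0, 0.0]]
```

The dense form uses more memory than the sparse one, but it can let later steps such as gradient aggregation use faster dense kernels.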
    • Merged commit includes the following changes: (#7119) · 5afa9569
      saberkun authored
      * Merged commit includes the following changes:
      255493073  by hongkuny<hongkuny@google.com>:
      
          BERT initial OSS readme update.
      
      --
      255470372  by dmchen<dmchen@google.com>:
      
          Slightly expand expected range for F1 score in BERT SQuAD accuracy test
      
      --
      255109240  by hongkuny<hongkuny@google.com>:
      
          Update eval/predict batch sizes.
      
      --
      255010016  by hongkuny<hongkuny@google.com>:
      
          Internal
      
      --
      254874613  by hongkuny<hongkuny@google.com>:
      
          Update glue tasks enum to match directory name
      
      --
      254866171  by taylorrobie<taylorrobie@google.com>:
      
          Internal change

      --
      254785517  by zongweiz<zongweiz@google.com>:
      
          Use train_single_step for BERT GPU models to temporarily work around some performance bugs in GPU runs
      
      --
      254497647  by hongkuny<hongkuny@google.com>:
      
          Fix device placement for TPU export model.
      
      --
      
      PiperOrigin-RevId: 255493073
      
      * Update README.md
    • Merged commit includes the following changes: (#7116) · 76995053
      David M. Chen authored
      255493073  by hongkuny<hongkuny@google.com>:
      
          BERT initial OSS readme update.
      
      --
      255470372  by dmchen<dmchen@google.com>:
      
          Slightly expand expected range for F1 score in BERT SQuAD accuracy test
      
      --
      255109240  by hongkuny<hongkuny@google.com>:
      
          Update eval/predict batch sizes.
      
      --
      255010016  by hongkuny<hongkuny@google.com>:
      
          Internal
      
      --
      
      PiperOrigin-RevId: 255493073
  10. 25 Jun, 2019 1 commit
  11. 24 Jun, 2019 2 commits
  12. 22 Jun, 2019 1 commit
  13. 21 Jun, 2019 5 commits
  14. 20 Jun, 2019 8 commits