1. 05 Feb, 2019 1 commit
    • Goldie Gadde's avatar
      tf_upgrade_v2 on resnet and utils folders. (#6154) · d6b2b83c
      Goldie Gadde authored
      * Add resnet56 short tests. (#6101)
      
      * Add resnet56 short tests.
      - created base benchmark module
      - renamed accuracy test class to contain the word Accuracy
      which will result in a need to update all the jobs
      and a loss of history but is worth it.
      - short tests are mostly copied from shining with oss refactor
      
      * Address feedback.
      
      * Move flag_methods to init
      - Address setting default flags repeatedly.
      
      * Rename accuracy tests.
      
      * Lint errors resolved.
      
      * fix model_dir set to flags.data_dir.
      
      * fixed not fulling pulling out flag_methods.
      
      * Use core mirrored strategy in official models (#6126)
      
      * Imagenet short tests (#6132)
      
      * Add short imagenet tests (taken from seemuch)
      - also rename to match go forward naming
      
      * fix method name
      
      * Update doc strings.
      
      * Fixe gpu number.
      
      * points default data_dir to child folder. (#6131)
      
      Failed test is python2  and was a kokoro failure
      
      * Imagenet short tests (#6136)
      
      * Add short imagenet tests (taken from seemuch)
      - also rename to match go forward naming
      
      * fix method name
      
      * Update doc strings.
      
      * Fixe gpu number.
      
      * Add fill_objects
      
      * fixed calling wrong class in super.
      
      * fix lint issue.
      
      * Flag (#6121)
      
      * Fix the turn_off_ds flag problem
      
      * add param names to all args
      
      * Export benchmark stats using tf.test.Benchmark.report_benchmark() (#6103)
      
      * Export benchmark stats using tf.test.Benchmark.report_benchmark()
      
      * Fix python style using pyformat
      
      * Typos. (#6120)
      
      * log verbosity=2 logs every epoch no progress bars (#6142)
      
      * tf_upgrade_v2 on resnet and utils folder.
      
      * tf_upgrade_v2 on resnet and utils folder.
      d6b2b83c
  2. 20 Jun, 2018 1 commit
    • Taylor Robie's avatar
      Wide Deep refactor and deep movies (#4506) · 20070ca4
      Taylor Robie authored
      * begin branch
      
      * finish download script
      
      * rename download to dataset
      
      * intermediate commit
      
      * intermediate commit
      
      * misc tweaks
      
      * intermediate commit
      
      * intermediate commit
      
      * intermediate commit
      
      * delint and update census test.
      
      * add movie tests
      
      * delint
      
      * fix py2 issue
      
      * address PR comments
      
      * intermediate commit
      
      * intermediate commit
      
      * intermediate commit
      
      * finish wide deep transition to vanilla movielens
      
      * delint
      
      * intermediate commit
      
      * intermediate commit
      
      * intermediate commit
      
      * intermediate commit
      
      * fix import
      
      * add default ncf csv construction
      
      * change default on download_if_missing
      
      * shard and vectorize example serialization
      
      * fix import
      
      * update ncf data unittests
      
      * delint
      
      * delint
      
      * more delinting
      
      * fix wide-deep movielens serialization
      
      * address PR comments
      
      * add file_io tests
      
      * investigate wide-deep test failure
      
      * remove hard coded path and properly use flags.
      
      * address file_io test PR comments
      
      * missed a hash_bucked_size
      20070ca4
  3. 12 Jun, 2018 1 commit
  4. 04 Jun, 2018 1 commit
    • Taylor Robie's avatar
      First pass at a TPU loop for Transformer (#4296) · 2eeb85fe
      Taylor Robie authored
      * port changes from previous branch now that transformer util changes are in master
      
      fix incorrect count
      
      correct (hopefully) treatment of batch_size
      
      set eval_metrics to a dummy function for now
      
      add some comments
      
      start bringing metrics to transformer TPU
      
      resolve logits shape
      
      metrics are now working except for tf.py_func metrics
      
      increase batch_size for tpu, and create summary host call
      
      fix host call
      
      reduce tpu default batch size
      
      further tune batch sizes
      
      add minibatch loss to summary
      
      handle case of single_iteration_train_steps > number points in an epoch
      
      begin to incorporate hooks
      
      add sleep workarounds
      
      disable hooks altogether
      
      generalize host call function and move to newly created tpu utils module
      
      remove all traces of params as an object
      
      switch from  to
      
      address some PR comments, and change the number of data points.
      
      minor tweaks
      
      add tpu dry run for testing, and use matmul for TPU embedding
      
      infeed/outfeed queue issue is fixed. Sleeps are no longer necessary
      
      add some documentation.
      
      cleanup and address PR comments
      
      delint
      
      add accelerator __init__
      
      fix embedding
      
      missed PR comment
      
      address PR comments
      
      fix validator bug
      
      rewrite cloud storage validator, and add oauth dependency to requirements.txt
      
      * delint
      2eeb85fe
  5. 25 May, 2018 1 commit
    • Karmel Allison's avatar
      Fix/log ex per sec (#4360) · d626b908
      Karmel Allison authored
      * Using BenchmarkLogger
      
      * Using BenchmarkLogger
      
      * Fixing tests
      
      * Linting fixes.
      
      * Adding comments
      
      * Moving mock logger
      
      * Moving mock logger
      
      * Glinting
      
      * Responding to CR
      
      * Reverting assertEmpty
      d626b908
  6. 11 May, 2018 1 commit
    • Qianli Scott Zhu's avatar
      Add benchmark logger that does stream upload to bigquery. (#4210) · 0270cac7
      Qianli Scott Zhu authored
      * Move the benchmark_uploader to new location.
      
      * Update benchmark logger to streaming upload.
      
      * Fix lint and unit test error.
      
      * delint.
      
      * Update the benchmark uploader test.
      
      Skip the import of benchmark_uploader when bigquery is not installed.
      
      * Merge the 2 classes of benchmark uploader into 1.
      
      * Address review comments.
      
      * delint.
      
      * Execute bigquery upload in a separate thread.
      
      * Change to use python six.moves for importing.
      
      * Address review comments and delint.
      
      * Address review comment.
      
      Adding comment for potential performance impact for model on CPU.
      
      * Fix random failure on py3.
      
      * Fix the order of flag saver to avoid the randomness.
      
      The test is broken when the benchmark_logger_type is set first, and
      validated when the benchmark_log_dir is not set yet.
      0270cac7
  7. 19 Apr, 2018 1 commit
    • Qianli Scott Zhu's avatar
      Benchmark update (#4034) · 21ec0e1b
      Qianli Scott Zhu authored
      * Update the benchmark logger to have default logging.
      
      1. Create global instance of benchmark logger, which default log to
      tf.logging.info
      2. Allow user to config the logging location.
      3. Fix nits in code and comment.
      
      * Fix lint and test error.
      
      * Address review comments.
      
      * Remove the duplicated print statement.
      21ec0e1b
  8. 03 Apr, 2018 1 commit
    • Karmel Allison's avatar
      Rename logging directory (#3860) · a0e3604f
      Karmel Allison authored
      * Updating name of logging package to avoid overwriting Python builtin logging.
      
      * Updating name of logging package to avoid overwriting Python builtin logging.
      a0e3604f
  9. 23 Mar, 2018 1 commit
    • Qianli Scott Zhu's avatar
      Resnet benchmark logging (#3704) · b9b44f7b
      Qianli Scott Zhu authored
      * Update reset model for benchmark logging.
      
      To enable benchmark logging, just add "--hooks LoggingMetricHook"
      
      * Benchmark logger fix for resnet.
      
      1. Update default at_end to False for metric logger to avoid
      checkpoint error.
      2. Update resnet run to log final evaluation result.
      
      * Update log output for final eval_result.
      
      * Typo fix.
      
      * Unset the default value for benchmark_log_dir.
      
      Usually the benchmark should be logged to different directly for
      each run. Having a default value will hide the choice from user.
      
      * Bug fix for benchmark logger initialization.
      
      * Fix lint error.
      
      * Address the review comment.
      
      1. Update the logger to cover evaluation result.
      2. Move the flag to performance parser.
      
      * Undo the change for arg_parser.
      b9b44f7b
  10. 20 Mar, 2018 2 commits
  11. 15 Mar, 2018 1 commit
    • Yanhui Liang's avatar
      Update resnet with logging utils (#3586) · 8f63feaa
      Yanhui Liang authored
      * Update resnet with logging utils
      
      * intermediate commit
      
      * commit before rebase from master
      
      * Add tests of ExamplePerSecondHook
      
      * Done with test
      
      * Fix a style nit
      
      * Fix a style nit
      8f63feaa
  12. 13 Mar, 2018 1 commit
    • Yanhui Liang's avatar
      Add logging utils (#3519) · 81a34cbe
      Yanhui Liang authored
      * Adding logging utils
      
      * restore utils
      
      * delete old file
      
      * update inputs and docstrings
      
      * Update import and fix typos
      
      * Fix formatting and comments
      
      * Update tests
      81a34cbe