1. 24 Apr, 2022 1 commit
  2. 20 Jan, 2022 1 commit
  3. 27 Aug, 2021 1 commit
  4. 06 Aug, 2021 1 commit
  5. 29 Jul, 2021 1 commit
  6. 12 Jul, 2021 1 commit
  7. 21 Jun, 2021 1 commit
  8. 03 Jun, 2021 1 commit
  9. 09 Apr, 2021 2 commits
  10. 08 Mar, 2021 1 commit
  11. 24 Feb, 2021 1 commit
  12. 30 Nov, 2020 1 commit
    • J-shang's avatar
      experiment management backend (#3081) · 95f731e4
      J-shang authored
      
      
      * step 1 nnictl generate experimentId & merge folder
      
      * step 2.1 modify .experiment structure
      
      * step 2.2 add lock for .experiment rw in nnictl
      
      * step 2.2 add filelock dependence
      
      * step 2.2 remove uniqueString from main.js
      
      * fix test bug
      
      * fix test bug
      
      * setp 3.1 add experiment manager
      
      * step 3.2 add getExperimentsInfo
      
      * fix eslint
      
      * add a simple file lock to support stale
      
      * step 3.3 add test
      
      * divide abs experiment manager from manager
      
      * experiment manager refactor
      
      * support .experiment sync update status
      
      * nnictl no longer uses rest api to update status or endtime
      
      * nnictl no longer uses rest api to update status or endtime
      
      * fix eslint
      
      * support .experiment sync update endtime
      
      * fix test
      
      * fix settimeout bug
      
      * fix test
      
      * adjust experiment endTime
      
      * separate simple file lock class
      
      * modify name
      
      * add 'id' in .experiment
      
      * update rest api format
      
      * fix eslint
      
      * fix issue in comments
      
      * fix rest api format
      
      * add indent in json in experiments manager
      
      * fix unittest
      
      * fix unittest
      
      * refector file lock
      
      * fix eslint
      
      * remove '__enter__' in filelock
      
      * filelock support never expire
      Co-authored-by: default avatarNing Shang <nishang@microsoft.com>
      95f731e4
  13. 25 Nov, 2020 1 commit
  14. 19 Oct, 2020 1 commit
  15. 21 Sep, 2020 1 commit
  16. 12 Aug, 2020 1 commit
  17. 07 May, 2020 1 commit
  18. 11 Dec, 2019 1 commit
  19. 10 Dec, 2019 1 commit
  20. 25 Nov, 2019 1 commit
  21. 21 Nov, 2019 1 commit
  22. 26 Sep, 2019 2 commits
  23. 19 Jun, 2019 1 commit
  24. 30 May, 2019 1 commit
  25. 27 May, 2019 1 commit
  26. 19 Apr, 2019 1 commit
  27. 15 Mar, 2019 1 commit
    • SparkSnail's avatar
      Support version check of nni (#807) · d0b22fc7
      SparkSnail authored
      check nni version in trialkeeper, to make sure the version of trialkeeper is consistent with trainingService
      add a debug mode in config file
      d0b22fc7
  28. 30 Jan, 2019 1 commit
  29. 25 Jan, 2019 1 commit
    • chicm-ms's avatar
      Refactoring nnimanager log (#652) · 6d591989
      chicm-ms authored
      * Pull code (#22)
      
      * Support distributed job for frameworkcontroller (#612)
      
      support distributed job for frameworkcontroller
      
      * Multiphase doc (#519)
      
      * multiPhase doc
      
      * updates
      
      * updates
      
      * Add time parser for 'nnictl update duration' (#632)
      
      Current nnictl update duration only support seconds unit, add a parser for this command to support {s, m, h, d}
      
      * fix experiment state bug (#629)
      
      * update top README.md (#622)
      
      * Update README.md
      
      * update (#634)
      
      * Integration tests refactoring (#625)
      
      * Integration test refactoring (#21) (#616)
      
      * Integration test refactoring (#21)
      
      * Refactoring integration tests
      
      * test metrics
      
      * update azure pipeline
      
      * updates
      
      * updates
      
      * updates
      
      * updates
      
      * updates
      
      * updates
      
      * updates
      
      * updates
      
      * updates
      
      * updates
      
      * updates
      
      * updates
      
      * updates
      
      * updates
      
      * updates
      
      * updates
      
      * updates
      
      * updates
      
      * updates
      
      * updates
      
      * update trigger
      
      * Integration test refactoring (#618)
      
      * updates
      
      * updates
      
      * update pipeline (#619)
      
      * update pipeline
      
      * updates
      
      * updates
      
      * updates
      
      * updates
      
      * updates
      
      * test pipeline (#623)
      
      * test pipeline
      
      * updates
      
      * updates
      
      * updates
      
      * Update integration test (#624)
      
      * Update integration test
      
      * updates
      
      * updates
      
      * updates
      
      * updates
      
      * updates
      
      * updates
      
      * Revert "Pull code (#22)"
      
      This reverts commit 62fc165ad7b2ba724eead3b99f010aa34491e2c7.
      
      * Update nnimanager logs
      
      * updates
      
      * Update README.md
      
      * Revert "Update README.md"
      
      This reverts commit bc67061160e5d57305a6e7fb63d491d12d0e9002.
      
      * updates
      
      * updates
      6d591989
  30. 24 Jan, 2019 1 commit
    • chicm-ms's avatar
      Configurable nniManager log path and log level (#644) · d9c83c0c
      chicm-ms authored
      * Pull code (#22)
      
      * Support distributed job for frameworkcontroller (#612)
      
      support distributed job for frameworkcontroller
      
      * Multiphase doc (#519)
      
      * multiPhase doc
      
      * updates
      
      * updates
      
      * Add time parser for 'nnictl update duration' (#632)
      
      Current nnictl update duration only support seconds unit, add a parser for this command to support {s, m, h, d}
      
      * fix experiment state bug (#629)
      
      * update top README.md (#622)
      
      * Update README.md
      
      * update (#634)
      
      * Integration tests refactoring (#625)
      
      * Integration test refactoring (#21) (#616)
      
      * Integration test refactoring (#21)
      
      * Refactoring integration tests
      
      * test metrics
      
      * update azure pipeline
      
      * updates
      
      * updates
      
      * updates
      
      * updates
      
      * updates
      
      * updates
      
      * updates
      
      * updates
      
      * updates
      
      * updates
      
      * updates
      
      * updates
      
      * updates
      
      * updates
      
      * updates
      
      * updates
      
      * updates
      
      * updates
      
      * updates
      
      * updates
      
      * update trigger
      
      * Integration test refactoring (#618)
      
      * updates
      
      * updates
      
      * update pipeline (#619)
      
      * update pipeline
      
      * updates
      
      * updates
      
      * updates
      
      * updates
      
      * updates
      
      * test pipeline (#623)
      
      * test pipeline
      
      * updates
      
      * updates
      
      * updates
      
      * Update integration test (#624)
      
      * Update integration test
      
      * updates
      
      * updates
      
      * updates
      
      * updates
      
      * updates
      
      * updates
      
      * Revert "Pull code (#22)"
      
      This reverts commit 62fc165ad7b2ba724eead3b99f010aa34491e2c7.
      
      * Configurable nniManager log path
      
      * Configure log level
      
      * add --debug command line for nnictl
      
      * updates
      d9c83c0c
  31. 19 Dec, 2018 1 commit
  32. 17 Dec, 2018 1 commit
  33. 20 Nov, 2018 1 commit
    • fishyds's avatar
      [Kubeflow Training Service] V1, merge from kubeflow branch to master branch (#382) · 806afeb6
      fishyds authored
      * Kubeflow TrainingService support, v1 (#373)
      
      1. Create new Training Service: kubeflow trainning service, use 'kubectl' and kubeflow tfjobs CRD to submit and manage jobs
      2. Update nni python SDK to support new kubeflow platform
      3. Update nni python SDK's get_sequende_id() implementation, read NNI_TRIAL_SEQ_ID env variable, instead of reading .nni/sequence_id file
      4. This version only supports Tensorflow operator. Will add more operators' support in future versions
      806afeb6
  34. 16 Nov, 2018 1 commit
  35. 05 Nov, 2018 1 commit
  36. 11 Oct, 2018 1 commit
    • chicm-ms's avatar
      Fix issue #124: Rest service api/v1/nni/metric-data can not retrieve metric... · 93dae28f
      chicm-ms authored
      Fix issue #124:  Rest service api/v1/nni/metric-data can not retrieve metric data for multiple trial jobs (#188)
      
      * Pull latest code (#2)
      
      * webui logpath and document (#135)
      
      * Add webui document and logpath as a href
      
      * fix tslint
      
      * fix comments by Chengmin
      
      * Pai training service bug fix and enhancement (#136)
      
      * Add NNI installation scripts
      
      * Update pai script, update NNI_out_dir
      
      * Update NNI dir in nni sdk local.py
      
      * Create .nni folder in nni sdk local.py
      
      * Add check before creating .nni folder
      
      * Fix typo for PAI_INSTALL_NNI_SHELL_FORMAT
      
      * Improve annotation (#138)
      
      * Improve annotation
      
      * Minor bugfix
      
      * Selectively install through pip (#139)
      
      Selectively install through pip 
      * update setup.py
      
      * fix paiTrainingService bugs (#137)
      
      * fix nnictl bug
      
      * add hdfs host validation
      
      * fix bugs
      
      * fix dockerfile
      
      * fix install.sh
      
      * update install.sh
      
      * fix dockerfile
      
      * Set timeout for HDFSUtility exists function
      
      * remove unused TODO
      
      * fix sdk
      
      * add optional for outputDir and dataDir
      
      * refactor dockerfile.base
      
      * Remove unused import in hdfsclientUtility
      
      * Add documentation for NNI PAI mode experiment (#141)
      
      * Add documentation for NNI PAI mode
      
      * Fix typo based on PR comments
      
      * Exit with subprocess return code of trial keeper
      
      * Remove additional exit code
      
      * Fix typo based on PR comments
      
      * update doc for smac tuner (#140)
      
      * Revert "Selectively install through pip (#139)" due to potential pip install issue (#142)
      
      * Revert "Selectively install through pip (#139)"
      
      This reverts commit 1d174836.
      
      * Add exit code of subprocess for trial_keeper
      
      * Update README, add link to PAImode doc
      
      * fix bug (#147)
      
      * Refactor nnictl and add config_pai.yml (#144)
      
      * fix nnictl bug
      
      * add hdfs host validation
      
      * fix bugs
      
      * fix dockerfile
      
      * fix install.sh
      
      * update install.sh
      
      * fix dockerfile
      
      * Set timeout for HDFSUtility exists function
      
      * remove unused TODO
      
      * fix sdk
      
      * add optional for outputDir and dataDir
      
      * refactor dockerfile.base
      
      * Remove unused import in hdfsclientUtility
      
      * add config_pai.yml
      
      * refactor nnictl create logic and add colorful print
      
      * fix nnictl stop logic
      
      * add annotation for config_pai.yml
      
      * add document for start experiment
      
      * fix config.yml
      
      * fix document
      
      * Fix trial keeper wrongly exit issue (#152)
      
      * Fix trial keeper bug, use actual exitcode to exit rather than 1
      
      * Fix bug of table sort (#145)
      
      * Update doc for PAIMode and v0.2 release notes (#153)
      
      * Update v0.2 documentation regards to release note and PAI training service
      
      * Update document to describe NNI docker image
      
      * Bug fix for SQuAD example tuner. (#134)
      
      * Update Makefile (#151)
      
      * test
      
      * update setup.py
      
      * update Makefile and install.sh
      
      * rever setup.py
      
      * change color
      
      * update doc
      
      * update doc
      
      * fix auto-completion's extra space
      
      * update Makefile
      
      * update webui
      
      * Update doc image (#163)
      
      * update doc
      
      * trivial
      
      * trivial
      
      * trivial
      
      * trivial
      
      * trivial
      
      * trivial
      
      * update image
      
      * update image size
      
      * Update ga squad (#104)
      
      * update readme in ga_squad
      
      * update readme
      
      * fix typo
      
      * Update README.md
      
      * Update README.md
      
      * Update README.md
      
      * update readme
      
      * sklearn examples (#169)
      
      * fix nnictl bug
      
      * fix install.sh
      
      * add sklearn-regression example
      
      * add sklearn classification
      
      * update sklearn
      
      * update example
      
      * remove additional code
      
      * Update batch tuner (#158)
      
      * update readme in ga_squad
      
      * update readme
      
      * fix typo
      
      * Update README.md
      
      * Update README.md
      
      * Update README.md
      
      * update readme
      
      * update batch tuner
      
      * Quickly fix cascading search space bug in tuner (#156)
      
      * update readme in ga_squad
      
      * update readme
      
      * fix typo
      
      * Update README.md
      
      * Update README.md
      
      * Update README.md
      
      * update readme
      
      * quickly fix cascading searchspace bug in tuner
      
      * Add iterative search space example (#119)
      
      * update readme in ga_squad
      
      * update readme
      
      * fix typo
      
      * Update README.md
      
      * Update README.md
      
      * Update README.md
      
      * update readme
      
      * add iterative search space example
      
      * update
      
      * update readme
      
      * change name
      
      * getMetricData supports multiple trial jobs
      93dae28f
  37. 14 Sep, 2018 2 commits