- 02 Aug, 2019 1 commit
-
-
SparkSnail authored
-
- 25 Jun, 2019 3 commits
-
-
Zejun Lin authored
* fix a bug * fix a bug
-
Zejun Lin authored
fix bug for PR #1201
-
Zejun Lin authored
* update * dev-tf-master * fix bugs * fix bugs * remove unnecessary lines * dev oneshot and darts * dev nas * dev enas and oneshot * dev enas and oneshot * dev enas and oneshot * dev oneshot and enas * dev oneshot * add ut * add docstring * add docstring * fix * resolve comments by changing docstring * resolve comments
-
- 21 Jun, 2019 1 commit
-
-
SparkSnail authored
-
- 20 Jun, 2019 1 commit
-
-
demianzhang authored
* fix local and remote training services tslint
-
- 19 Jun, 2019 1 commit
-
-
Hongarc authored
-
- 18 Jun, 2019 1 commit
-
-
SparkSnail authored
-
- 17 Jun, 2019 1 commit
-
-
demianzhang authored
* remove check powershell policy * remove policy dependency * update
-
- 30 May, 2019 1 commit
-
-
Mohit Anand authored
-
- 28 May, 2019 2 commits
-
-
SparkSnail authored
-
demianzhang authored
-
- 22 May, 2019 1 commit
-
-
SparkSnail authored
-
- 24 Apr, 2019 1 commit
-
-
demianzhang authored
-
- 22 Apr, 2019 1 commit
-
-
demianzhang authored
-
- 19 Apr, 2019 1 commit
-
-
SparkSnail authored
Fix issue #890
-
- 18 Apr, 2019 1 commit
-
-
chicm-ms authored
* Refactoring local training service * Designated GPU for local training service * RemoteMachine designated GPU configuration
-
- 29 Mar, 2019 1 commit
-
-
SparkSnail authored
-
- 22 Mar, 2019 1 commit
-
-
SparkSnail authored
If user set remoteloggingType in config file, log content will not be transmitted from trialkeeper
-
- 21 Mar, 2019 1 commit
-
-
SparkSnail authored
In nnictl, we support debug mode from config file and --debug. If users does not set debug: true in config, nnictl will use --debug value.
-
- 18 Mar, 2019 1 commit
-
-
SparkSnail authored
-
- 15 Mar, 2019 1 commit
-
-
SparkSnail authored
check nni version in trialkeeper, to make sure the version of trialkeeper is consistent with trainingService add a debug mode in config file
-
- 25 Feb, 2019 1 commit
-
-
SparkSnail authored
* add trialkeeper_stdout and trialkeeper_stderr * fix nnictl set remote nniManagerIP
-
- 29 Jan, 2019 1 commit
-
-
SparkSnail authored
* fix remote bug * add document * add document * update * update * update * update * fix remote issue * fix forEach * update doc according to comments * update * update * update * remove 'any more' * add base version for remote-log * change launcher.py * test * basic version * debug * debug * basic work version * fix code * update disable_log * remove unused line * add diable log in kubernetesTrainingService * add detect frameworkcontroller * fix comment * update * update * fix kubernetesData * debug * debug * debug * fix comment * fix conflict * remove local temp files * revert launcher.py * update code by comments * remove disableLog * remove disable Log * set timeout for cleanup * fix code by comments * update variable names * add comments * add delay function * update * update * update by comments * add in remote script path * rename variables * update variable name * add mkdir -p for subfolder
-
- 24 Jan, 2019 1 commit
-
-
chicm-ms authored
* Pull code (#22) * Support distributed job for frameworkcontroller (#612) support distributed job for frameworkcontroller * Multiphase doc (#519) * multiPhase doc * updates * updates * Add time parser for 'nnictl update duration' (#632) Current nnictl update duration only support seconds unit, add a parser for this command to support {s, m, h, d} * fix experiment state bug (#629) * update top README.md (#622) * Update README.md * update (#634) * Integration tests refactoring (#625) * Integration test refactoring (#21) (#616) * Integration test refactoring (#21) * Refactoring integration tests * test metrics * update azure pipeline * updates * updates * updates * updates * updates * updates * updates * updates * updates * updates * updates * updates * updates * updates * updates * updates * updates * updates * updates * updates * update trigger * Integration test refactoring (#618) * updates * updates * update pipeline (#619) * update pipeline * updates * updates * updates * updates * updates * test pipeline (#623) * test pipeline * updates * updates * updates * Update integration test (#624) * Update integration test * updates * updates * updates * updates * updates * updates * Revert "Pull code (#22)" This reverts commit 62fc165ad7b2ba724eead3b99f010aa34491e2c7. * Configurable nniManager log path * Configure log level * add --debug command line for nnictl * updates
-
- 23 Jan, 2019 1 commit
-
-
Zejun Lin authored
* fix nni-sdk version problem * fix * fix installation * fix installation * fix installation * resolve suggestion * resolve suggestion * resolve suggestion * resolve suggestion * refactor logic * refactor logic * resolve comments
-
- 11 Jan, 2019 1 commit
-
-
Linjie Xu authored
* add module info for launcher check * update launcher.py Add packages check before start restful server. * check sub-key * modify mistype * delete non-tuner in constants/ModuleName * Delete ModuleName to prevent double maintain * only catch ModuleNotFoundError
-
- 06 Jan, 2019 2 commits
-
-
Zejun Lin authored
-
Zejun Lin authored
* support venv * adapt venv * adapt venv * adapt venv * adapt venv * new test * new test * new test * support venv * support venv * support venv * support venv * support venv * support venv * support venv * colorful output for mac * colorful output for mac * permission denied in /tmp * permission denied in /tmp * permission denied in /tmp * remove unused variable * final * remove build python
-
- 25 Dec, 2018 1 commit
-
-
SparkSnail authored
Add frameworkcontroller training service based on kubeflow training service. Refactor code structure, add kubernetes training service as father class, and set kubeflow training service and frameworkcontroller training service as child class.
-
- 13 Dec, 2018 1 commit
-
-
SparkSnail authored
Add nnictl top command to monitor the nni experiments.
-
- 11 Dec, 2018 1 commit
-
-
chicm-ms authored
-
- 30 Nov, 2018 1 commit
-
-
QuanluZhang authored
* support hyperband * add example for hyperband * register Hyperband in tuner * after debug * update doc * trivial change * update spec validation of yaml config * modify nnictl launcher * modify nnimanager and util to support advisor * Quick fix nnictl config logic (#289) * fix nnictl bug * fix install.sh * add desc for Dockerfile.build.base * update document for Dockerfile * update * refactor port detect * update * refactor NNICTLDOC.md * add document for pai and nnictl * add default value for port * add exception handling in trial_keeper.py * fix port bug * fix resume * fix nnictl resume and fix nnictl stop * fix document * update * refactor nnictl * update * update doc * update * update nnictl * fix comment * revert dockerfile * update * update * update * fix nnictl error hit * fix comments * fix bash-completion * fix paramiko install * quick fix resume logic * update * quick fix nnictl * refactor sdk main * update unit test accordingly * update example's config file * update restserver validation * PR merge to 0.3 (#297) * refactor doc * update with Mao's suggestions * Set theme jekyll-theme-dinky * update doc * fix links * fix links * fix links * merge * fix links and doc errors * merge * merge * merge * merge * Update README.md (#288) added License badge * merge * updated the "Contribute" part (merged Gems' wiki in, updated ReadMe) * fix link * fix doc mistakes and broken links. (#271) * refactor doc * update with Mao's suggestions * Set theme jekyll-theme-dinky * updated the "Contribute" part (merged Gems' wiki in, updated ReadMe) * fix link * Update README.md * Fix misspelling in examples/trials/ga_squad/README.md * revise the installation cmd to v0.2 * revise to install v0.2 * remove files * update * remove enas readme (#292) * support checkpoint directory * Fix datastore performance issue (#301) * fix pylint * Fix nnictl in v0.3 (#299) Fix old version of config file fix sklearn requirements Fix resume log logic * modify log * trivial changes * update example * update makefile * update launcher.py to fix the problem of finding main.js * debug * add hyperparameter info into trial_end api * fix bug and update example * fix error induced by merge * support initialize * add doc for hyperband * fix bugs and add config_pai * fix bugs and add config_pai * fix bugs and add config_pai * fix bugs and add config_pai * update doc * add doc for advisor * fit * modification based on hui's comments * update doc
-
- 23 Nov, 2018 1 commit
-
-
SparkSnail authored
Add nniManager Ip in nnictl, pai TrainingService and kubeflow TrainingService. If users set nniManagerIp, pai and kubeflow will use this ip instead of using getIPV4() function. Web UI will also use this nniManagerIp.
-
- 22 Nov, 2018 2 commits
-
-
fishyds authored
[Kubeflow training service] Update kubeflow exp job config schema to support distributed training (#387) * Support distributed training on tf-operator, for worker and ps * Update validation rule for kubeflow config * small code refactor adjustment for private methods * Use different output folder for ps and worker
-
chicm-ms authored
* Asynchronous dispatcher * updates * updates * updates * updates
-
- 20 Nov, 2018 1 commit
-
-
fishyds authored
* Kubeflow TrainingService support, v1 (#373) 1. Create new Training Service: kubeflow trainning service, use 'kubectl' and kubeflow tfjobs CRD to submit and manage jobs 2. Update nni python SDK to support new kubeflow platform 3. Update nni python SDK's get_sequende_id() implementation, read NNI_TRIAL_SEQ_ID env variable, instead of reading .nni/sequence_id file 4. This version only supports Tensorflow operator. Will add more operators' support in future versions
-
- 12 Nov, 2018 1 commit
-
-
QuanluZhang authored
* update makefile * update launcher.py to fix the problem of finding main.js * remove duplicated lib
-
- 09 Nov, 2018 2 commits
-
-
SparkSnail authored
Fix nnictl stop logic Fix localTrainingService cancelJob logic Show port information in "nnictl experiment list" cmd. Show more information when config file validate failed. Add nnictl detect adjacent port logic if the platform is pai
-
Gems Guo authored
-