- 26 Apr, 2020 1 commit
-
-
Chi Song authored
Add shell support for SSH connections so that remote scripts start with the user's environment. Minor fixes:
1. Fix gpu_metrics_collector to support pyenv. pyenv creates one extra process, so the original pgrep code always matched extra processes and could not start gpu_metrics_collector.
2. Fix the NASUI failure in dev-install-node-modules by creating the subfolder every time.
3. Fix the Makefile to reduce mis-created links, and fix other minor issues.
4. Add node --watch for nni_manager for a better dev experience.
-
- 01 Apr, 2020 1 commit
-
-
liuzhe-lz authored
-
- 20 Mar, 2020 1 commit
-
-
liuzhe-lz authored
-
- 17 Feb, 2020 1 commit
-
-
chicm-ms authored
-
- 09 Feb, 2020 1 commit
-
-
QuanluZhang authored
-
- 07 Feb, 2020 1 commit
-
-
SparkSnail authored
-
- 15 Jan, 2020 1 commit
-
-
chicm-ms authored
-
- 02 Jan, 2020 1 commit
-
-
chicm-ms authored
-
- 18 Dec, 2019 1 commit
-
-
chicm-ms authored
-
- 11 Dec, 2019 1 commit
-
-
chicm-ms authored
* enable eslint
* remove tslint
-
- 05 Dec, 2019 1 commit
-
-
chicm-ms authored
* switch to eslint
-
- 20 Aug, 2019 1 commit
-
-
chicm-ms authored
-
- 19 Jul, 2019 1 commit
-
-
chicm-ms authored
* upgrade version of nni dependencies
-
- 10 Jul, 2019 1 commit
-
-
chicm-ms authored
-
- 20 Jun, 2019 1 commit
-
-
demianzhang authored
* fix local and remote training services tslint
-
- 22 Apr, 2019 2 commits
-
-
Zejun Lin authored
* fs's API changed
* fix version
* roll back utils
-
demianzhang authored
-
- 01 Apr, 2019 1 commit
-
-
SparkSnail authored
-
- 25 Feb, 2019 1 commit
-
-
demianzhang authored
Unit test for nnimanager
-
- 29 Jan, 2019 1 commit
-
-
SparkSnail authored
* fix remote bug
* add document
* add document
* update
* update
* update
* update
* fix remote issue
* fix forEach
* update doc according to comments
* update
* update
* update
* remove 'any more'
* add base version for remote-log
* change launcher.py
* test
* basic version
* debug
* debug
* basic work version
* fix code
* update disable_log
* remove unused line
* add disable log in kubernetesTrainingService
* add detect frameworkcontroller
* fix comment
* update
* update
* fix kubernetesData
* debug
* debug
* debug
* fix comment
* fix conflict
* remove local temp files
* revert launcher.py
* update code by comments
* remove disableLog
* remove disable Log
* set timeout for cleanup
* fix code by comments
* update variable names
* add comments
* add delay function
* update
* update
* update by comments
* add in remote script path
* rename variables
* update variable name
* add mkdir -p for subfolder
-
- 03 Jan, 2019 1 commit
-
-
chicm-ms authored
* Add UT code coverage report
* updates
* updates
* updates
* updates
* updates
* updates
* integration test python code coverage report
-
- 26 Dec, 2018 2 commits
-
-
Zejun Lin authored
* fix bug
* add docs
* add ut
* add ut
* add to ci
* update doc
* update doc
* update ut
* add ut to ci
* add ut to ci
* add ut to ci
* add ut to ci
* add ut to ci
* add ut to ci
* add ut to ci
* add ut to ci
* test
* test
* test
* test
* test
* test
* test
* test
* test
* test
* revert
* refactor
* refactor
* s
* merge
-
goooxu authored
-
- 21 Dec, 2018 1 commit
-
-
fishyds authored
* Dockerfile improvement
* Fix UT broken for kubeflow training service
-
- 17 Dec, 2018 1 commit
-
-
Gems Guo authored
-
- 14 Dec, 2018 1 commit
-
-
SparkSnail authored
The Kubernetes REST API does not base64-encode characters itself, so the username is now base64-encoded before the secret is created.
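The encoding step described in this commit can be sketched as follows. This is a minimal illustration, not the code from the commit: the helper names `encode_secret_value` and `build_secret_manifest` are hypothetical, and the manifest only assumes the standard Kubernetes Secret layout, where values under `data` must be base64 text.

```python
import base64


def encode_secret_value(value: str) -> str:
    """Return the base64 text form of a value for a Secret's `data` field."""
    return base64.b64encode(value.encode("utf-8")).decode("ascii")


def build_secret_manifest(name: str, username: str) -> dict:
    """Build a Secret manifest (hypothetical helper, for illustration only)."""
    return {
        "apiVersion": "v1",
        "kind": "Secret",
        "metadata": {"name": name},
        "type": "Opaque",
        # The caller encodes the value before the manifest is sent to the
        # API server; the server does not do it for `data` entries.
        "data": {"username": encode_secret_value(username)},
    }


if __name__ == "__main__":
    manifest = build_secret_manifest("example-secret", "admin")
    print(manifest["data"]["username"])
```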
-
- 13 Dec, 2018 1 commit
-
-
fishyds authored
[Kubeflow training service] Use Kubernetes API server to replace kubectl dependency
-
- 28 Nov, 2018 1 commit
-
-
SparkSnail authored
Support AKS for the kubeflow training service. Support nnictl set nniManagerIp.
-
- 27 Nov, 2018 1 commit
-
-
Yan Ni authored
* update Makefile for mac support, wait for aka.ms support
* refix Makefile for colorful echo
* update Makefile with shorturl
* fix false fail on mac webui
* fix cross os remote tmpdir issue
* add readonly to RemoteMachineTrainingService.remoteOS
* fix var name for PR 386
-
- 20 Nov, 2018 1 commit
-
-
fishyds authored
* Kubeflow TrainingService support, v1 (#373)
1. Create a new training service, the kubeflow training service, which uses 'kubectl' and the kubeflow tfjobs CRD to submit and manage jobs
2. Update the nni Python SDK to support the new kubeflow platform
3. Update the nni Python SDK's get_sequence_id() implementation to read the NNI_TRIAL_SEQ_ID env variable instead of the .nni/sequence_id file
4. This version only supports the TensorFlow operator. Support for more operators will be added in future versions
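Item 3 above, reading the trial sequence ID from the environment instead of a file, might look roughly like this. It is a sketch under assumptions, not the SDK's actual code: only the variable name NNI_TRIAL_SEQ_ID comes from the commit message, and the `default` fallback is a hypothetical addition for running outside a trial.

```python
import os


def get_sequence_id(default: int = 0) -> int:
    """Read the trial sequence ID from the NNI_TRIAL_SEQ_ID env variable.

    The `default` fallback is an assumption made for this sketch.
    """
    raw = os.environ.get("NNI_TRIAL_SEQ_ID")
    if raw is None:
        return default
    return int(raw)


if __name__ == "__main__":
    os.environ["NNI_TRIAL_SEQ_ID"] = "7"
    print(get_sequence_id())
```

Reading an environment variable avoids depending on the trial's working directory layout, which differs between local, remote, and container-based platforms.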
-
- 27 Sep, 2018 1 commit
-
-
fishyds authored
* PAI Training service implementation
1. Implement PAITrainingService
2. Add trial-keeper python module, and modify setup.py to install the module
3. Add PAITrainingService rest server to collect metrics from PAI container.
-
- 14 Sep, 2018 2 commits
-
-
fishyds authored
* Merge latest code changes into Github Master
* temporary modification for travis
* temporary modification for travis
-
liuzhe-lz authored
* Improve Makefile
* Bugfix
* Add make target for remote machine workers
* Update travis config
* Debug travis
* Debug travis
* Debug travis
* Debug travis
* Debug travis
* Debug travis
* Debug travis
* Install bash completion script
-
- 20 Aug, 2018 1 commit
-
-
Deshui Yu authored
-