- 30 Apr, 2020 3 commits
-
-
George Cheng authored
-
SparkSnail authored
-
Chi Song authored
To support Windows node in remote mode, this PR adds a layer of commands (osCommands) to deal difference between Windows and Unix-like OS. To share code, ShellExecutor is added to enrich original SshClient class. I will implement windows version commands in next phase. This pattern can be expanded to Local or other platform in future, so I moved related code to common folder for sharing.
-
- 26 Apr, 2020 1 commit
-
-
Chi Song authored
Add shell support for ssh connection, so that remote script can be started with user environment. Minor fixes, 1. Fix gpu_metrics_collector to support pyenv. As pyenv will create one more process, so that original pgrep code always got extra processes, and cannot start gpu_metrics_collector. 2. Fix NASUI failure on dev-install-node-modules, to create subfolder every time. 3. Fix MakeFile to reduce mis-created links, and other minor issues. 4. Add node --watch for nni_manager for better dev experience.
-
- 05 Apr, 2020 1 commit
-
-
SparkSnail authored
-
- 25 Mar, 2020 1 commit
-
-
SparkSnail authored
-
- 02 Mar, 2020 1 commit
-
-
George Cheng authored
* skeleton of dlts training service (#1844) * Hello, DLTS! * Revert version * Remove fs-extra * Add some default cluster config * schema * fix * Optional cluster (default to `.default`) Depends on DLWorkspace#837 * fix * fix * optimize gpu type * No more copy * Format * Code clean up * Issue fix * Add optional fields in config * Issue fix * Lint * Lint * Validate email, password and team * Doc * Doc fix * Set TMPDIR * Use metadata instead of gpu_capacity * Cancel paused DLTS job * workaround lint rules * pylint * doc Co-authored-by:QuanluZhang <z.quanluzhang@gmail.com>
-
- 21 Feb, 2020 1 commit
-
-
leo authored
-
- 09 Feb, 2020 1 commit
-
-
QuanluZhang authored
-
- 07 Feb, 2020 1 commit
-
-
SparkSnail authored
-
- 03 Feb, 2020 1 commit
-
-
SparkSnail authored
-
- 15 Jan, 2020 1 commit
-
-
chicm-ms authored
-
- 31 Dec, 2019 1 commit
-
-
SparkSnail authored
-
- 30 Dec, 2019 1 commit
-
-
SparkSnail authored
-
- 25 Dec, 2019 1 commit
-
-
SparkSnail authored
-
- 23 Dec, 2019 1 commit
-
-
SparkSnail authored
-
- 20 Dec, 2019 1 commit
-
-
Seung Ho Jang authored
-
- 18 Dec, 2019 1 commit
-
-
chicm-ms authored
* Fix local system as remote machine issue #1852
-
- 11 Dec, 2019 1 commit
-
-
chicm-ms authored
* enable eslint * remove tslint
-
- 10 Dec, 2019 3 commits
-
-
chicm-ms authored
* update eslint rules * auto fix eslint * manually fix eslint (#1833)
-
Seung Ho Jang authored
-
chicm-ms authored
-
- 25 Nov, 2019 1 commit
-
-
liuzhe-lz authored
-
- 22 Nov, 2019 2 commits
-
-
SparkSnail authored
-
SparkSnail authored
-
- 21 Nov, 2019 1 commit
-
-
liuzhe-lz authored
* fix gpu script permission issue * make gpu tool local to user
-
- 11 Nov, 2019 1 commit
-
-
SparkSnail authored
-
- 08 Nov, 2019 1 commit
-
-
chicm-ms authored
-
- 06 Nov, 2019 1 commit
-
-
SparkSnail authored
-
- 05 Nov, 2019 1 commit
-
-
chicm-ms authored
* show failed job log
-
- 31 Oct, 2019 1 commit
-
-
SparkSnail authored
-
- 28 Oct, 2019 1 commit
-
-
SparkSnail authored
-
- 14 Oct, 2019 1 commit
-
-
Yuge Zhang authored
-
- 26 Sep, 2019 1 commit
-
-
liuzhe-lz authored
* Refactor web UI to support incremental metric loading * refactor * Remove host job * Move sequence ID to NNI manager * implement incremental loading
-
- 29 Aug, 2019 1 commit
-
-
SparkSnail authored
-
- 26 Aug, 2019 1 commit
-
-
SparkSnail authored
-
- 14 Aug, 2019 1 commit
-
-
Guoxin authored
* squash commits in v1.0 first round bug bash
-
- 12 Aug, 2019 3 commits
-
-
Yuge Zhang authored
Fix the issue that date nanoseconds does not work under macOS
-
SparkSnail authored
* change authFile to local path
-
suiguoxin authored
-