- 05 Mar, 2020 1 commit
-
-
chicm-ms authored
-
- 02 Mar, 2020 1 commit
-
-
George Cheng authored
* skeleton of dlts training service (#1844) * Hello, DLTS! * Revert version * Remove fs-extra * Add some default cluster config * schema * fix * Optional cluster (default to `.default`) Depends on DLWorkspace#837 * fix * fix * optimize gpu type * No more copy * Format * Code clean up * Issue fix * Add optional fields in config * Issue fix * Lint * Lint * Validate email, password and team * Doc * Doc fix * Set TMPDIR * Use metadata instead of gpu_capacity * Cancel paused DLTS job * workaround lint rules * pylint * doc Co-authored-by:QuanluZhang <z.quanluzhang@gmail.com>
-
- 27 Feb, 2020 1 commit
-
-
SparkSnail authored
-
- 14 Feb, 2020 1 commit
-
-
SparkSnail authored
-
- 09 Feb, 2020 1 commit
-
-
QuanluZhang authored
-
- 07 Feb, 2020 1 commit
-
-
SparkSnail authored
-
- 08 Jan, 2020 1 commit
-
-
SparkSnail authored
-
- 30 Dec, 2019 1 commit
-
-
SparkSnail authored
-
- 23 Dec, 2019 1 commit
-
-
SparkSnail authored
-
- 25 Nov, 2019 1 commit
-
-
liuzhe-lz authored
-
- 11 Nov, 2019 1 commit
-
-
Yuge Zhang authored
* show experiment name in nnictl list * remove author name in metadata
-
- 04 Nov, 2019 1 commit
-
-
chicm-ms authored
Fix pylint errors
-
- 26 Sep, 2019 1 commit
-
-
SparkSnail authored
-
- 20 Sep, 2019 1 commit
-
-
QuanluZhang authored
* support specifying gpu for tuner and advisor
-
- 14 Aug, 2019 1 commit
-
-
Guoxin authored
* squash commits in v1.0 first round bug bash
-
- 12 Aug, 2019 1 commit
-
-
suiguoxin authored
-
- 02 Aug, 2019 1 commit
-
-
SparkSnail authored
-
- 25 Jun, 2019 3 commits
-
-
Zejun Lin authored
* fix a bug * fix a bug
-
Zejun Lin authored
fix bug for PR #1201
-
Zejun Lin authored
* update * dev-tf-master * fix bugs * fix bugs * remove unnecessary lines * dev oneshot and darts * dev nas * dev enas and oneshot * dev enas and oneshot * dev enas and oneshot * dev oneshot and enas * dev oneshot * add ut * add docstring * add docstring * fix * resolve comments by changing docstring * resolve comments
-
- 21 Jun, 2019 1 commit
-
-
SparkSnail authored
-
- 20 Jun, 2019 1 commit
-
-
demianzhang authored
* fix local and remote training services tslint
-
- 19 Jun, 2019 1 commit
-
-
Hongarc authored
-
- 18 Jun, 2019 1 commit
-
-
SparkSnail authored
-
- 17 Jun, 2019 1 commit
-
-
demianzhang authored
* remove check powershell policy * remove policy dependency * update
-
- 30 May, 2019 1 commit
-
-
Mohit Anand authored
-
- 28 May, 2019 2 commits
-
-
SparkSnail authored
-
demianzhang authored
-
- 22 May, 2019 1 commit
-
-
SparkSnail authored
-
- 24 Apr, 2019 1 commit
-
-
demianzhang authored
-
- 22 Apr, 2019 1 commit
-
-
demianzhang authored
-
- 19 Apr, 2019 1 commit
-
-
SparkSnail authored
Fix issue #890
-
- 18 Apr, 2019 1 commit
-
-
chicm-ms authored
* Refactoring local training service * Designated GPU for local training service * RemoteMachine designated GPU configuration
-
- 29 Mar, 2019 1 commit
-
-
SparkSnail authored
-
- 22 Mar, 2019 1 commit
-
-
SparkSnail authored
If user set remoteloggingType in config file, log content will not be transmitted from trialkeeper
-
- 21 Mar, 2019 1 commit
-
-
SparkSnail authored
In nnictl, we support debug mode from config file and --debug. If users does not set debug: true in config, nnictl will use --debug value.
-
- 18 Mar, 2019 1 commit
-
-
SparkSnail authored
-
- 15 Mar, 2019 1 commit
-
-
SparkSnail authored
check nni version in trialkeeper, to make sure the version of trialkeeper is consistent with trainingService add a debug mode in config file
-
- 25 Feb, 2019 1 commit
-
-
SparkSnail authored
* add trialkeeper_stdout and trialkeeper_stderr * fix nnictl set remote nniManagerIP
-
- 29 Jan, 2019 1 commit
-
-
SparkSnail authored
* fix remote bug * add document * add document * update * update * update * update * fix remote issue * fix forEach * update doc according to comments * update * update * update * remove 'any more' * add base version for remote-log * change launcher.py * test * basic version * debug * debug * basic work version * fix code * update disable_log * remove unused line * add diable log in kubernetesTrainingService * add detect frameworkcontroller * fix comment * update * update * fix kubernetesData * debug * debug * debug * fix comment * fix conflict * remove local temp files * revert launcher.py * update code by comments * remove disableLog * remove disable Log * set timeout for cleanup * fix code by comments * update variable names * add comments * add delay function * update * update * update by comments * add in remote script path * rename variables * update variable name * add mkdir -p for subfolder
-