- 17 Dec, 2018 1 commit
-
-
fishyds authored
* [PAI training service] codeDir files upload improvement * Create full local temp folder * Organize the folder structure for experiment and trial files
-
- 29 Nov, 2018 1 commit
-
-
fishyds authored
* [Trial keeper refactor] refactor trial keeper stdout output
-
- 20 Nov, 2018 1 commit
-
-
fishyds authored
* Kubeflow TrainingService support, v1 (#373) 1. Create new Training Service: kubeflow trainning service, use 'kubectl' and kubeflow tfjobs CRD to submit and manage jobs 2. Update nni python SDK to support new kubeflow platform 3. Update nni python SDK's get_sequende_id() implementation, read NNI_TRIAL_SEQ_ID env variable, instead of reading .nni/sequence_id file 4. This version only supports Tensorflow operator. Will add more operators' support in future versions
-
- 12 Nov, 2018 1 commit
-
-
fishyds authored
* Change base image from devel to runtime, to reduce docker image size * Support running multiple experiment for PAI * Fix a bug regarding to recuisively reference between paiRestServer and paiTrainingService
-
- 05 Nov, 2018 1 commit
-
-
Gems Guo authored
-