- 19 Apr, 2019 4 commits
-
-
Zejun Lin authored
-
Lee authored
* Revert to original `nnictl log trial` and `nnictl trial kill` * Change print() to print_normal()
-
SparkSnail authored
Fix issue #890
-
SparkSnail authored
Advisor and assessor could not be used simultaneously. Refactor config file error message. Support scientific notation, like 1e-2
-
- 18 Apr, 2019 1 commit
-
-
chicm-ms authored
* Refactoring local training service * Designated GPU for local training service * RemoteMachine designated GPU configuration
-
- 12 Apr, 2019 1 commit
-
-
Shufan Huang authored
add BOHB Advisor
-
- 10 Apr, 2019 1 commit
-
-
Yan Ni authored
add trial id to csv and remove pandas
-
- 09 Apr, 2019 1 commit
-
-
Yan Ni authored
* export to csv core code done * add nnictl trial export * fix bug * add dependency for pandas * remove pandas from dependency * add multi phase support * add document
-
- 29 Mar, 2019 1 commit
-
-
SparkSnail authored
-
- 27 Mar, 2019 2 commits
-
-
xuehui authored
* update readme in ga_squad * update readme * fix typo * Update README.md * Update README.md * Update README.md * update readme * update * fix path * update reference * fix bug in config file * update nni_arch_overview.png * update * update * update * update home page * fix issue: #902, random parameter * update doc about random * change Random tuner test * update metric_test for Random tuner * update random config in test * update Evolution_SQuAD doc
-
SparkSnail authored
-
- 26 Mar, 2019 1 commit
-
-
Lee authored
* add different tuner config files for config_test * change MetisTuner config test due to no lightgbm python module in integration test * install smac package in azure-pipelines * SMAC need swig to be installed * Try to install swig from source code * remove SMAC test because the dependency can not be installed * use sudo to install the swig * sleep 10s to make sure the port has been released * remove tuner test for networkmorphism because it uses more than 30s to release the tcp port * word "down" to "done" * add config test for Curvefitting assessor * change file name * Fix data type not match bug * Optimize MetisTuner * pretty the code * Follow the review comment * add exploration probability * Avoid None type object generating * fix nnictl log trial bug * rollback chinese doc * add argument 'experiment' to parser_log_trial and parser_trial_kill * update doc
-
- 22 Mar, 2019 3 commits
-
-
chicm-ms authored
1. Route tuner and assessor commands to 2 separate queues issue #841 2. Allow tuner to leverage intermediate result when trial is early stopped. issue #843
-
SparkSnail authored
If the user sets remoteloggingType in the config file, log content will not be transmitted from trialkeeper
-
SparkSnail authored
Installing from source code generates a version string like 'v0.5.2-gews11f'. The current trialKeeper uses an exact version match, so this version string breaks the msranni/nni image, because our official image uses a clean numeric version. Change the logic to a fuzzy match that only matches the main version number of nni.
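A minimal sketch of the fuzzy match described above, not the trialKeeper code itself. The function names `main_version` and `versions_compatible` are hypothetical, and the sketch assumes "main number" means the leading dotted numeric part of the version string.

```python
import re


def main_version(version: str) -> str:
    """Return the leading dotted-number part of a version string.

    Hypothetical helper: assumes the "main number" is everything
    numeric before the first suffix such as '-gews11f'.
    """
    match = re.match(r"v?(\d+(?:\.\d+)*)", version.strip())
    if match is None:
        raise ValueError(f"no version number found in {version!r}")
    return match.group(1)


def versions_compatible(left: str, right: str) -> bool:
    """Fuzzy match: compare only the main version numbers."""
    return main_version(left) == main_version(right)
```

With this sketch, `versions_compatible("v0.5.2-gews11f", "v0.5.2")` is true, while an exact string comparison of the same two values would fail.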
-
- 21 Mar, 2019 1 commit
-
-
SparkSnail authored
In nnictl, we support debug mode from the config file and from --debug. If the user does not set debug: true in the config, nnictl will use the --debug value.
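The precedence described above can be sketched as follows. This is an illustration under stated assumptions, not nnictl's code: `resolve_debug` is a hypothetical name, and the config is assumed to be an already-parsed dict.

```python
def resolve_debug(config: dict, cli_debug: bool) -> bool:
    """Decide whether debug mode is on.

    Hypothetical helper: `debug: true` in the config wins;
    otherwise the value of the --debug flag is used.
    """
    if config.get("debug") is True:
        return True
    return cli_debug
```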
-
- 20 Mar, 2019 1 commit
-
-
SparkSnail authored
-
- 18 Mar, 2019 1 commit
-
-
SparkSnail authored
-
- 15 Mar, 2019 1 commit
-
-
SparkSnail authored
Check the nni version in trialkeeper to make sure the version of trialkeeper is consistent with trainingService. Add a debug mode in the config file.
-
- 07 Mar, 2019 1 commit
-
-
Zejun Lin authored
* fix annotation, resolve annotation's key err bug, refactor the design
-
- 06 Mar, 2019 2 commits
- 04 Mar, 2019 1 commit
-
-
ShufanHuang authored
* update link
-
- 26 Feb, 2019 1 commit
-
-
Chi Song authored
-
- 25 Feb, 2019 4 commits
-
-
ShufanHuang authored
Improve the performance of the curve fitting assessor
-
SparkSnail authored
Add integration test for remoteTrainingService platform
-
SparkSnail authored
trial_keeper uses port 50070 to connect to the webhdfs server, and PAI maps port 50070 to port 5070 to reach the restful server. This is risky, because PAI may not support this kind of mapping in later releases. Now use the Pylon path (/webhdfs/api/v1) instead of port 50070 in the webhdfs client of trial_keeper; the path is transmitted in trainingService. In this PR, we have these changes: 1. Change to use the webhdfs path instead of port 50070 in the hdfs client. 2. Change to use the new hdfs package "PythonWebHDFS", which I built to support Pylon. You can use the "sparksnail/nni:dev-pai" image to test the new function with the PAI trainingService. 3. Update some variable names according to comments.
-
SparkSnail authored
* add trialkeeper_stdout and trialkeeper_stderr * fix nnictl set remote nniManagerIP
-
- 24 Feb, 2019 1 commit
-
-
Lee authored
Add integration test cases for tuners.
-
- 18 Feb, 2019 1 commit
-
-
Yan Ni authored
-
- 12 Feb, 2019 1 commit
-
-
Zejun Lin authored
fix bug in annotation: the code generator cannot deal with a dict as the left value of an assignment expression.
-
- 29 Jan, 2019 1 commit
-
-
SparkSnail authored
* fix remote bug * add document * add document * update * update * update * update * fix remote issue * fix forEach * update doc according to comments * update * update * update * remove 'any more' * add base version for remote-log * change launcher.py * test * basic version * debug * debug * basic work version * fix code * update disable_log * remove unused line * add disable log in kubernetesTrainingService * add detect frameworkcontroller * fix comment * update * update * fix kubernetesData * debug * debug * debug * fix comment * fix conflict * remove local temp files * revert launcher.py * update code by comments * remove disableLog * remove disable Log * set timeout for cleanup * fix code by comments * update variable names * add comments * add delay function * update * update * update by comments * add in remote script path * rename variables * update variable name * add mkdir -p for subfolder
-
- 24 Jan, 2019 1 commit
-
-
chicm-ms authored
* Pull code (#22) * Support distributed job for frameworkcontroller (#612) support distributed job for frameworkcontroller * Multiphase doc (#519) * multiPhase doc * updates * updates * Add time parser for 'nnictl update duration' (#632) Current nnictl update duration only support seconds unit, add a parser for this command to support {s, m, h, d} * fix experiment state bug (#629) * update top README.md (#622) * Update README.md * update (#634) * Integration tests refactoring (#625) * Integration test refactoring (#21) (#616) * Integration test refactoring (#21) * Refactoring integration tests * test metrics * update azure pipeline * updates * updates * updates * updates * updates * updates * updates * updates * updates * updates * updates * updates * updates * updates * updates * updates * updates * updates * updates * updates * update trigger * Integration test refactoring (#618) * updates * updates * update pipeline (#619) * update pipeline * updates * updates * updates * updates * updates * test pipeline (#623) * test pipeline * updates * updates * updates * Update integration test (#624) * Update integration test * updates * updates * updates * updates * updates * updates * Revert "Pull code (#22)" This reverts commit 62fc165ad7b2ba724eead3b99f010aa34491e2c7. * Configurable nniManager log path * Configure log level * add --debug command line for nnictl * updates
-
- 23 Jan, 2019 1 commit
-
-
Zejun Lin authored
* fix nni-sdk version problem * fix * fix installation * fix installation * fix installation * resolve suggestion * resolve suggestion * resolve suggestion * resolve suggestion * refactor logic * refactor logic * resolve comments
-
- 22 Jan, 2019 1 commit
-
-
SparkSnail authored
* add serviceAccountName * add serviceAccountName for frameworkcontroller
-
- 18 Jan, 2019 1 commit
-
-
SparkSnail authored
Currently `nnictl update duration` only supports seconds as the unit. Add a parser for this command to support the units {s, m, h, d}.
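A minimal sketch of such a unit parser, assuming the input is an integer followed by exactly one of the units named above. `parse_duration` and `_UNIT_SECONDS` are hypothetical names for illustration, not nnictl's own.

```python
import re

# Seconds per unit, for the {s, m, h, d} set named in the commit message.
_UNIT_SECONDS = {"s": 1, "m": 60, "h": 3600, "d": 86400}


def parse_duration(text: str) -> int:
    """Convert a string such as '30m' or '2h' to a number of seconds.

    Hypothetical helper: this sketch requires an explicit unit suffix.
    """
    match = re.fullmatch(r"(\d+)([smhd])", text.strip())
    if match is None:
        raise ValueError(f"unsupported duration: {text!r}")
    value, unit = match.groups()
    return int(value) * _UNIT_SECONDS[unit]
```

Whether a bare number without a unit should still be read as seconds is left open here; the sketch rejects it.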
-
- 11 Jan, 2019 2 commits
-
-
Chi Song authored
Fix document with formats and typos
-
Linjie Xu authored
* add module info for launcher check * update launcher.py Add packages check before start restful server. * check sub-key * modify mistype * delete non-tuner in constants/ModuleName * Delete ModuleName to prevent double maintain * only catch ModuleNotFoundError
-
- 08 Jan, 2019 2 commits
-
-
fishyds authored
* Fix a race condition issue in trial_keeper for reading log from pipe
-
SparkSnail authored
Change the warning level to info level when expanding a relative path. Add nnictl --version log. Update readme.md.
-