Commit 485245a0 authored by Yuge Zhang, committed by Guoxin

Implement lower bound and clarify docs for randint (#1435)

parent 0f22aaaf
......@@ -10,11 +10,11 @@ To define a search space, users should define the name of variable, the type of
```yaml
{
"dropout_rate":{"_type":"uniform","_value":[0.1,0.5]},
"conv_size":{"_type":"choice","_value":[2,3,5,7]},
"hidden_size":{"_type":"choice","_value":[124, 512, 1024]},
"batch_size":{"_type":"choice","_value":[50, 250, 500]},
"learning_rate":{"_type":"uniform","_value":[0.0001, 0.1]}
"dropout_rate": {"_type": "uniform", "_value": [0.1, 0.5]},
"conv_size": {"_type": "choice", "_value": [2, 3, 5, 7]},
"hidden_size": {"_type": "choice", "_value": [124, 512, 1024]},
"batch_size": {"_type": "choice", "_value": [50, 250, 500]},
"learning_rate": {"_type": "uniform", "_value": [0.0001, 0.1]}
}
```
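
For orientation, a trial script consumes one point sampled from this space through NNI's standard trial API. A minimal sketch (the training step is a placeholder):

```python
import nni

# Fetch one sampled configuration from the tuner, keyed by the
# variable names defined in the search space above.
params = nni.get_next_parameter()        # e.g. {"dropout_rate": 0.32, "conv_size": 5, ...}
dropout_rate = params["dropout_rate"]    # float drawn from uniform(0.1, 0.5)
conv_size = params["conv_size"]          # one of 2, 3, 5, 7
accuracy = 0.9                           # placeholder for a real train/evaluate step
nni.report_final_result(accuracy)        # report the final metric back to the tuner
```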
......@@ -25,55 +25,54 @@ Take the first line as an example. `dropout_rate` is defined as a variable whose
All types of sampling strategies and their parameters are listed here:
* {"_type":"choice","_value":options}
* `{"_type": "choice", "_value": options}`
* Which means the variable's value is one of the options. Here 'options' should be a list. Each element of options is a number of string. It could also be a nested sub-search-space, this sub-search-space takes effect only when the corresponding element is chosen. The variables in this sub-search-space could be seen as conditional variables.
* An simple [example](https://github.com/microsoft/nni/tree/master/examples/trials/mnist-nested-search-space/search_space.json) of [nested] search space definition. If an element in the options list is a dict, it is a sub-search-space, and for our built-in tuners you have to add a key '_name' in this dict, which helps you to identify which element is chosen. Accordingly, here is a [sample](https://github.com/microsoft/nni/tree/master/examples/trials/mnist-nested-search-space/sample.json) which users can get from nni with nested search space definition. Tuners which support nested search space is as follows:
* An simple [example](https://github.com/microsoft/nni/tree/master/examples/trials/mnist-nested-search-space/search_space.json) of [nested] search space definition. If an element in the options list is a dict, it is a sub-search-space, and for our built-in tuners you have to add a key `_name` in this dict, which helps you to identify which element is chosen. Accordingly, here is a [sample](https://github.com/microsoft/nni/tree/master/examples/trials/mnist-nested-search-space/sample.json) which users can get from nni with nested search space definition. Tuners which support nested search space is as follows:
- Random Search
- TPE
- Anneal
- Evolution
* {"_type":"randint","_value":[lower, upper]}
* `{"_type": "randint", "_value": [lower, upper]}`
* Choosing a random integer from `lower` (inclusive) to `upper` (exclusive).
* Note: Different tuners may interpret `randint` differently. Some (e.g., TPE, GridSearch) treat integers from lower
to upper as unordered ones, while others respect the ordering (e.g., SMAC). If you want all the tuners to respect
the ordering, please use `quniform` with `q=1`.
* For now, we implement the "randint" distribution with "quniform", which means the variable value is a value like round(uniform(lower, upper)). The type of chosen value is float. If you want to use integer value, please convert it explicitly.
* {"_type":"uniform","_value":[low, high]}
* `{"_type": "uniform", "_value": [low, high]}`
* Which means the variable value is a value uniformly between low and high.
* When optimizing, this variable is constrained to a two-sided interval.
* {"_type":"quniform","_value":[low, high, q]}
* Which means the variable value is a value like clip(round(uniform(low, high) / q) * q, low, high), where the clip operation is used to constraint the generated value in the bound. For example, for _value specified as [0, 10, 2.5], possible values are [0, 2.5, 5.0, 7.5, 10.0]; For _value specified as [2, 10, 5], possible values are [2, 5, 10].
* `{"_type": "quniform", "_value": [low, high, q]}`
* Which means the variable value is a value like `clip(round(uniform(low, high) / q) * q, low, high)`, where the clip operation is used to constraint the generated value in the bound. For example, for `_value` specified as [0, 10, 2.5], possible values are [0, 2.5, 5.0, 7.5, 10.0]; For `_value` specified as [2, 10, 5], possible values are [2, 5, 10].
* Suitable for a discrete value with respect to which the objective is still somewhat "smooth", but which should be bounded both above and below. If you want to uniformly choose integer from a range [low, high], you can write `_value` like this: `[low, high, 1]`.
* {"_type":"loguniform","_value":[low, high]}
* `{"_type": "loguniform", "_value": [low, high]}`
* Which means the variable value is a value drawn from a range [low, high] according to a loguniform distribution like exp(uniform(log(low), log(high))), so that the logarithm of the return value is uniformly distributed.
* When optimizing, this variable is constrained to be positive.
* {"_type":"qloguniform","_value":[low, high, q]}
* Which means the variable value is a value like clip(round(loguniform(low, high) / q) * q, low, high), where the clip operation is used to constraint the generated value in the bound.
* `{"_type": "qloguniform", "_value": [low, high, q]}`
* Which means the variable value is a value like `clip(round(loguniform(low, high) / q) * q, low, high)`, where the clip operation is used to constraint the generated value in the bound.
* Suitable for a discrete variable with respect to which the objective is "smooth" and gets smoother with the size of the value, but which should be bounded both above and below.
* {"_type":"normal","_value":[mu, sigma]}
* `{"_type": "normal", "_value": [mu, sigma]}`
* Which means the variable value is a real value that's normally-distributed with mean mu and standard deviation sigma. When optimizing, this is an unconstrained variable.
* {"_type":"qnormal","_value":[mu, sigma, q]}
* Which means the variable value is a value like round(normal(mu, sigma) / q) * q
* `{"_type": "qnormal", "_value": [mu, sigma, q]}`
* Which means the variable value is a value like `round(normal(mu, sigma) / q) * q`
* Suitable for a discrete variable that probably takes a value around mu, but is fundamentally unbounded.
* {"_type":"lognormal","_value":[mu, sigma]}
* Which means the variable value is a value drawn according to exp(normal(mu, sigma)) so that the logarithm of the return value is normally distributed. When optimizing, this variable is constrained to be positive.
* `{"_type": "lognormal", "_value": [mu, sigma]}`
* Which means the variable value is a value drawn according to `exp(normal(mu, sigma))` so that the logarithm of the return value is normally distributed. When optimizing, this variable is constrained to be positive.
* {"_type":"qlognormal","_value":[mu, sigma, q]}
* Which means the variable value is a value like round(exp(normal(mu, sigma)) / q) * q
* `{"_type": "qlognormal", "_value": [mu, sigma, q]}`
* Which means the variable value is a value like `round(exp(normal(mu, sigma)) / q) * q`
* Suitable for a discrete variable with respect to which the objective is smooth and gets smoother with the size of the variable, which is bounded from one side.
* {"_type":"mutable_layer","_value":{mutable_layer_infomation}}
* `{"_type": "mutable_layer", "_value": {mutable_layer_infomation}}`
* Type for [Neural Architecture Search Space][1]. Value is also a dictionary, which contains key-value pairs representing respectively name and search space of each mutable_layer.
* For now, users can only use this type of search space with annotation, which means that there is no need to define a json file for search space since it will be automatically generated according to the annotation in trial code.
* For detailed usage, please refer to [General NAS Interfaces][1].
......
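
As referenced in the `randint` note above, here is a sketch contrasting `randint` with the ordering-preserving `quniform` alternative; both definitions below cover the integers 2 through 9 (the spec's JSON written as Python dicts):

```python
# `randint` samples from the half-open range [lower, upper); `quniform` with
# q=1 rounds uniform(low, high) with both bounds inclusive, and all tuners
# treat its values as ordered.
randint_space  = {"x": {"_type": "randint",  "_value": [2, 10]}}    # 2, 3, ..., 9
quniform_space = {"x": {"_type": "quniform", "_value": [2, 9, 1]}}  # 2, 3, ..., 9, ordered
```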
......@@ -31,7 +31,7 @@ import ConfigSpace.hyperparameters as CSH
from nni.protocol import CommandType, send
from nni.msg_dispatcher_base import MsgDispatcherBase
-from nni.utils import OptimizeMode, MetricType, extract_scalar_reward, randint_to_quniform
+from nni.utils import OptimizeMode, MetricType, extract_scalar_reward
from nni.common import multi_phase_enabled
from .config_generator import CG_BOHB
......@@ -467,7 +467,6 @@ class BOHB(MsgDispatcherBase):
search space of this experiment
"""
search_space = data
-randint_to_quniform(search_space)
cs = CS.ConfigurationSpace()
for var in search_space:
_type = str(search_space[var]["_type"])
......@@ -476,7 +475,7 @@ class BOHB(MsgDispatcherBase):
var, choices=search_space[var]["_value"]))
elif _type == 'randint':
cs.add_hyperparameter(CSH.UniformIntegerHyperparameter(
-var, lower=0, upper=search_space[var]["_value"][0]))
+var, lower=search_space[var]["_value"][0], upper=search_space[var]["_value"][1] - 1))
elif _type == 'uniform':
cs.add_hyperparameter(CSH.UniformFloatHyperparameter(
var, lower=search_space[var]["_value"][0], upper=search_space[var]["_value"][1]))
......
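
The BOHB change above maps NNI's half-open `randint` range onto ConfigSpace's inclusive integer bounds, hence the `upper - 1`. A standalone sketch of the same conversion (assumes the ConfigSpace package; the hyperparameter name is made up):

```python
import ConfigSpace as CS
import ConfigSpace.hyperparameters as CSH

lower, upper = 2, 10                    # NNI randint: integers in [2, 10)
cs = CS.ConfigurationSpace()
cs.add_hyperparameter(CSH.UniformIntegerHyperparameter(
    'conv_size', lower=lower, upper=upper - 1))  # ConfigSpace: inclusive [2, 9]
print(cs.sample_configuration())        # every sample stays below 10
```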
......@@ -26,7 +26,7 @@ import random
import numpy as np
from nni.tuner import Tuner
-from nni.utils import NodeType, OptimizeMode, extract_scalar_reward, split_index, randint_to_quniform
+from nni.utils import NodeType, OptimizeMode, extract_scalar_reward, split_index
import nni.parameter_expressions as parameter_expressions
......@@ -175,7 +175,6 @@ class EvolutionTuner(Tuner):
search_space : dict
"""
self.searchspace_json = search_space
-randint_to_quniform(self.searchspace_json)
self.space = json2space(self.searchspace_json)
self.random_state = np.random.RandomState()
......
......@@ -31,7 +31,7 @@ import json_tricks
from nni.protocol import CommandType, send
from nni.msg_dispatcher_base import MsgDispatcherBase
from nni.common import init_logger, multi_phase_enabled
-from nni.utils import NodeType, OptimizeMode, MetricType, extract_scalar_reward, randint_to_quniform
+from nni.utils import NodeType, OptimizeMode, MetricType, extract_scalar_reward
import nni.parameter_expressions as parameter_expressions
_logger = logging.getLogger(__name__)
......@@ -358,7 +358,6 @@ class Hyperband(MsgDispatcherBase):
number of trial jobs
"""
self.searchspace_json = data
-randint_to_quniform(self.searchspace_json)
self.random_state = np.random.RandomState()
def _handle_trial_end(self, parameter_id):
......
......@@ -27,7 +27,7 @@ import logging
import hyperopt as hp
import numpy as np
from nni.tuner import Tuner
-from nni.utils import NodeType, OptimizeMode, extract_scalar_reward, split_index, randint_to_quniform
+from nni.utils import NodeType, OptimizeMode, extract_scalar_reward, split_index
logger = logging.getLogger('hyperopt_AutoML')
......@@ -51,6 +51,8 @@ def json2space(in_x, name=NodeType.ROOT):
_value = json2space(in_x[NodeType.VALUE], name=name)
if _type == 'choice':
out_y = eval('hp.hp.choice')(name, _value)
+elif _type == 'randint':
+    out_y = hp.hp.randint(name, _value[1] - _value[0])
else:
if _type in ['loguniform', 'qloguniform']:
_value[:2] = np.log(_value[:2])
......@@ -93,6 +95,8 @@ def json2parameter(in_x, parameter, name=NodeType.ROOT):
else:
if _type in ['quniform', 'qloguniform']:
out_y = np.clip(parameter[name], in_x[NodeType.VALUE][0], in_x[NodeType.VALUE][1])
+elif _type == 'randint':
+    out_y = parameter[name] + in_x[NodeType.VALUE][0]
else:
out_y = parameter[name]
else:
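
The two hyperopt hunks above implement `randint` as an offset: the search space draws from `hp.randint(name, upper - lower)`, which yields values from 0 to upper-lower-1, and decoding adds `lower` back. A standalone illustration (the variable name is made up):

```python
import hyperopt as hp
from hyperopt.pyll.stochastic import sample

lower, upper = 2, 10
space = hp.hp.randint('x', upper - lower)   # raw draws fall in 0..7
raw = sample(space)                         # one draw, outside any tuner
value = raw + lower                         # shift into the intended [2, 10)
assert lower <= value < upper
```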
......@@ -247,7 +251,6 @@ class HyperoptTuner(Tuner):
search_space : dict
"""
self.json = search_space
-randint_to_quniform(self.json)
search_space_instance = json2space(self.json)
rstate = np.random.RandomState()
......@@ -279,7 +282,7 @@ class HyperoptTuner(Tuner):
total_params = self.get_suggestion(random_search=False)
# avoid generating same parameter with concurrent trials because hyperopt doesn't support parallel mode
if total_params in self.total_data.values():
-# but it can cause deplicate parameter rarely
+# but it can cause duplicate parameter rarely
total_params = self.get_suggestion(random_search=True)
self.total_data[parameter_id] = total_params
......
......@@ -25,7 +25,7 @@ from unittest import TestCase, main
import hyperopt as hp
-from nni.hyperopt_tuner.hyperopt_tuner import json2space, json2parameter, json2vals
+from nni.hyperopt_tuner.hyperopt_tuner import json2space, json2parameter, json2vals, HyperoptTuner
class HyperoptTunerTestCase(TestCase):
......@@ -99,6 +99,29 @@ class HyperoptTunerTestCase(TestCase):
self.assertEqual(out_y["root[optimizer]-choice"], 0)
self.assertEqual(out_y["root[learning_rate]-choice"], 1)
+def test_tuner_generate(self):
+    for algorithm in ["tpe", "random_search", "anneal"]:
+        tuner = HyperoptTuner(algorithm)
+        choice_list = ["a", "b", 1, 2]
+        tuner.update_search_space({
+            "a": {
+                "_type": "randint",
+                "_value": [1, 3]
+            },
+            "b": {
+                "_type": "choice",
+                "_value": choice_list
+            }
+        })
+        for k in range(30):
+            # sample multiple times
+            param = tuner.generate_parameters(k)
+            print(param)
+            self.assertIsInstance(param["a"], int)
+            self.assertGreaterEqual(param["a"], 1)
+            self.assertLessEqual(param["a"], 2)
+            self.assertIn(param["b"], choice_list)
if __name__ == '__main__':
main()
......@@ -32,12 +32,14 @@ def choice(options, random_state):
return random_state.choice(options)
-def randint(upper, random_state):
+def randint(lower, upper, random_state):
    '''
+    Generate a random integer from `lower` (inclusive) to `upper` (exclusive).
+    lower: an int that represents the lower bound
    upper: an int that represents the upper bound
    random_state: an object of numpy.random.RandomState
    '''
-    return random_state.randint(upper)
+    return random_state.randint(lower, upper)
def uniform(low, high, random_state):
......
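
The new signature delegates directly to NumPy, whose `RandomState.randint(low, high)` already samples the half-open range [low, high); a quick check:

```python
import numpy as np

rs = np.random.RandomState(0)                    # seeded only to make the sketch reproducible
samples = [rs.randint(2, 10) for _ in range(1000)]
assert min(samples) >= 2 and max(samples) <= 9   # the upper bound 10 is never drawn
```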
......@@ -88,10 +88,10 @@ def generate_pcs(nni_search_space_content):
raise RuntimeError('%s has already existed, please make sure search space has no duplicate key.' % key)
categorical_dict[key] = search_space[key]['_value']
elif search_space[key]['_type'] == 'randint':
-# TODO: support lower bound in randint
-pcs_fd.write('%s integer [0, %d] [%d]\n' % (
+pcs_fd.write('%s integer [%d, %d] [%d]\n' % (
    key,
    search_space[key]['_value'][0],
+    search_space[key]['_value'][1] - 1,
    search_space[key]['_value'][0]))
elif search_space[key]['_type'] == 'uniform':
pcs_fd.write('%s real %s [%s]\n' % (
......
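
SMAC's pcs format takes inclusive integer bounds plus a default value, so the write above turns a `randint` range [2, 10) into the line `x integer [2, 9] [2]`. A sketch of the formatting in isolation (key and values are made up):

```python
key, value = 'x', [2, 10]                     # hypothetical randint entry
line = '%s integer [%d, %d] [%d]\n' % (key, value[0], value[1] - 1, value[0])
print(line, end='')                           # -> x integer [2, 9] [2]
```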
......@@ -38,7 +38,7 @@ from ConfigSpaceNNI import Configuration
from .convert_ss_to_scenario import generate_scenario
from nni.tuner import Tuner
-from nni.utils import OptimizeMode, extract_scalar_reward, randint_to_quniform
+from nni.utils import OptimizeMode, extract_scalar_reward
class SMACTuner(Tuner):
......@@ -139,7 +139,6 @@ class SMACTuner(Tuner):
search_space:
search space
"""
-randint_to_quniform(search_space)
if not self.update_ss_done:
self.categorical_dict = generate_scenario(search_space)
if self.categorical_dict is None:
......
......@@ -19,11 +19,11 @@
# ==================================================================================================
-import random
import numpy as np
from .env_vars import trial_env_vars
from . import trial
+from . import parameter_expressions as param_exp
from .nas_utils import classic_mode, enas_mode, oneshot_mode, darts_mode
......@@ -47,39 +47,39 @@ __all__ = [
if trial_env_vars.NNI_PLATFORM is None:
    def choice(*options, name=None):
-        return random.choice(options)
+        return param_exp.choice(options, np.random.RandomState())

-    def randint(upper, name=None):
-        return random.randrange(upper)
+    def randint(lower, upper, name=None):
+        return param_exp.randint(lower, upper, np.random.RandomState())

    def uniform(low, high, name=None):
-        return random.uniform(low, high)
+        return param_exp.uniform(low, high, np.random.RandomState())

    def quniform(low, high, q, name=None):
-        assert high > low, 'Upper bound must be larger than lower bound'
-        return np.clip(round(random.uniform(low, high) / q) * q, low, high)
+        return param_exp.quniform(low, high, q, np.random.RandomState())

    def loguniform(low, high, name=None):
-        assert low > 0, 'Lower bound must be positive'
-        return np.exp(random.uniform(np.log(low), np.log(high)))
+        return param_exp.loguniform(low, high, np.random.RandomState())

    def qloguniform(low, high, q, name=None):
-        return np.clip(round(loguniform(low, high) / q) * q, low, high)
+        return param_exp.qloguniform(low, high, q, np.random.RandomState())

    def normal(mu, sigma, name=None):
-        return random.gauss(mu, sigma)
+        return param_exp.normal(mu, sigma, np.random.RandomState())

    def qnormal(mu, sigma, q, name=None):
-        return round(random.gauss(mu, sigma) / q) * q
+        return param_exp.qnormal(mu, sigma, q, np.random.RandomState())

    def lognormal(mu, sigma, name=None):
-        return np.exp(random.gauss(mu, sigma))
+        return param_exp.lognormal(mu, sigma, np.random.RandomState())

    def qlognormal(mu, sigma, q, name=None):
-        return round(lognormal(mu, sigma) / q) * q
+        return param_exp.qlognormal(mu, sigma, q, np.random.RandomState())

    def function_choice(*funcs, name=None):
-        return random.choice(funcs)()
+        return param_exp.choice(funcs, np.random.RandomState())()
def mutable_layer():
raise RuntimeError('Cannot call nni.mutable_layer in this mode')
......@@ -89,7 +89,7 @@ else:
def choice(options, name=None, key=None):
return options[_get_param(key)]
-def randint(upper, name=None, key=None):
+def randint(lower, upper, name=None, key=None):
return _get_param(key)
def uniform(low, high, name=None, key=None):
......
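
The smartparam changes above give `randint` the same two code paths as the other expressions: standalone runs (no `NNI_PLATFORM` set) sample locally through `parameter_expressions`, while platform runs fetch the tuner's choice by key. Assuming these helpers are re-exported from the `nni` package as in annotated trials, standalone usage might look like:

```python
import nni

value = nni.randint(2, 10)    # sampled locally when NNI_PLATFORM is unset
assert 2 <= value < 10        # half-open range: 10 is excluded
```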
......@@ -111,23 +111,3 @@ def init_dispatcher_logger():
if dispatcher_env_vars.NNI_LOG_DIRECTORY is not None:
logger_file_path = os.path.join(dispatcher_env_vars.NNI_LOG_DIRECTORY, logger_file_path)
init_logger(logger_file_path, dispatcher_env_vars.NNI_LOG_LEVEL)
-def randint_to_quniform(in_x):
-    if isinstance(in_x, dict):
-        if NodeType.TYPE in in_x.keys():
-            if in_x[NodeType.TYPE] == 'randint':
-                value = in_x[NodeType.VALUE]
-                value.append(1)
-                in_x[NodeType.TYPE] = 'quniform'
-                in_x[NodeType.VALUE] = value
-            elif in_x[NodeType.TYPE] == 'choice':
-                randint_to_quniform(in_x[NodeType.VALUE])
-        else:
-            for key in in_x.keys():
-                randint_to_quniform(in_x[key])
-    elif isinstance(in_x, list):
-        for temp in in_x:
-            randint_to_quniform(temp)