"include/vscode:/vscode.git/clone" did not exist on "6cc7d0de6b9f239bf5ebd59c3df9aa7222d4f4fb"
Unverified commit 9d835d9a authored by Yuge Zhang, committed by GitHub

NAS tutorials (#4573)

parent 395f65a0
...@@ -13,6 +13,7 @@
 # documentation root, use os.path.abspath to make it absolute, like shown here.
 #
 import os
+import re
 import subprocess
 import sys
 sys.path.insert(0, os.path.abspath('../..'))
...
...@@ -8,7 +8,8 @@ Tutorials
    :hidden:

    tutorials/nni_experiment
-   tutorials/nas_quick_start_mnist
+   tutorials/hello_nas
+   tutorials/nasbench_as_dataset

 .. ----------------------

...@@ -20,9 +21,17 @@ Tutorials
    :tags: Experiment/HPO

 .. cardlinkitem::
-   :header: Get started with NAS on MNIST
-   :description: bla bla bla bla
-   :link: tutorials/nas_quick_start_mnist.html
+   :header: Hello, NAS!
+   :description: Beginners' NAS tutorial on how to search for neural architectures for the MNIST dataset.
+   :link: tutorials/hello_nas.html
    :image: ../img/thumbnails/overview-30.png
    :background: cyan
    :tags: NAS
+
+.. cardlinkitem::
+   :header: Use NAS Benchmarks as Datasets
+   :description: Query data from popular NAS benchmarks from our preprocessed benchmark database.
+   :link: tutorials/nasbench_as_dataset.html
+   :image: ../img/thumbnails/overview-30.png
+   :background: pink
+   :tags: NAS
{
"cells": [
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"collapsed": false
},
"outputs": [],
"source": [
"%matplotlib inline"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"\n# Hello, NAS!\n\nThis is the 101 tutorial of Neural Architecture Search (NAS) on NNI.\nIn this tutorial, we will search for a neural architecture on MNIST dataset with the help of NAS framework of NNI, i.e., *Retiarii*.\nWe use multi-trial NAS as an example to show how to construct and explore a model space.\n\nThere are mainly three crucial components for a neural architecture search task, namely,\n\n* Model search space that defines a set of models to explore.\n* A proper strategy as the method to explore this model space.\n* A model evaluator that reports the performance of every model in the space.\n\nCurrently, PyTorch is the only supported framework by Retiarii, and we have only tested **PyTorch 1.7 to 1.10**.\nThis tutorial assumes PyTorch context but it should also apply to other frameworks, which is in our future plan.\n\n## Define your Model Space\n\nModel space is defined by users to express a set of models that users want to explore, which contains potentially good-performing models.\nIn this framework, a model space is defined with two parts: a base model and possible mutations on the base model.\n"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"### Define Base Model\n\nDefining a base model is almost the same as defining a PyTorch (or TensorFlow) model.\nUsually, you only need to replace the code ``import torch.nn as nn`` with\n``import nni.retiarii.nn.pytorch as nn`` to use our wrapped PyTorch modules.\n\nBelow is a very simple example of defining a base model.\n\n"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"collapsed": false
},
"outputs": [],
"source": [
"import torch\nimport torch.nn.functional as F\nimport nni.retiarii.nn.pytorch as nn\nfrom nni.retiarii import model_wrapper\n\n\n@model_wrapper # this decorator should be put on the out most\nclass Net(nn.Module):\n def __init__(self):\n super().__init__()\n self.conv1 = nn.Conv2d(1, 32, 3, 1)\n self.conv2 = nn.Conv2d(32, 64, 3, 1)\n self.dropout1 = nn.Dropout(0.25)\n self.dropout2 = nn.Dropout(0.5)\n self.fc1 = nn.Linear(9216, 128)\n self.fc2 = nn.Linear(128, 10)\n\n def forward(self, x):\n x = F.relu(self.conv1(x))\n x = F.max_pool2d(self.conv2(x), 2)\n x = torch.flatten(self.dropout1(x), 1)\n x = self.fc2(self.dropout2(F.relu(self.fc1(x))))\n output = F.log_softmax(x, dim=1)\n return output"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
".. tip:: Always keep in mind that you should use ``import nni.retiarii.nn.pytorch as nn`` and :meth:`nni.retiarii.model_wrapper`.\n Many mistakes are a result of forgetting one of those.\n Also, please use ``torch.nn`` for submodules of ``nn.init``, e.g., ``torch.nn.init`` instead of ``nn.init``.\n\n### Define Model Mutations\n\nA base model is only one concrete model not a model space. We provide :doc:`API and Primitives </NAS/MutationPrimitives>`\nfor users to express how the base model can be mutated. That is, to build a model space which includes many models.\n\nBased on the above base model, we can define a model space as below.\n\n.. code-block:: diff\n\n @model_wrapper\n class Net(nn.Module):\n def __init__(self):\n super().__init__()\n self.conv1 = nn.Conv2d(1, 32, 3, 1)\n - self.conv2 = nn.Conv2d(32, 64, 3, 1)\n + self.conv2 = nn.LayerChoice([\n + nn.Conv2d(32, 64, 3, 1),\n + DepthwiseSeparableConv(32, 64)\n + ])\n - self.dropout1 = nn.Dropout(0.25)\n + self.dropout1 = nn.Dropout(nn.ValueChoice([0.25, 0.5, 0.75]))\n self.dropout2 = nn.Dropout(0.5)\n - self.fc1 = nn.Linear(9216, 128)\n - self.fc2 = nn.Linear(128, 10)\n + feature = nn.ValueChoice([64, 128, 256])\n + self.fc1 = nn.Linear(9216, feature)\n + self.fc2 = nn.Linear(feature, 10)\n\n def forward(self, x):\n x = F.relu(self.conv1(x))\n x = F.max_pool2d(self.conv2(x), 2)\n x = torch.flatten(self.dropout1(x), 1)\n x = self.fc2(self.dropout2(F.relu(self.fc1(x))))\n output = F.log_softmax(x, dim=1)\n return output\n\nThis results in the following code:\n\n"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"collapsed": false
},
"outputs": [],
"source": [
"class DepthwiseSeparableConv(nn.Module):\n def __init__(self, in_ch, out_ch):\n super().__init__()\n self.depthwise = nn.Conv2d(in_ch, in_ch, kernel_size=3, groups=in_ch)\n self.pointwise = nn.Conv2d(in_ch, out_ch, kernel_size=1)\n\n def forward(self, x):\n return self.pointwise(self.depthwise(x))\n\n\n@model_wrapper\nclass ModelSpace(nn.Module):\n def __init__(self):\n super().__init__()\n self.conv1 = nn.Conv2d(1, 32, 3, 1)\n # LayerChoice is used to select a layer between Conv2d and DwConv.\n self.conv2 = nn.LayerChoice([\n nn.Conv2d(32, 64, 3, 1),\n DepthwiseSeparableConv(32, 64)\n ])\n # ValueChoice is used to select a dropout rate.\n # ValueChoice can be used as parameter of modules wrapped in `nni.retiarii.nn.pytorch`\n # or customized modules wrapped with `@basic_unit`.\n self.dropout1 = nn.Dropout(nn.ValueChoice([0.25, 0.5, 0.75])) # choose dropout rate from 0.25, 0.5 and 0.75\n self.dropout2 = nn.Dropout(0.5)\n feature = nn.ValueChoice([64, 128, 256])\n self.fc1 = nn.Linear(9216, feature)\n self.fc2 = nn.Linear(feature, 10)\n\n def forward(self, x):\n x = F.relu(self.conv1(x))\n x = F.max_pool2d(self.conv2(x), 2)\n x = torch.flatten(self.dropout1(x), 1)\n x = self.fc2(self.dropout2(F.relu(self.fc1(x))))\n output = F.log_softmax(x, dim=1)\n return output\n\n\nmodel_space = ModelSpace()\nmodel_space"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"This example uses two mutation APIs, ``nn.LayerChoice`` and ``nn.ValueChoice``.\n``nn.LayerChoice`` takes a list of candidate modules (two in this example), one will be chosen for each sampled model.\nIt can be used like normal PyTorch module.\n``nn.ValueChoice`` takes a list of candidate values, one will be chosen to take effect for each sampled model.\n\nMore detailed API description and usage can be found :doc:`here </NAS/construct_space>`.\n\n<div class=\"alert alert-info\"><h4>Note</h4><p>We are actively enriching the mutation APIs, to facilitate easy construction of model space.\n If the currently supported mutation APIs cannot express your model space,\n please refer to :doc:`this doc </NAS/Mutators>` for customizing mutators.</p></div>\n\n## Explore the Defined Model Space\n\nThere are basically two exploration approaches: (1) search by evaluating each sampled model independently,\nwhich is the search approach in multi-trial NAS and (2) one-shot weight-sharing based search, which is used in one-shot NAS.\nWe demonstrate the first approach in this tutorial. Users can refer to :doc:`here </NAS/OneshotTrainer>` for the second approach.\n\nFirst, users need to pick a proper exploration strategy to explore the defined model space.\nSecond, users need to pick or customize a model evaluator to evaluate the performance of each explored model.\n\n### Pick an exploration strategy\n\nRetiarii supports many :doc:`exploration strategies </NAS/ExplorationStrategies>`.\n\nSimply choosing (i.e., instantiate) an exploration strategy as below.\n\n"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"collapsed": false
},
"outputs": [],
"source": [
"import nni.retiarii.strategy as strategy\nsearch_strategy = strategy.Random(dedup=True) # dedup=False if deduplication is not wanted"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"### Pick or customize a model evaluator\n\nIn the exploration process, the exploration strategy repeatedly generates new models. A model evaluator is for training and validating each generated model to obtain the model's performance. The performance is sent to the exploration strategy for the strategy to generate better models.\n\nRetiarii has provided :doc:`built-in model evaluators </NAS/ModelEvaluators>`, but to start with, it is recommended to use ``FunctionalEvaluator``, that is, to wrap your own training and evaluation code with one single function. This function should receive one single model class and uses ``nni.report_final_result`` to report the final score of this model.\n\nAn example here creates a simple evaluator that runs on MNIST dataset, trains for 2 epochs, and reports its validation accuracy.\n\n"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"collapsed": false
},
"outputs": [],
"source": [
"import nni\n\nfrom torchvision import transforms\nfrom torchvision.datasets import MNIST\nfrom torch.utils.data import DataLoader\n\n\ndef train_epoch(model, device, train_loader, optimizer, epoch):\n loss_fn = torch.nn.CrossEntropyLoss()\n model.train()\n for batch_idx, (data, target) in enumerate(train_loader):\n data, target = data.to(device), target.to(device)\n optimizer.zero_grad()\n output = model(data)\n loss = loss_fn(output, target)\n loss.backward()\n optimizer.step()\n if batch_idx % 10 == 0:\n print('Train Epoch: {} [{}/{} ({:.0f}%)]\\tLoss: {:.6f}'.format(\n epoch, batch_idx * len(data), len(train_loader.dataset),\n 100. * batch_idx / len(train_loader), loss.item()))\n\n\ndef test_epoch(model, device, test_loader):\n model.eval()\n test_loss = 0\n correct = 0\n with torch.no_grad():\n for data, target in test_loader:\n data, target = data.to(device), target.to(device)\n output = model(data)\n pred = output.argmax(dim=1, keepdim=True)\n correct += pred.eq(target.view_as(pred)).sum().item()\n\n test_loss /= len(test_loader.dataset)\n accuracy = 100. * correct / len(test_loader.dataset)\n\n print('\\nTest set: Accuracy: {}/{} ({:.0f}%)\\n'.format(\n correct, len(test_loader.dataset), accuracy))\n\n return accuracy\n\n\ndef evaluate_model(model_cls):\n # \"model_cls\" is a class, need to instantiate\n model = model_cls()\n\n device = torch.device('cuda') if torch.cuda.is_available() else torch.device('cpu')\n model.to(device)\n\n optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)\n transf = transforms.Compose([transforms.ToTensor(), transforms.Normalize((0.1307,), (0.3081,))])\n train_loader = DataLoader(MNIST('data/mnist', download=True, transform=transf), batch_size=64, shuffle=True)\n test_loader = DataLoader(MNIST('data/mnist', download=True, train=False, transform=transf), batch_size=64)\n\n for epoch in range(3):\n # train the model for one epoch\n train_epoch(model, device, train_loader, optimizer, epoch)\n # test the model for one epoch\n accuracy = test_epoch(model, device, test_loader)\n # call report intermediate result. Result can be float or dict\n nni.report_intermediate_result(accuracy)\n\n # report final test result\n nni.report_final_result(accuracy)"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"Create the evaluator\n\n"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"collapsed": false
},
"outputs": [],
"source": [
"from nni.retiarii.evaluator import FunctionalEvaluator\nevaluator = FunctionalEvaluator(evaluate_model)"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"The ``train_epoch`` and ``test_epoch`` here can be any customized function, where users can write their own training recipe.\n\nIt is recommended that the :doc:``evaluate_model`` here accepts no additional arguments other than ``model_cls``.\nHowever, in the `advanced tutorial </NAS/ModelEvaluators>`, we will show how to use additional arguments in case you actually need those.\nIn future, we will support mutation on the arguments of evaluators, which is commonly called \"Hyper-parmeter tuning\".\n\n## Launch an Experiment\n\nAfter all the above are prepared, it is time to start an experiment to do the model search. An example is shown below.\n\n"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"collapsed": false
},
"outputs": [],
"source": [
"from nni.retiarii.experiment.pytorch import RetiariiExperiment, RetiariiExeConfig\nexp = RetiariiExperiment(model_space, evaluator, [], search_strategy)\nexp_config = RetiariiExeConfig('local')\nexp_config.experiment_name = 'mnist_search'"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"The following configurations are useful to control how many trials to run at most / at the same time.\n\n"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"collapsed": false
},
"outputs": [],
"source": [
"exp_config.max_trial_number = 4 # spawn 4 trials at most\nexp_config.trial_concurrency = 2 # will run two trials concurrently"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"Remember to set the following config if you want to GPU.\n``use_active_gpu`` should be set true if you wish to use an occupied GPU (possibly running a GUI).\n\n"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"collapsed": false
},
"outputs": [],
"source": [
"exp_config.trial_gpu_number = 1\nexp_config.training_service.use_active_gpu = False"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"Launch the experiment. The experiment should take several minutes to finish on a workstation with 2 GPUs.\n\n"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"collapsed": false
},
"outputs": [],
"source": [
"exp.run(exp_config, 8081)"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"Users can also run Retiarii Experiment with :doc:`different training services <../training_services>` besides ``local`` training service.\n\n## Visualize the Experiment\n\nUsers can visualize their experiment in the same way as visualizing a normal hyper-parameter tuning experiment.\nFor example, open ``localhost:8081`` in your browser, 8081 is the port that you set in ``exp.run``.\nPlease refer to :doc:`here <../Tutorial/WebUI>` for details.\n\nWe support visualizing models with 3rd-party visualization engines (like `Netron <https://netron.app/>`__).\nThis can be used by clicking ``Visualization`` in detail panel for each trial.\nNote that current visualization is based on `onnx <https://onnx.ai/>`__ ,\nthus visualization is not feasible if the model cannot be exported into onnx.\n\nBuilt-in evaluators (e.g., Classification) will automatically export the model into a file.\nFor your own evaluator, you need to save your file into ``$NNI_OUTPUT_DIR/model.onnx`` to make this work.\nFor instance,\n\n"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"collapsed": false
},
"outputs": [],
"source": [
"import os\nfrom pathlib import Path\n\n\ndef evaluate_model_with_visualization(model_cls):\n model = model_cls()\n # dump the model into an onnx\n if 'NNI_OUTPUT_DIR' in os.environ:\n dummy_input = torch.zeros(1, 3, 32, 32)\n torch.onnx.export(model, (dummy_input, ),\n Path(os.environ['NNI_OUTPUT_DIR']) / 'model.onnx')\n evaluate_model(model_cls)"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"Relaunch the experiment, and a button is shown on WebUI.\n\n<img src=\"file://../../img/netron_entrance_webui.png\">\n\n## Export Top Models\n\nUsers can export top models after the exploration is done using ``export_top_models``.\n\n"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"collapsed": false
},
"outputs": [],
"source": [
"for model_dict in exp.export_top_models(formatter='dict'):\n print(model_dict)\n\n# The output is `json` object which records the mutation actions of the top model.\n# If users want to output source code of the top model, they can use graph-based execution engine for the experiment,\n# by simply adding the following two lines.\n#\n# .. code-block:: python\n#\n# exp_config.execution_engine = 'base'\n# export_formatter = 'code'"
]
}
],
"metadata": {
"kernelspec": {
"display_name": "Python 3",
"language": "python",
"name": "python3"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.8.8"
}
},
"nbformat": 4,
"nbformat_minor": 0
}
\ No newline at end of file
"""
Hello, NAS!
===========
This is the 101 tutorial of Neural Architecture Search (NAS) on NNI.
In this tutorial, we will search for a neural architecture on the MNIST dataset with the help of the NAS framework of NNI, i.e., *Retiarii*.
We use multi-trial NAS as an example to show how to construct and explore a model space.

There are mainly three crucial components for a neural architecture search task, namely,

* A model search space that defines a set of models to explore.
* A proper strategy as the method to explore this model space.
* A model evaluator that reports the performance of every model in the space.

Currently, PyTorch is the only framework supported by Retiarii, and we have only tested **PyTorch 1.7 to 1.10**.
This tutorial assumes a PyTorch context, but it should also apply to other frameworks, which is on our future plan.

Define your Model Space
-----------------------

A model space is defined by users to express a set of models that users want to explore, which contains potentially good-performing models.
In this framework, a model space is defined with two parts: a base model and possible mutations on the base model.
"""
# %%
#
# Define Base Model
# ^^^^^^^^^^^^^^^^^
#
# Defining a base model is almost the same as defining a PyTorch (or TensorFlow) model.
# Usually, you only need to replace the code ``import torch.nn as nn`` with
# ``import nni.retiarii.nn.pytorch as nn`` to use our wrapped PyTorch modules.
#
# Below is a very simple example of defining a base model.
import torch
import torch.nn.functional as F
import nni.retiarii.nn.pytorch as nn
from nni.retiarii import model_wrapper
@model_wrapper  # this decorator should be put on the outermost module
class Net(nn.Module):
def __init__(self):
super().__init__()
self.conv1 = nn.Conv2d(1, 32, 3, 1)
self.conv2 = nn.Conv2d(32, 64, 3, 1)
self.dropout1 = nn.Dropout(0.25)
self.dropout2 = nn.Dropout(0.5)
self.fc1 = nn.Linear(9216, 128)
self.fc2 = nn.Linear(128, 10)
def forward(self, x):
x = F.relu(self.conv1(x))
x = F.max_pool2d(self.conv2(x), 2)
x = torch.flatten(self.dropout1(x), 1)
x = self.fc2(self.dropout2(F.relu(self.fc1(x))))
output = F.log_softmax(x, dim=1)
return output
# %%
# .. tip:: Always keep in mind that you should use ``import nni.retiarii.nn.pytorch as nn`` and :meth:`nni.retiarii.model_wrapper`.
# Many mistakes are a result of forgetting one of those.
#    Also, please use ``torch.nn`` for submodules such as ``init``, e.g., write ``torch.nn.init`` instead of ``nn.init``.
#
# Define Model Mutations
# ^^^^^^^^^^^^^^^^^^^^^^
#
# A base model is only one concrete model, not a model space. We provide :doc:`API and Primitives </NAS/MutationPrimitives>`
# for users to express how the base model can be mutated, that is, to build a model space which includes many models.
#
# Based on the above base model, we can define a model space as below.
#
# .. code-block:: diff
#
# @model_wrapper
# class Net(nn.Module):
# def __init__(self):
# super().__init__()
# self.conv1 = nn.Conv2d(1, 32, 3, 1)
# - self.conv2 = nn.Conv2d(32, 64, 3, 1)
# + self.conv2 = nn.LayerChoice([
# + nn.Conv2d(32, 64, 3, 1),
# + DepthwiseSeparableConv(32, 64)
# + ])
# - self.dropout1 = nn.Dropout(0.25)
# + self.dropout1 = nn.Dropout(nn.ValueChoice([0.25, 0.5, 0.75]))
# self.dropout2 = nn.Dropout(0.5)
# - self.fc1 = nn.Linear(9216, 128)
# - self.fc2 = nn.Linear(128, 10)
# + feature = nn.ValueChoice([64, 128, 256])
# + self.fc1 = nn.Linear(9216, feature)
# + self.fc2 = nn.Linear(feature, 10)
#
# def forward(self, x):
# x = F.relu(self.conv1(x))
# x = F.max_pool2d(self.conv2(x), 2)
# x = torch.flatten(self.dropout1(x), 1)
# x = self.fc2(self.dropout2(F.relu(self.fc1(x))))
# output = F.log_softmax(x, dim=1)
# return output
#
# This results in the following code:
class DepthwiseSeparableConv(nn.Module):
def __init__(self, in_ch, out_ch):
super().__init__()
self.depthwise = nn.Conv2d(in_ch, in_ch, kernel_size=3, groups=in_ch)
self.pointwise = nn.Conv2d(in_ch, out_ch, kernel_size=1)
def forward(self, x):
return self.pointwise(self.depthwise(x))
@model_wrapper
class ModelSpace(nn.Module):
def __init__(self):
super().__init__()
self.conv1 = nn.Conv2d(1, 32, 3, 1)
# LayerChoice is used to select a layer between Conv2d and DwConv.
self.conv2 = nn.LayerChoice([
nn.Conv2d(32, 64, 3, 1),
DepthwiseSeparableConv(32, 64)
])
# ValueChoice is used to select a dropout rate.
# ValueChoice can be used as parameter of modules wrapped in `nni.retiarii.nn.pytorch`
# or customized modules wrapped with `@basic_unit`.
self.dropout1 = nn.Dropout(nn.ValueChoice([0.25, 0.5, 0.75])) # choose dropout rate from 0.25, 0.5 and 0.75
self.dropout2 = nn.Dropout(0.5)
feature = nn.ValueChoice([64, 128, 256])
self.fc1 = nn.Linear(9216, feature)
self.fc2 = nn.Linear(feature, 10)
def forward(self, x):
x = F.relu(self.conv1(x))
x = F.max_pool2d(self.conv2(x), 2)
x = torch.flatten(self.dropout1(x), 1)
x = self.fc2(self.dropout2(F.relu(self.fc1(x))))
output = F.log_softmax(x, dim=1)
return output
model_space = ModelSpace()
model_space
# %%
# This example uses two mutation APIs, ``nn.LayerChoice`` and ``nn.ValueChoice``.
# ``nn.LayerChoice`` takes a list of candidate modules (two in this example), of which one will be chosen for each sampled model.
# It can be used like a normal PyTorch module.
# ``nn.ValueChoice`` takes a list of candidate values, of which one will be chosen to take effect for each sampled model.
#
# More detailed API description and usage can be found :doc:`here </NAS/construct_space>`.
#
# .. note::
#
# We are actively enriching the mutation APIs, to facilitate easy construction of model space.
# If the currently supported mutation APIs cannot express your model space,
# please refer to :doc:`this doc </NAS/Mutators>` for customizing mutators.
#
# Explore the Defined Model Space
# -------------------------------
#
# There are basically two exploration approaches: (1) search by evaluating each sampled model independently,
# which is the search approach in multi-trial NAS and (2) one-shot weight-sharing based search, which is used in one-shot NAS.
# We demonstrate the first approach in this tutorial. Users can refer to :doc:`here </NAS/OneshotTrainer>` for the second approach.
#
# First, users need to pick a proper exploration strategy to explore the defined model space.
# Second, users need to pick or customize a model evaluator to evaluate the performance of each explored model.
#
# Pick an exploration strategy
# ^^^^^^^^^^^^^^^^^^^^^^^^^^^^
#
# Retiarii supports many :doc:`exploration strategies </NAS/ExplorationStrategies>`.
#
# Simply choose (i.e., instantiate) an exploration strategy as below.
import nni.retiarii.strategy as strategy
search_strategy = strategy.Random(dedup=True) # dedup=False if deduplication is not wanted
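# %%
# Other strategies can be instantiated in the same way. A hedged sketch (check the
# strategy reference linked above for the constructor arguments your NNI version accepts):
#
# .. code-block:: python
#
#    search_strategy = strategy.GridSearch()             # exhaustive enumeration
#    search_strategy = strategy.RegularizedEvolution()   # evolutionary search
#
# Random search with deduplication is a common, cheap baseline for small spaces like the one in this tutorial.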
# %%
# Pick or customize a model evaluator
# ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
#
# In the exploration process, the exploration strategy repeatedly generates new models. A model evaluator is responsible for training and validating each generated model to obtain the model's performance. The performance is sent back to the exploration strategy so that the strategy can generate better models.
#
# Retiarii has provided :doc:`built-in model evaluators </NAS/ModelEvaluators>`, but to start with, it is recommended to use ``FunctionalEvaluator``, that is, to wrap your own training and evaluation code with one single function. This function should receive a single model class and use ``nni.report_final_result`` to report the final score of this model.
#
# An example here creates a simple evaluator that runs on the MNIST dataset, trains for 3 epochs, and reports its validation accuracy.
import nni
from torchvision import transforms
from torchvision.datasets import MNIST
from torch.utils.data import DataLoader
def train_epoch(model, device, train_loader, optimizer, epoch):
loss_fn = torch.nn.CrossEntropyLoss()
model.train()
for batch_idx, (data, target) in enumerate(train_loader):
data, target = data.to(device), target.to(device)
optimizer.zero_grad()
output = model(data)
loss = loss_fn(output, target)
loss.backward()
optimizer.step()
if batch_idx % 10 == 0:
print('Train Epoch: {} [{}/{} ({:.0f}%)]\tLoss: {:.6f}'.format(
epoch, batch_idx * len(data), len(train_loader.dataset),
100. * batch_idx / len(train_loader), loss.item()))
def test_epoch(model, device, test_loader):
    model.eval()
    correct = 0
    with torch.no_grad():
        for data, target in test_loader:
            data, target = data.to(device), target.to(device)
            output = model(data)
            pred = output.argmax(dim=1, keepdim=True)
            correct += pred.eq(target.view_as(pred)).sum().item()

    accuracy = 100. * correct / len(test_loader.dataset)

    print('\nTest set: Accuracy: {}/{} ({:.0f}%)\n'.format(
        correct, len(test_loader.dataset), accuracy))

    return accuracy
def evaluate_model(model_cls):
# "model_cls" is a class, need to instantiate
model = model_cls()
device = torch.device('cuda') if torch.cuda.is_available() else torch.device('cpu')
model.to(device)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
transf = transforms.Compose([transforms.ToTensor(), transforms.Normalize((0.1307,), (0.3081,))])
train_loader = DataLoader(MNIST('data/mnist', download=True, transform=transf), batch_size=64, shuffle=True)
test_loader = DataLoader(MNIST('data/mnist', download=True, train=False, transform=transf), batch_size=64)
for epoch in range(3):
# train the model for one epoch
train_epoch(model, device, train_loader, optimizer, epoch)
# test the model for one epoch
accuracy = test_epoch(model, device, test_loader)
# call report intermediate result. Result can be float or dict
nni.report_intermediate_result(accuracy)
# report final test result
nni.report_final_result(accuracy)
# %%
# Create the evaluator
from nni.retiarii.evaluator import FunctionalEvaluator
evaluator = FunctionalEvaluator(evaluate_model)
# %%
#
# The ``train_epoch`` and ``test_epoch`` here can be any customized function, where users can write their own training recipe.
#
# It is recommended that the ``evaluate_model`` here accepts no additional arguments other than ``model_cls``.
# However, in the :doc:`advanced tutorial </NAS/ModelEvaluators>`, we will show how to use additional arguments in case you actually need those.
# In the future, we will support mutation on the arguments of evaluators, which is commonly called "hyper-parameter tuning".
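# A generic plain-Python way to bind extra arguments in the meantime is ``functools.partial``.
# This is only a sketch under the assumption that the bound function stays serializable by NNI;
# ``evaluate_model_with_lr`` below is a hypothetical variant of ``evaluate_model`` that takes a learning rate.
#
# .. code-block:: python
#
#    import functools
#
#    def evaluate_model_with_lr(model_cls, learning_rate):
#        ...  # same as evaluate_model, but with a parameterized optimizer (hypothetical helper)
#
#    evaluator = FunctionalEvaluator(functools.partial(evaluate_model_with_lr, learning_rate=1e-3))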
#
# Launch an Experiment
# --------------------
#
# After all the above are prepared, it is time to start an experiment to do the model search. An example is shown below.
from nni.retiarii.experiment.pytorch import RetiariiExperiment, RetiariiExeConfig
exp = RetiariiExperiment(model_space, evaluator, [], search_strategy)
exp_config = RetiariiExeConfig('local')
exp_config.experiment_name = 'mnist_search'
# %%
# The following configurations are useful to control how many trials to run at most and how many to run at the same time.
exp_config.max_trial_number = 4 # spawn 4 trials at most
exp_config.trial_concurrency = 2 # will run two trials concurrently
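# %%
# If you would rather bound the search by wall-clock time than by trial count,
# the experiment config also accepts a duration limit. A sketch, assuming your
# NNI version exposes this field:
#
# .. code-block:: python
#
#    exp_config.max_experiment_duration = '1h'  # stop the search after one hour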
# %%
# Remember to set the following config if you want to use GPU.
# ``use_active_gpu`` should be set to true if you wish to use an occupied GPU (possibly running a GUI).
exp_config.trial_gpu_number = 1
exp_config.training_service.use_active_gpu = False
# %%
# Launch the experiment. The experiment should take several minutes to finish on a workstation with 2 GPUs.
exp.run(exp_config, 8081)
# %%
# Users can also run a Retiarii experiment with :doc:`different training services <../training_services>` besides the ``local`` training service.
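# For example, a remote training service can be selected when the config is created.
# A sketch, assuming remote machines are configured as described in the training service documentation:
#
# .. code-block:: python
#
#    exp_config = RetiariiExeConfig('remote')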
#
# Visualize the Experiment
# ------------------------
#
# Users can visualize their experiment in the same way as visualizing a normal hyper-parameter tuning experiment.
# For example, open ``localhost:8081`` in your browser, where 8081 is the port you set in ``exp.run``.
# Please refer to :doc:`here <../Tutorial/WebUI>` for details.
#
# We support visualizing models with 3rd-party visualization engines (like `Netron <https://netron.app/>`__).
# This can be used by clicking ``Visualization`` in the detail panel of each trial.
# Note that the current visualization is based on `onnx <https://onnx.ai/>`__,
# so visualization is not feasible if the model cannot be exported into ONNX.
#
# Built-in evaluators (e.g., Classification) will automatically export the model into a file.
# For your own evaluator, you need to save your model into ``$NNI_OUTPUT_DIR/model.onnx`` to make this work.
# For instance,
import os
from pathlib import Path
def evaluate_model_with_visualization(model_cls):
model = model_cls()
# dump the model into an onnx
if 'NNI_OUTPUT_DIR' in os.environ:
dummy_input = torch.zeros(1, 3, 32, 32)
torch.onnx.export(model, (dummy_input, ),
Path(os.environ['NNI_OUTPUT_DIR']) / 'model.onnx')
evaluate_model(model_cls)
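# %%
# To use it, wrap this function with ``FunctionalEvaluator`` just like before;
# the rest of the experiment setup stays unchanged:
#
# .. code-block:: python
#
#    evaluator = FunctionalEvaluator(evaluate_model_with_visualization)
#    exp = RetiariiExperiment(model_space, evaluator, [], search_strategy)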
# %%
# Relaunch the experiment, and a button will be shown on the WebUI.
#
# .. image:: ../../img/netron_entrance_webui.png
#
# Export Top Models
# -----------------
#
# Users can export top models after the exploration is done using ``export_top_models``.
for model_dict in exp.export_top_models(formatter='dict'):
print(model_dict)
# The output is a JSON object which records the mutation actions of the top model.
# If users want to output the source code of the top model, they can use the graph-based execution engine for the experiment,
# by simply adding the following two lines.
#
# .. code-block:: python
#
# exp_config.execution_engine = 'base'
# export_formatter = 'code'
49ae2fd144f8c845a18b778edf168636
\ No newline at end of file
.. DO NOT EDIT.
.. THIS FILE WAS AUTOMATICALLY GENERATED BY SPHINX-GALLERY.
.. TO MAKE CHANGES, EDIT THE SOURCE PYTHON FILE:
.. "tutorials/hello_nas.py"
.. LINE NUMBERS ARE GIVEN BELOW.
.. only:: html
.. note::
:class: sphx-glr-download-link-note
Click :ref:`here <sphx_glr_download_tutorials_hello_nas.py>`
to download the full example code
.. rst-class:: sphx-glr-example-title
.. _sphx_glr_tutorials_hello_nas.py:
Hello, NAS!
===========
This is the 101 tutorial of Neural Architecture Search (NAS) on NNI.
In this tutorial, we will search for a neural architecture on the MNIST dataset with the help of the NAS framework of NNI, i.e., *Retiarii*.
We use multi-trial NAS as an example to show how to construct and explore a model space.

There are mainly three crucial components for a neural architecture search task, namely,

* A model search space that defines a set of models to explore.
* A proper strategy as the method to explore this model space.
* A model evaluator that reports the performance of every model in the space.

Currently, PyTorch is the only framework supported by Retiarii, and we have only tested **PyTorch 1.7 to 1.10**.
This tutorial assumes a PyTorch context, but it should also apply to other frameworks, which is on our future plan.

Define your Model Space
-----------------------

A model space is defined by users to express a set of models that users want to explore, which contains potentially good-performing models.
In this framework, a model space is defined with two parts: a base model and possible mutations on the base model.
.. GENERATED FROM PYTHON SOURCE LINES 26-34
Define Base Model
^^^^^^^^^^^^^^^^^
Defining a base model is almost the same as defining a PyTorch (or TensorFlow) model.
Usually, you only need to replace the code ``import torch.nn as nn`` with
``import nni.retiarii.nn.pytorch as nn`` to use our wrapped PyTorch modules.
Below is a very simple example of defining a base model.
.. GENERATED FROM PYTHON SOURCE LINES 35-61
.. code-block:: default
import torch
import torch.nn.functional as F
import nni.retiarii.nn.pytorch as nn
from nni.retiarii import model_wrapper
@model_wrapper  # this decorator should be put on the outermost module
class Net(nn.Module):
def __init__(self):
super().__init__()
self.conv1 = nn.Conv2d(1, 32, 3, 1)
self.conv2 = nn.Conv2d(32, 64, 3, 1)
self.dropout1 = nn.Dropout(0.25)
self.dropout2 = nn.Dropout(0.5)
self.fc1 = nn.Linear(9216, 128)
self.fc2 = nn.Linear(128, 10)
def forward(self, x):
x = F.relu(self.conv1(x))
x = F.max_pool2d(self.conv2(x), 2)
x = torch.flatten(self.dropout1(x), 1)
x = self.fc2(self.dropout2(F.relu(self.fc1(x))))
output = F.log_softmax(x, dim=1)
return output
.. GENERATED FROM PYTHON SOURCE LINES 62-104
.. tip:: Always keep in mind that you should use ``import nni.retiarii.nn.pytorch as nn`` and :meth:`nni.retiarii.model_wrapper`.
Many mistakes are a result of forgetting one of those.
Also, please use ``torch.nn`` for submodules such as ``init``, e.g., write ``torch.nn.init`` instead of ``nn.init``.
Define Model Mutations
^^^^^^^^^^^^^^^^^^^^^^
A base model is only one concrete model, not a model space. We provide :doc:`API and Primitives </NAS/MutationPrimitives>`
for users to express how the base model can be mutated, that is, to build a model space which includes many models.
Based on the above base model, we can define a model space as below.
.. code-block:: diff
@model_wrapper
class Net(nn.Module):
def __init__(self):
super().__init__()
self.conv1 = nn.Conv2d(1, 32, 3, 1)
- self.conv2 = nn.Conv2d(32, 64, 3, 1)
+ self.conv2 = nn.LayerChoice([
+ nn.Conv2d(32, 64, 3, 1),
+ DepthwiseSeparableConv(32, 64)
+ ])
- self.dropout1 = nn.Dropout(0.25)
+ self.dropout1 = nn.Dropout(nn.ValueChoice([0.25, 0.5, 0.75]))
self.dropout2 = nn.Dropout(0.5)
- self.fc1 = nn.Linear(9216, 128)
- self.fc2 = nn.Linear(128, 10)
+ feature = nn.ValueChoice([64, 128, 256])
+ self.fc1 = nn.Linear(9216, feature)
+ self.fc2 = nn.Linear(feature, 10)
def forward(self, x):
x = F.relu(self.conv1(x))
x = F.max_pool2d(self.conv2(x), 2)
x = torch.flatten(self.dropout1(x), 1)
x = self.fc2(self.dropout2(F.relu(self.fc1(x))))
output = F.log_softmax(x, dim=1)
return output
This results in the following code:
.. GENERATED FROM PYTHON SOURCE LINES 104-147
.. code-block:: default
class DepthwiseSeparableConv(nn.Module):
def __init__(self, in_ch, out_ch):
super().__init__()
self.depthwise = nn.Conv2d(in_ch, in_ch, kernel_size=3, groups=in_ch)
self.pointwise = nn.Conv2d(in_ch, out_ch, kernel_size=1)
def forward(self, x):
return self.pointwise(self.depthwise(x))
@model_wrapper
class ModelSpace(nn.Module):
def __init__(self):
super().__init__()
self.conv1 = nn.Conv2d(1, 32, 3, 1)
# LayerChoice is used to select a layer between Conv2d and DwConv.
self.conv2 = nn.LayerChoice([
nn.Conv2d(32, 64, 3, 1),
DepthwiseSeparableConv(32, 64)
])
# ValueChoice is used to select a dropout rate.
# ValueChoice can be used as parameter of modules wrapped in `nni.retiarii.nn.pytorch`
# or customized modules wrapped with `@basic_unit`.
self.dropout1 = nn.Dropout(nn.ValueChoice([0.25, 0.5, 0.75])) # choose dropout rate from 0.25, 0.5 and 0.75
self.dropout2 = nn.Dropout(0.5)
feature = nn.ValueChoice([64, 128, 256])
self.fc1 = nn.Linear(9216, feature)
self.fc2 = nn.Linear(feature, 10)
def forward(self, x):
x = F.relu(self.conv1(x))
x = F.max_pool2d(self.conv2(x), 2)
x = torch.flatten(self.dropout1(x), 1)
x = self.fc2(self.dropout2(F.relu(self.fc1(x))))
output = F.log_softmax(x, dim=1)
return output
model_space = ModelSpace()
model_space
.. rst-class:: sphx-glr-script-out
Out:
.. code-block:: none
ModelSpace(
(conv1): Conv2d(1, 32, kernel_size=(3, 3), stride=(1, 1))
(conv2): LayerChoice([Conv2d(32, 64, kernel_size=(3, 3), stride=(1, 1)), DepthwiseSeparableConv(
(depthwise): Conv2d(32, 32, kernel_size=(3, 3), stride=(1, 1), groups=32)
(pointwise): Conv2d(32, 64, kernel_size=(1, 1), stride=(1, 1))
)], label='model_1')
(dropout1): Dropout(p=0.25, inplace=False)
(dropout2): Dropout(p=0.5, inplace=False)
(fc1): Linear(in_features=9216, out_features=64, bias=True)
(fc2): Linear(in_features=64, out_features=10, bias=True)
)
.. GENERATED FROM PYTHON SOURCE LINES 148-177
This example uses two mutation APIs, ``nn.LayerChoice`` and ``nn.ValueChoice``.
``nn.LayerChoice`` takes a list of candidate modules (two in this example), of which one will be chosen for each sampled model.
It can be used like a normal PyTorch module.
``nn.ValueChoice`` takes a list of candidate values, of which one will be chosen to take effect for each sampled model.
More detailed API description and usage can be found :doc:`here </NAS/construct_space>`.
.. note::
We are actively enriching the mutation APIs, to facilitate easy construction of model space.
If the currently supported mutation APIs cannot express your model space,
please refer to :doc:`this doc </NAS/Mutators>` for customizing mutators.
Explore the Defined Model Space
-------------------------------
There are basically two exploration approaches: (1) search by evaluating each sampled model independently,
which is the search approach in multi-trial NAS and (2) one-shot weight-sharing based search, which is used in one-shot NAS.
We demonstrate the first approach in this tutorial. Users can refer to :doc:`here </NAS/OneshotTrainer>` for the second approach.
First, users need to pick a proper exploration strategy to explore the defined model space.
Second, users need to pick or customize a model evaluator to evaluate the performance of each explored model.
Pick an exploration strategy
^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Retiarii supports many :doc:`exploration strategies </NAS/ExplorationStrategies>`.
Simply choose (i.e., instantiate) an exploration strategy as below.
.. GENERATED FROM PYTHON SOURCE LINES 177-181
.. code-block:: default
import nni.retiarii.strategy as strategy
search_strategy = strategy.Random(dedup=True) # dedup=False if deduplication is not wanted
.. rst-class:: sphx-glr-script-out
Out:
.. code-block:: none
[2022-02-22 18:55:27] INFO (hyperopt.utils/MainThread) Failed to load dill, try installing dill via "pip install dill" for enhanced pickling support.
[2022-02-22 18:55:27] INFO (hyperopt.fmin/MainThread) Failed to load dill, try installing dill via "pip install dill" for enhanced pickling support.
/home/yugzhan/miniconda3/envs/cu102/lib/python3.8/site-packages/ray/autoscaler/_private/cli_logger.py:57: FutureWarning: Not all Ray CLI dependencies were found. In Ray 1.4+, the Ray CLI, autoscaler, and dashboard will only be usable via `pip install 'ray[default]'`. Please update your install command.
warnings.warn(
.. GENERATED FROM PYTHON SOURCE LINES 182-190
Pick or customize a model evaluator
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
In the exploration process, the exploration strategy repeatedly generates new models. A model evaluator is responsible for training and validating each generated model to obtain the model's performance. The performance is sent back to the exploration strategy so that the strategy can generate better models.

Retiarii has provided :doc:`built-in model evaluators </NAS/ModelEvaluators>`, but to start with, it is recommended to use ``FunctionalEvaluator``, that is, to wrap your own training and evaluation code with one single function. This function should receive a single model class and use ``nni.report_final_result`` to report the final score of this model.

An example here creates a simple evaluator that runs on the MNIST dataset, trains for 3 epochs, and reports its validation accuracy.
.. GENERATED FROM PYTHON SOURCE LINES 190-258
.. code-block:: default
import nni
from torchvision import transforms
from torchvision.datasets import MNIST
from torch.utils.data import DataLoader
def train_epoch(model, device, train_loader, optimizer, epoch):
loss_fn = torch.nn.CrossEntropyLoss()
model.train()
for batch_idx, (data, target) in enumerate(train_loader):
data, target = data.to(device), target.to(device)
optimizer.zero_grad()
output = model(data)
loss = loss_fn(output, target)
loss.backward()
optimizer.step()
if batch_idx % 10 == 0:
print('Train Epoch: {} [{}/{} ({:.0f}%)]\tLoss: {:.6f}'.format(
epoch, batch_idx * len(data), len(train_loader.dataset),
100. * batch_idx / len(train_loader), loss.item()))
def test_epoch(model, device, test_loader):
    model.eval()
    correct = 0
    with torch.no_grad():
        for data, target in test_loader:
            data, target = data.to(device), target.to(device)
            output = model(data)
            pred = output.argmax(dim=1, keepdim=True)
            correct += pred.eq(target.view_as(pred)).sum().item()

    accuracy = 100. * correct / len(test_loader.dataset)

    print('\nTest set: Accuracy: {}/{} ({:.0f}%)\n'.format(
        correct, len(test_loader.dataset), accuracy))

    return accuracy
def evaluate_model(model_cls):
# "model_cls" is a class, need to instantiate
model = model_cls()
device = torch.device('cuda') if torch.cuda.is_available() else torch.device('cpu')
model.to(device)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
transf = transforms.Compose([transforms.ToTensor(), transforms.Normalize((0.1307,), (0.3081,))])
train_loader = DataLoader(MNIST('data/mnist', download=True, transform=transf), batch_size=64, shuffle=True)
test_loader = DataLoader(MNIST('data/mnist', download=True, train=False, transform=transf), batch_size=64)
for epoch in range(3):
# train the model for one epoch
train_epoch(model, device, train_loader, optimizer, epoch)
# test the model for one epoch
accuracy = test_epoch(model, device, test_loader)
# call report intermediate result. Result can be float or dict
nni.report_intermediate_result(accuracy)
# report final test result
nni.report_final_result(accuracy)
.. GENERATED FROM PYTHON SOURCE LINES 259-260
Create the evaluator
.. GENERATED FROM PYTHON SOURCE LINES 260-264
.. code-block:: default
from nni.retiarii.evaluator import FunctionalEvaluator
evaluator = FunctionalEvaluator(evaluate_model)
.. GENERATED FROM PYTHON SOURCE LINES 265-275
The ``train_epoch`` and ``test_epoch`` here can be any customized function, where users can write their own training recipe.
It is recommended that the ``evaluate_model`` here accepts no additional arguments other than ``model_cls``.
However, in the :doc:`advanced tutorial </NAS/ModelEvaluators>`, we will show how to use additional arguments in case you actually need those.
In the future, we will support mutation on the arguments of evaluators, which is commonly called "hyper-parameter tuning".
Launch an Experiment
--------------------
After all the above are prepared, it is time to start an experiment to do the model search. An example is shown below.
.. GENERATED FROM PYTHON SOURCE LINES 276-282
.. code-block:: default
from nni.retiarii.experiment.pytorch import RetiariiExperiment, RetiariiExeConfig
exp = RetiariiExperiment(model_space, evaluator, [], search_strategy)
exp_config = RetiariiExeConfig('local')
exp_config.experiment_name = 'mnist_search'
.. GENERATED FROM PYTHON SOURCE LINES 283-284
The following configurations are useful to control how many trials to run at most and how many to run at the same time.
.. GENERATED FROM PYTHON SOURCE LINES 284-288
.. code-block:: default
exp_config.max_trial_number = 4 # spawn 4 trials at most
exp_config.trial_concurrency = 2 # will run two trials concurrently
.. GENERATED FROM PYTHON SOURCE LINES 289-291
Remember to set the following config if you want to use GPU.
``use_active_gpu`` should be set to true if you wish to use an occupied GPU (possibly running a GUI).
.. GENERATED FROM PYTHON SOURCE LINES 291-295
.. code-block:: default
exp_config.trial_gpu_number = 1
exp_config.training_service.use_active_gpu = False
.. GENERATED FROM PYTHON SOURCE LINES 296-297
Launch the experiment. The experiment should take several minutes to finish on a workstation with 2 GPUs.
.. GENERATED FROM PYTHON SOURCE LINES 297-300
.. code-block:: default
exp.run(exp_config, 8081)
.. rst-class:: sphx-glr-script-out
Out:
.. code-block:: none
[2022-02-22 18:55:28] INFO (nni.experiment/MainThread) Creating experiment, Experiment ID: 68a4xl2o
[2022-02-22 18:55:28] INFO (nni.experiment/MainThread) Connecting IPC pipe...
[2022-02-22 18:55:28] INFO (nni.experiment/MainThread) Starting web server...
[2022-02-22 18:55:29] INFO (nni.experiment/MainThread) Setting up...
[2022-02-22 18:55:30] INFO (nni.runtime.msg_dispatcher_base/Thread-3) Dispatcher started
[2022-02-22 18:55:30] INFO (nni.retiarii.experiment.pytorch/MainThread) Web UI URLs: http://127.0.0.1:8081 http://10.190.172.35:8081 http://192.168.49.1:8081 http://172.17.0.1:8081
[2022-02-22 18:55:30] INFO (nni.retiarii.experiment.pytorch/MainThread) Start strategy...
[2022-02-22 18:55:30] INFO (root/MainThread) Successfully update searchSpace.
[2022-02-22 18:55:30] INFO (nni.retiarii.strategy.bruteforce/MainThread) Random search running in fixed size mode. Dedup: on.
[2022-02-22 18:57:50] INFO (nni.retiarii.experiment.pytorch/Thread-4) Stopping experiment, please wait...
[2022-02-22 18:57:50] INFO (nni.retiarii.experiment.pytorch/MainThread) Strategy exit
[2022-02-22 18:57:50] INFO (nni.retiarii.experiment.pytorch/MainThread) Waiting for experiment to become DONE (you can ctrl+c if there is no running trial jobs)...
[2022-02-22 18:57:51] INFO (nni.runtime.msg_dispatcher_base/Thread-3) Dispatcher exiting...
[2022-02-22 18:57:51] INFO (nni.retiarii.experiment.pytorch/Thread-4) Experiment stopped
.. GENERATED FROM PYTHON SOURCE LINES 301-318
Users can also run a Retiarii experiment with :doc:`different training services <../training_services>` besides the ``local`` training service.
Visualize the Experiment
------------------------
Users can visualize their experiment in the same way as visualizing a normal hyper-parameter tuning experiment.
For example, open ``localhost:8081`` in your browser, where 8081 is the port you set in ``exp.run``.
Please refer to :doc:`here <../Tutorial/WebUI>` for details.

We support visualizing models with 3rd-party visualization engines (like `Netron <https://netron.app/>`__).
This can be used by clicking ``Visualization`` in the detail panel of each trial.
Note that the current visualization is based on `onnx <https://onnx.ai/>`__,
so visualization is not feasible if the model cannot be exported into ONNX.

Built-in evaluators (e.g., Classification) will automatically export the model into a file.
For your own evaluator, you need to save your model into ``$NNI_OUTPUT_DIR/model.onnx`` to make this work.
For instance,
.. GENERATED FROM PYTHON SOURCE LINES 318-332
.. code-block:: default
import os
from pathlib import Path
def evaluate_model_with_visualization(model_cls):
model = model_cls()
# dump the model into an onnx
if 'NNI_OUTPUT_DIR' in os.environ:
dummy_input = torch.zeros(1, 3, 32, 32)
torch.onnx.export(model, (dummy_input, ),
Path(os.environ['NNI_OUTPUT_DIR']) / 'model.onnx')
evaluate_model(model_cls)
.. GENERATED FROM PYTHON SOURCE LINES 333-341
Relaunch the experiment, and a button will be shown on the WebUI.
.. image:: ../../img/netron_entrance_webui.png
Export Top Models
-----------------
Users can export top models after the exploration is done using ``export_top_models``.
.. GENERATED FROM PYTHON SOURCE LINES 341-353
.. code-block:: default
for model_dict in exp.export_top_models(formatter='dict'):
print(model_dict)
# The output is a JSON object which records the mutation actions of the top model.
# If users want to output the source code of the top model, they can use the graph-based execution engine for the experiment,
# by simply adding the following two lines.
#
# .. code-block:: python
#
# exp_config.execution_engine = 'base'
# export_formatter = 'code'
.. rst-class:: sphx-glr-script-out
Out:
.. code-block:: none
{'model_1': '1', 'model_2': 0.5, 'model_3': 256}
.. rst-class:: sphx-glr-timing
**Total running time of the script:** ( 2 minutes 24.722 seconds)
.. _sphx_glr_download_tutorials_hello_nas.py:
.. only :: html
.. container:: sphx-glr-footer
:class: sphx-glr-footer-example
.. container:: sphx-glr-download sphx-glr-download-python
:download:`Download Python source code: hello_nas.py <hello_nas.py>`
.. container:: sphx-glr-download sphx-glr-download-jupyter
:download:`Download Jupyter notebook: hello_nas.ipynb <hello_nas.ipynb>`
.. only:: html
.. rst-class:: sphx-glr-signature
`Gallery generated by Sphinx-Gallery <https://sphinx-gallery.github.io>`_
...@@ -11,14 +11,14 @@ Tutorials

 .. raw:: html

-    <div class="sphx-glr-thumbcontainer" tooltip="Get started with NAS on MNIST">
+    <div class="sphx-glr-thumbcontainer" tooltip="Start and Manage a New Experiment">

 .. only:: html

-    .. figure:: /tutorials/images/thumb/sphx_glr_nas_quick_start_mnist_thumb.png
-        :alt: Get started with NAS on MNIST
+    .. figure:: /tutorials/images/thumb/sphx_glr_nni_experiment_thumb.png
+        :alt: Start and Manage a New Experiment

-    :ref:`sphx_glr_tutorials_nas_quick_start_mnist.py`
+    :ref:`sphx_glr_tutorials_nni_experiment.py`

 .. raw:: html

...@@ -28,18 +28,18 @@ Tutorials

 .. toctree::
    :hidden:

-   /tutorials/nas_quick_start_mnist
+   /tutorials/nni_experiment

 .. raw:: html

-    <div class="sphx-glr-thumbcontainer" tooltip="Start and Manage a New Experiment">
+    <div class="sphx-glr-thumbcontainer" tooltip="In this tutorial, we show how to use NAS Benchmarks as datasets. For research purposes we somet...">

 .. only:: html

-    .. figure:: /tutorials/images/thumb/sphx_glr_nni_experiment_thumb.png
-        :alt: Start and Manage a New Experiment
+    .. figure:: /tutorials/images/thumb/sphx_glr_nasbench_as_dataset_thumb.png
+        :alt: Use NAS Benchmarks as Datasets

-    :ref:`sphx_glr_tutorials_nni_experiment.py`
+    :ref:`sphx_glr_tutorials_nasbench_as_dataset.py`

 .. raw:: html

...@@ -49,7 +49,28 @@ Tutorials

 .. toctree::
    :hidden:

-   /tutorials/nni_experiment
+   /tutorials/nasbench_as_dataset
+
+.. raw:: html
+
+    <div class="sphx-glr-thumbcontainer" tooltip="This is the 101 tutorial of Neural Architecture Search (NAS) on NNI. In this tutorial, we will ...">
+
+.. only:: html
+
+    .. figure:: /tutorials/images/thumb/sphx_glr_hello_nas_thumb.png
+        :alt: Hello, NAS!
+
+    :ref:`sphx_glr_tutorials_hello_nas.py`
+
+.. raw:: html
+
+    </div>
+
+.. toctree::
+   :hidden:
+
+   /tutorials/hello_nas

 .. raw:: html

    <div class="sphx-glr-clear"></div>
...
{
"cells": [
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"collapsed": false
},
"outputs": [],
"source": [
"%matplotlib inline"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"\n# Get started with NAS on MNIST\n"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"collapsed": false
},
"outputs": [],
"source": [
"a = (1, 2, 3)\na"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"collapsed": false
},
"outputs": [],
"source": [
"print('hello')"
]
}
],
"metadata": {
"kernelspec": {
"display_name": "Python 3",
"language": "python",
"name": "python3"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.8.8"
}
},
"nbformat": 4,
"nbformat_minor": 0
}
\ No newline at end of file
"""
Get started with NAS on MNIST
=============================
"""
# %%
a = (1, 2, 3)
a
# %%
print('hello')
f87a716bc3274d0f9a77db503198ac4a
\ No newline at end of file
.. DO NOT EDIT.
.. THIS FILE WAS AUTOMATICALLY GENERATED BY SPHINX-GALLERY.
.. TO MAKE CHANGES, EDIT THE SOURCE PYTHON FILE:
.. "tutorials/nas_quick_start_mnist.py"
.. LINE NUMBERS ARE GIVEN BELOW.
.. only:: html
.. note::
:class: sphx-glr-download-link-note
Click :ref:`here <sphx_glr_download_tutorials_nas_quick_start_mnist.py>`
to download the full example code
.. rst-class:: sphx-glr-example-title
.. _sphx_glr_tutorials_nas_quick_start_mnist.py:
Get started with NAS on MNIST
=============================
.. GENERATED FROM PYTHON SOURCE LINES 7-10
.. code-block:: default
a = (1, 2, 3)
a
.. rst-class:: sphx-glr-script-out
Out:
.. code-block:: none
(1, 2, 3)
.. GENERATED FROM PYTHON SOURCE LINES 11-12
.. code-block:: default
print('hello')
.. rst-class:: sphx-glr-script-out
Out:
.. code-block:: none
hello
.. rst-class:: sphx-glr-timing
**Total running time of the script:** ( 0 minutes 0.002 seconds)
.. _sphx_glr_download_tutorials_nas_quick_start_mnist.py:
.. only :: html
.. container:: sphx-glr-footer
:class: sphx-glr-footer-example
.. container:: sphx-glr-download sphx-glr-download-python
:download:`Download Python source code: nas_quick_start_mnist.py <nas_quick_start_mnist.py>`
.. container:: sphx-glr-download sphx-glr-download-jupyter
:download:`Download Jupyter notebook: nas_quick_start_mnist.ipynb <nas_quick_start_mnist.ipynb>`
.. only:: html
.. rst-class:: sphx-glr-signature
`Gallery generated by Sphinx-Gallery <https://sphinx-gallery.github.io>`_
{
"cells": [
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"collapsed": false
},
"outputs": [],
"source": [
"%matplotlib inline"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"\n# Use NAS Benchmarks as Datasets\n\nIn this tutorial, we show how to use NAS Benchmarks as datasets.\nFor research purposes we sometimes desire to query the benchmarks for architecture accuracies,\nrather than train them one by one from scratch.\nNNI has provided query tools so that users can easily get the retrieve the data in NAS benchmarks.\n"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"## Prerequisites\nThis tutorial assumes that you have already prepared your NAS benchmarks under cache directory\n(by default, ``~/.cache/nni/nasbenchmark``).\nIf you haven't, please follow the data preparation guide in :doc:`../NAS/Benchmarks`.\n\nAs a result, the directory should look like:\n\n"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"collapsed": false
},
"outputs": [],
"source": [
"import os\nos.listdir(os.path.expanduser('~/.cache/nni/nasbenchmark'))"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"collapsed": false
},
"outputs": [],
"source": [
"import pprint\n\nfrom nni.nas.benchmarks.nasbench101 import query_nb101_trial_stats\nfrom nni.nas.benchmarks.nasbench201 import query_nb201_trial_stats\nfrom nni.nas.benchmarks.nds import query_nds_trial_stats"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"## NAS-Bench-101\n\nUse the following architecture as an example:\n\n<img src=\"file://../../img/nas-bench-101-example.png\">\n\n"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"collapsed": false
},
"outputs": [],
"source": [
"arch = {\n 'op1': 'conv3x3-bn-relu',\n 'op2': 'maxpool3x3',\n 'op3': 'conv3x3-bn-relu',\n 'op4': 'conv3x3-bn-relu',\n 'op5': 'conv1x1-bn-relu',\n 'input1': [0],\n 'input2': [1],\n 'input3': [2],\n 'input4': [0],\n 'input5': [0, 3, 4],\n 'input6': [2, 5]\n}\nfor t in query_nb101_trial_stats(arch, 108, include_intermediates=True):\n pprint.pprint(t)"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"An architecture of NAS-Bench-101 could be trained more than once.\nEach element of the returned generator is a dict which contains one of the training results of this trial config\n(architecture + hyper-parameters) including train/valid/test accuracy,\ntraining time, number of epochs, etc. The results of NAS-Bench-201 and NDS follow similar formats.\n\n## NAS-Bench-201\n\nUse the following architecture as an example:\n\n<img src=\"file://../../img/nas-bench-201-example.png\">\n\n"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"collapsed": false
},
"outputs": [],
"source": [
"arch = {\n '0_1': 'avg_pool_3x3',\n '0_2': 'conv_1x1',\n '1_2': 'skip_connect',\n '0_3': 'conv_1x1',\n '1_3': 'skip_connect',\n '2_3': 'skip_connect'\n}\nfor t in query_nb201_trial_stats(arch, 200, 'cifar100'):\n pprint.pprint(t)"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"Intermediate results are also available.\n\n"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"collapsed": false
},
"outputs": [],
"source": [
"for t in query_nb201_trial_stats(arch, None, 'imagenet16-120', include_intermediates=True):\n print(t['config'])\n print('Intermediates:', len(t['intermediates']))"
]
},
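{
"cell_type": "markdown",
"metadata": {},
"source": [
"As a minimal sketch (assuming the intermediates follow the same format as the NAS-Bench-101 ones shown above), we can turn them into a learning curve:\n\n"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"collapsed": false
},
"outputs": [],
"source": [
"# A minimal sketch: field names are assumed to match the NAS-Bench-101 intermediates above.\ntrial = next(query_nb201_trial_stats(arch, 200, 'cifar100', include_intermediates=True))\ncurve = [(i['current_epoch'], i['test_acc']) for i in trial['intermediates']]\nprint('First 5 points of the test accuracy curve:', curve[:5])"
]
},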
{
"cell_type": "markdown",
"metadata": {},
"source": [
"## NDS\n\nUse the following architecture as an example:\n\n<img src=\"file://../../img/nas-bench-nds-example.png\">\n\nHere, ``bot_muls``, ``ds``, ``num_gs``, ``ss`` and ``ws`` stand for \"bottleneck multipliers\",\n\"depths\", \"number of groups\", \"strides\" and \"widths\" respectively.\n\n"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"collapsed": false
},
"outputs": [],
"source": [
"model_spec = {\n 'bot_muls': [0.0, 0.25, 0.25, 0.25],\n 'ds': [1, 16, 1, 4],\n 'num_gs': [1, 2, 1, 2],\n 'ss': [1, 1, 2, 2],\n 'ws': [16, 64, 128, 16]\n}"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"Use none as a wildcard.\n\n"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"collapsed": false
},
"outputs": [],
"source": [
"for t in query_nds_trial_stats('residual_bottleneck', None, None, model_spec, None, 'cifar10'):\n pprint.pprint(t)"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"collapsed": false
},
"outputs": [],
"source": [
"model_spec = {\n 'bot_muls': [0.0, 0.25, 0.25, 0.25],\n 'ds': [1, 16, 1, 4],\n 'num_gs': [1, 2, 1, 2],\n 'ss': [1, 1, 2, 2],\n 'ws': [16, 64, 128, 16]\n}\nfor t in query_nds_trial_stats('residual_bottleneck', None, None, model_spec, None, 'cifar10', include_intermediates=True):\n pprint.pprint(t['intermediates'][:10])"
]
},
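{
"cell_type": "markdown",
"metadata": {},
"source": [
"As a sketch, matched records can also be post-processed in Python, e.g., ranked by their final test accuracy:\n\n"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"collapsed": false
},
"outputs": [],
"source": [
"# A sketch: pick the matched record with the best final test accuracy.\nbest = max(query_nds_trial_stats('residual_bottleneck', None, None, model_spec, None, 'cifar10'),\n           key=lambda t: t['final_test_acc'])\nprint('Best final test accuracy:', best['final_test_acc'])"
]
},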
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"collapsed": false
},
"outputs": [],
"source": [
"model_spec = {'ds': [1, 12, 12, 12], 'ss': [1, 1, 2, 2], 'ws': [16, 24, 24, 40]}\nfor t in query_nds_trial_stats('residual_basic', 'resnet', 'random', model_spec, {}, 'cifar10'):\n pprint.pprint(t)"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"Get the first one.\n\n"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"collapsed": false
},
"outputs": [],
"source": [
"pprint.pprint(next(query_nds_trial_stats('vanilla', None, None, None, None, None)))"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"Count number.\n\n"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"collapsed": false
},
"outputs": [],
"source": [
"model_spec = {'num_nodes_normal': 5, 'num_nodes_reduce': 5, 'depth': 12, 'width': 32, 'aux': False, 'drop_prob': 0.0}\ncell_spec = {\n 'normal_0_op_x': 'avg_pool_3x3',\n 'normal_0_input_x': 0,\n 'normal_0_op_y': 'conv_7x1_1x7',\n 'normal_0_input_y': 1,\n 'normal_1_op_x': 'sep_conv_3x3',\n 'normal_1_input_x': 2,\n 'normal_1_op_y': 'sep_conv_5x5',\n 'normal_1_input_y': 0,\n 'normal_2_op_x': 'dil_sep_conv_3x3',\n 'normal_2_input_x': 2,\n 'normal_2_op_y': 'dil_sep_conv_3x3',\n 'normal_2_input_y': 2,\n 'normal_3_op_x': 'skip_connect',\n 'normal_3_input_x': 4,\n 'normal_3_op_y': 'dil_sep_conv_3x3',\n 'normal_3_input_y': 4,\n 'normal_4_op_x': 'conv_7x1_1x7',\n 'normal_4_input_x': 2,\n 'normal_4_op_y': 'sep_conv_3x3',\n 'normal_4_input_y': 4,\n 'normal_concat': [3, 5, 6],\n 'reduce_0_op_x': 'avg_pool_3x3',\n 'reduce_0_input_x': 0,\n 'reduce_0_op_y': 'dil_sep_conv_3x3',\n 'reduce_0_input_y': 1,\n 'reduce_1_op_x': 'sep_conv_3x3',\n 'reduce_1_input_x': 0,\n 'reduce_1_op_y': 'sep_conv_3x3',\n 'reduce_1_input_y': 0,\n 'reduce_2_op_x': 'skip_connect',\n 'reduce_2_input_x': 2,\n 'reduce_2_op_y': 'sep_conv_7x7',\n 'reduce_2_input_y': 0,\n 'reduce_3_op_x': 'conv_7x1_1x7',\n 'reduce_3_input_x': 4,\n 'reduce_3_op_y': 'skip_connect',\n 'reduce_3_input_y': 4,\n 'reduce_4_op_x': 'conv_7x1_1x7',\n 'reduce_4_input_x': 0,\n 'reduce_4_op_y': 'conv_7x1_1x7',\n 'reduce_4_input_y': 5,\n 'reduce_concat': [3, 6]\n}\n\nfor t in query_nds_trial_stats('nas_cell', None, None, model_spec, cell_spec, 'cifar10'):\n assert t['config']['model_spec'] == model_spec\n assert t['config']['cell_spec'] == cell_spec\n pprint.pprint(t)"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"Count number.\n\n"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"collapsed": false
},
"outputs": [],
"source": [
"print('NDS (amoeba) count:', len(list(query_nds_trial_stats(None, 'amoeba', None, None, None, None, None))))"
]
}
],
"metadata": {
"kernelspec": {
"display_name": "Python 3",
"language": "python",
"name": "python3"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.8.8"
}
},
"nbformat": 4,
"nbformat_minor": 0
}
\ No newline at end of file
"""
Use NAS Benchmarks as Datasets
==============================
In this tutorial, we show how to use NAS Benchmarks as datasets.
For research purposes, we sometimes want to query a benchmark for architecture accuracies
rather than training every architecture from scratch.
NNI provides query tools so that users can easily retrieve the data from NAS benchmarks.
"""
# %%
# Prerequisites
# -------------
# This tutorial assumes that you have already prepared your NAS benchmarks under the cache directory
# (by default, ``~/.cache/nni/nasbenchmark``).
# If you haven't, please follow the data preparation guide in :doc:`../NAS/Benchmarks`.
#
# Once prepared, the directory should look like this:
import os
os.listdir(os.path.expanduser('~/.cache/nni/nasbenchmark'))
# %%
import pprint
from nni.nas.benchmarks.nasbench101 import query_nb101_trial_stats
from nni.nas.benchmarks.nasbench201 import query_nb201_trial_stats
from nni.nas.benchmarks.nds import query_nds_trial_stats
# %%
# NAS-Bench-101
# -------------
#
# Use the following architecture as an example:
#
# .. image:: ../../img/nas-bench-101-example.png
arch = {
'op1': 'conv3x3-bn-relu',
'op2': 'maxpool3x3',
'op3': 'conv3x3-bn-relu',
'op4': 'conv3x3-bn-relu',
'op5': 'conv1x1-bn-relu',
'input1': [0],
'input2': [1],
'input3': [2],
'input4': [0],
'input5': [0, 3, 4],
'input6': [2, 5]
}
for t in query_nb101_trial_stats(arch, 108, include_intermediates=True):
pprint.pprint(t)
# %%
# An architecture in NAS-Bench-101 may have been trained more than once.
# Each element of the returned generator is a dict containing one training result of this trial config
# (architecture + hyper-parameters), including train/valid/test accuracy,
# training time, number of epochs, etc. The results of NAS-Bench-201 and NDS follow similar formats.
#
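# As a minimal sketch, we can aggregate the repeated runs of the architecture
# queried above (the query returns a generator, so we materialize it first):
stats = list(query_nb101_trial_stats(arch, 108))
print('Number of runs:', len(stats))
print('Mean final test accuracy:', sum(t['test_acc'] for t in stats) / len(stats))
# %%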
# NAS-Bench-201
# -------------
#
# Use the following architecture as an example:
#
# .. image:: ../../img/nas-bench-201-example.png
arch = {
'0_1': 'avg_pool_3x3',
'0_2': 'conv_1x1',
'1_2': 'skip_connect',
'0_3': 'conv_1x1',
'1_3': 'skip_connect',
'2_3': 'skip_connect'
}
for t in query_nb201_trial_stats(arch, 200, 'cifar100'):
pprint.pprint(t)
# %%
# Intermediate results are also available.
for t in query_nb201_trial_stats(arch, None, 'imagenet16-120', include_intermediates=True):
print(t['config'])
print('Intermediates:', len(t['intermediates']))
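# %%
# As a minimal sketch (assuming the intermediates follow the same format as the
# NAS-Bench-101 ones shown above), we can turn them into a learning curve:
trial = next(query_nb201_trial_stats(arch, 200, 'cifar100', include_intermediates=True))
curve = [(i['current_epoch'], i['test_acc']) for i in trial['intermediates']]
print('First 5 points of the test accuracy curve:', curve[:5])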
# %%
# NDS
# ---
#
# Use the following architecture as an example:
#
# .. image:: ../../img/nas-bench-nds-example.png
#
# Here, ``bot_muls``, ``ds``, ``num_gs``, ``ss`` and ``ws`` stand for "bottleneck multipliers",
# "depths", "number of groups", "strides" and "widths" respectively.
# %%
model_spec = {
'bot_muls': [0.0, 0.25, 0.25, 0.25],
'ds': [1, 16, 1, 4],
'num_gs': [1, 2, 1, 2],
'ss': [1, 1, 2, 2],
'ws': [16, 64, 128, 16]
}
# %%
# Use ``None`` as a wildcard.
for t in query_nds_trial_stats('residual_bottleneck', None, None, model_spec, None, 'cifar10'):
pprint.pprint(t)
# %%
model_spec = {
'bot_muls': [0.0, 0.25, 0.25, 0.25],
'ds': [1, 16, 1, 4],
'num_gs': [1, 2, 1, 2],
'ss': [1, 1, 2, 2],
'ws': [16, 64, 128, 16]
}
for t in query_nds_trial_stats('residual_bottleneck', None, None, model_spec, None, 'cifar10', include_intermediates=True):
pprint.pprint(t['intermediates'][:10])
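# %%
# As a sketch, matched records can also be post-processed in Python, e.g.,
# ranked by their final test accuracy:
best = max(query_nds_trial_stats('residual_bottleneck', None, None, model_spec, None, 'cifar10'),
           key=lambda t: t['final_test_acc'])
print('Best final test accuracy:', best['final_test_acc'])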
# %%
model_spec = {'ds': [1, 12, 12, 12], 'ss': [1, 1, 2, 2], 'ws': [16, 24, 24, 40]}
for t in query_nds_trial_stats('residual_basic', 'resnet', 'random', model_spec, {}, 'cifar10'):
pprint.pprint(t)
# %%
# Get only the first result.
pprint.pprint(next(query_nds_trial_stats('vanilla', None, None, None, None, None)))
# %%
# Query a ``nas_cell`` architecture with a full cell specification.
model_spec = {'num_nodes_normal': 5, 'num_nodes_reduce': 5, 'depth': 12, 'width': 32, 'aux': False, 'drop_prob': 0.0}
cell_spec = {
'normal_0_op_x': 'avg_pool_3x3',
'normal_0_input_x': 0,
'normal_0_op_y': 'conv_7x1_1x7',
'normal_0_input_y': 1,
'normal_1_op_x': 'sep_conv_3x3',
'normal_1_input_x': 2,
'normal_1_op_y': 'sep_conv_5x5',
'normal_1_input_y': 0,
'normal_2_op_x': 'dil_sep_conv_3x3',
'normal_2_input_x': 2,
'normal_2_op_y': 'dil_sep_conv_3x3',
'normal_2_input_y': 2,
'normal_3_op_x': 'skip_connect',
'normal_3_input_x': 4,
'normal_3_op_y': 'dil_sep_conv_3x3',
'normal_3_input_y': 4,
'normal_4_op_x': 'conv_7x1_1x7',
'normal_4_input_x': 2,
'normal_4_op_y': 'sep_conv_3x3',
'normal_4_input_y': 4,
'normal_concat': [3, 5, 6],
'reduce_0_op_x': 'avg_pool_3x3',
'reduce_0_input_x': 0,
'reduce_0_op_y': 'dil_sep_conv_3x3',
'reduce_0_input_y': 1,
'reduce_1_op_x': 'sep_conv_3x3',
'reduce_1_input_x': 0,
'reduce_1_op_y': 'sep_conv_3x3',
'reduce_1_input_y': 0,
'reduce_2_op_x': 'skip_connect',
'reduce_2_input_x': 2,
'reduce_2_op_y': 'sep_conv_7x7',
'reduce_2_input_y': 0,
'reduce_3_op_x': 'conv_7x1_1x7',
'reduce_3_input_x': 4,
'reduce_3_op_y': 'skip_connect',
'reduce_3_input_y': 4,
'reduce_4_op_x': 'conv_7x1_1x7',
'reduce_4_input_x': 0,
'reduce_4_op_y': 'conv_7x1_1x7',
'reduce_4_input_y': 5,
'reduce_concat': [3, 6]
}
for t in query_nds_trial_stats('nas_cell', None, None, model_spec, cell_spec, 'cifar10'):
assert t['config']['model_spec'] == model_spec
assert t['config']['cell_spec'] == cell_spec
pprint.pprint(t)
# %%
# Count the number of matched records.
print('NDS (amoeba) count:', len(list(query_nds_trial_stats(None, 'amoeba', None, None, None, None, None))))
651df59829f535210b4b58cc03027731
\ No newline at end of file
.. DO NOT EDIT.
.. THIS FILE WAS AUTOMATICALLY GENERATED BY SPHINX-GALLERY.
.. TO MAKE CHANGES, EDIT THE SOURCE PYTHON FILE:
.. "tutorials/nasbench_as_dataset.py"
.. LINE NUMBERS ARE GIVEN BELOW.
.. only:: html
.. note::
:class: sphx-glr-download-link-note
Click :ref:`here <sphx_glr_download_tutorials_nasbench_as_dataset.py>`
to download the full example code
.. rst-class:: sphx-glr-example-title
.. _sphx_glr_tutorials_nasbench_as_dataset.py:
Use NAS Benchmarks as Datasets
==============================
In this tutorial, we show how to use NAS Benchmarks as datasets.
For research purposes, we sometimes want to query a benchmark for architecture accuracies
rather than training every architecture from scratch.
NNI provides query tools so that users can easily retrieve the data from NAS benchmarks.
.. GENERATED FROM PYTHON SOURCE LINES 12-19
Prerequisites
-------------
This tutorial assumes that you have already prepared your NAS benchmarks under the cache directory
(by default, ``~/.cache/nni/nasbenchmark``).
If you haven't, please follow the data preparation guide in :doc:`../NAS/Benchmarks`.
Once prepared, the directory should look like this:
.. GENERATED FROM PYTHON SOURCE LINES 19-23
.. code-block:: default
import os
os.listdir(os.path.expanduser('~/.cache/nni/nasbenchmark'))
.. rst-class:: sphx-glr-script-out
Out:
.. code-block:: none
['nasbench101-209f5694.db', 'nasbench201-b2b60732.db', 'nds-5745c235.db']
.. GENERATED FROM PYTHON SOURCE LINES 24-30
.. code-block:: default
import pprint
from nni.nas.benchmarks.nasbench101 import query_nb101_trial_stats
from nni.nas.benchmarks.nasbench201 import query_nb201_trial_stats
from nni.nas.benchmarks.nds import query_nds_trial_stats
.. GENERATED FROM PYTHON SOURCE LINES 31-37
NAS-Bench-101
-------------
Use the following architecture as an example:
.. image:: ../../img/nas-bench-101-example.png
.. GENERATED FROM PYTHON SOURCE LINES 37-54
.. code-block:: default
arch = {
'op1': 'conv3x3-bn-relu',
'op2': 'maxpool3x3',
'op3': 'conv3x3-bn-relu',
'op4': 'conv3x3-bn-relu',
'op5': 'conv1x1-bn-relu',
'input1': [0],
'input2': [1],
'input3': [2],
'input4': [0],
'input5': [0, 3, 4],
'input6': [2, 5]
}
for t in query_nb101_trial_stats(arch, 108, include_intermediates=True):
pprint.pprint(t)
.. rst-class:: sphx-glr-script-out
Out:
.. code-block:: none
[2022-02-22 18:52:29] INFO (nni.nas.benchmarks.utils/MainThread) "/home/yugzhan/.cache/nni/nasbenchmark/nasbench101-209f5694.db" already exists. Checking hash.
{'config': {'arch': {'input1': [0],
'input2': [1],
'input3': [2],
'input4': [0],
'input5': [0, 3, 4],
'input6': [2, 5],
'op1': 'conv3x3-bn-relu',
'op2': 'maxpool3x3',
'op3': 'conv3x3-bn-relu',
'op4': 'conv3x3-bn-relu',
'op5': 'conv1x1-bn-relu'},
'hash': '00005c142e6f48ac74fdcf73e3439874',
'id': 4,
'num_epochs': 108,
'num_vertices': 7},
'id': 10,
'intermediates': [{'current_epoch': 54,
'id': 19,
'test_acc': 77.40384340286255,
'train_acc': 82.82251358032227,
'training_time': 883.4580078125,
'valid_acc': 77.76442170143127},
{'current_epoch': 108,
'id': 20,
'test_acc': 92.11738705635071,
'train_acc': 100.0,
'training_time': 1769.1279296875,
'valid_acc': 92.41786599159241}],
'parameters': 8.55553,
'test_acc': 92.11738705635071,
'train_acc': 100.0,
'training_time': 106147.67578125,
'valid_acc': 92.41786599159241}
{'config': {'arch': {'input1': [0],
'input2': [1],
'input3': [2],
'input4': [0],
'input5': [0, 3, 4],
'input6': [2, 5],
'op1': 'conv3x3-bn-relu',
'op2': 'maxpool3x3',
'op3': 'conv3x3-bn-relu',
'op4': 'conv3x3-bn-relu',
'op5': 'conv1x1-bn-relu'},
'hash': '00005c142e6f48ac74fdcf73e3439874',
'id': 4,
'num_epochs': 108,
'num_vertices': 7},
'id': 11,
'intermediates': [{'current_epoch': 54,
'id': 21,
'test_acc': 82.04126358032227,
'train_acc': 87.96073794364929,
'training_time': 883.6810302734375,
'valid_acc': 82.91265964508057},
{'current_epoch': 108,
'id': 22,
'test_acc': 91.90705418586731,
'train_acc': 100.0,
'training_time': 1768.2509765625,
'valid_acc': 92.45793223381042}],
'parameters': 8.55553,
'test_acc': 91.90705418586731,
'train_acc': 100.0,
'training_time': 106095.05859375,
'valid_acc': 92.45793223381042}
{'config': {'arch': {'input1': [0],
'input2': [1],
'input3': [2],
'input4': [0],
'input5': [0, 3, 4],
'input6': [2, 5],
'op1': 'conv3x3-bn-relu',
'op2': 'maxpool3x3',
'op3': 'conv3x3-bn-relu',
'op4': 'conv3x3-bn-relu',
'op5': 'conv1x1-bn-relu'},
'hash': '00005c142e6f48ac74fdcf73e3439874',
'id': 4,
'num_epochs': 108,
'num_vertices': 7},
'id': 12,
'intermediates': [{'current_epoch': 54,
'id': 23,
'test_acc': 80.58894276618958,
'train_acc': 86.34815812110901,
'training_time': 883.4569702148438,
'valid_acc': 81.1598539352417},
{'current_epoch': 108,
'id': 24,
'test_acc': 92.15745329856873,
'train_acc': 100.0,
'training_time': 1768.9759521484375,
'valid_acc': 93.04887652397156}],
'parameters': 8.55553,
'test_acc': 92.15745329856873,
'train_acc': 100.0,
'training_time': 106138.55712890625,
'valid_acc': 93.04887652397156}
.. GENERATED FROM PYTHON SOURCE LINES 55-66
An architecture in NAS-Bench-101 may have been trained more than once.
Each element of the returned generator is a dict containing one training result of this trial config
(architecture + hyper-parameters), including train/valid/test accuracy,
training time, number of epochs, etc. The results of NAS-Bench-201 and NDS follow similar formats.
NAS-Bench-201
-------------
Use the following architecture as an example:
.. image:: ../../img/nas-bench-201-example.png
.. GENERATED FROM PYTHON SOURCE LINES 66-78
.. code-block:: default
arch = {
'0_1': 'avg_pool_3x3',
'0_2': 'conv_1x1',
'1_2': 'skip_connect',
'0_3': 'conv_1x1',
'1_3': 'skip_connect',
'2_3': 'skip_connect'
}
for t in query_nb201_trial_stats(arch, 200, 'cifar100'):
pprint.pprint(t)
.. rst-class:: sphx-glr-script-out
Out:
.. code-block:: none
[2022-02-22 18:52:36] INFO (nni.nas.benchmarks.utils/MainThread) "/home/yugzhan/.cache/nni/nasbenchmark/nasbench201-b2b60732.db" already exists. Checking hash.
{'config': {'arch': {'0_1': 'avg_pool_3x3',
'0_2': 'conv_1x1',
'0_3': 'conv_1x1',
'1_2': 'skip_connect',
'1_3': 'skip_connect',
'2_3': 'skip_connect'},
'dataset': 'cifar100',
'id': 7,
'num_cells': 5,
'num_channels': 16,
'num_epochs': 200},
'flops': 15.65322,
'id': 3,
'latency': 0.013182918230692545,
'ori_test_acc': 53.11,
'ori_test_evaluation_time': 1.0195916947864352,
'ori_test_loss': 1.7307863704681397,
'parameters': 0.135156,
'seed': 999,
'test_acc': 53.07999995727539,
'test_evaluation_time': 0.5097958473932176,
'test_loss': 1.731276072692871,
'train_acc': 57.82,
'train_loss': 1.5116578379058838,
'training_time': 2888.4371995925903,
'valid_acc': 53.14000000610351,
'valid_evaluation_time': 0.5097958473932176,
'valid_loss': 1.7302966793060304}
{'config': {'arch': {'0_1': 'avg_pool_3x3',
'0_2': 'conv_1x1',
'0_3': 'conv_1x1',
'1_2': 'skip_connect',
'1_3': 'skip_connect',
'2_3': 'skip_connect'},
'dataset': 'cifar100',
'id': 7,
'num_cells': 5,
'num_channels': 16,
'num_epochs': 200},
'flops': 15.65322,
'id': 7,
'latency': 0.013182918230692545,
'ori_test_acc': 51.93,
'ori_test_evaluation_time': 1.0195916947864352,
'ori_test_loss': 1.7572312774658203,
'parameters': 0.135156,
'seed': 777,
'test_acc': 51.979999938964845,
'test_evaluation_time': 0.5097958473932176,
'test_loss': 1.7429540189743042,
'train_acc': 57.578,
'train_loss': 1.5114233912658692,
'training_time': 2888.4371995925903,
'valid_acc': 51.88,
'valid_evaluation_time': 0.5097958473932176,
'valid_loss': 1.7715086591720581}
{'config': {'arch': {'0_1': 'avg_pool_3x3',
'0_2': 'conv_1x1',
'0_3': 'conv_1x1',
'1_2': 'skip_connect',
'1_3': 'skip_connect',
'2_3': 'skip_connect'},
'dataset': 'cifar100',
'id': 7,
'num_cells': 5,
'num_channels': 16,
'num_epochs': 200},
'flops': 15.65322,
'id': 11,
'latency': 0.013182918230692545,
'ori_test_acc': 53.38,
'ori_test_evaluation_time': 1.0195916947864352,
'ori_test_loss': 1.7281623031616211,
'parameters': 0.135156,
'seed': 888,
'test_acc': 53.67999998779297,
'test_evaluation_time': 0.5097958473932176,
'test_loss': 1.7327697801589965,
'train_acc': 57.792,
'train_loss': 1.5091403088760376,
'training_time': 2888.4371995925903,
'valid_acc': 53.08000000610352,
'valid_evaluation_time': 0.5097958473932176,
'valid_loss': 1.7235548280715942}
.. GENERATED FROM PYTHON SOURCE LINES 79-80
Intermediate results are also available.
.. GENERATED FROM PYTHON SOURCE LINES 80-85
.. code-block:: default
for t in query_nb201_trial_stats(arch, None, 'imagenet16-120', include_intermediates=True):
print(t['config'])
print('Intermediates:', len(t['intermediates']))
.. rst-class:: sphx-glr-script-out
Out:
.. code-block:: none
{'id': 4, 'arch': {'0_1': 'avg_pool_3x3', '0_2': 'conv_1x1', '0_3': 'conv_1x1', '1_2': 'skip_connect', '1_3': 'skip_connect', '2_3': 'skip_connect'}, 'num_epochs': 12, 'num_channels': 16, 'num_cells': 5, 'dataset': 'imagenet16-120'}
Intermediates: 12
{'id': 8, 'arch': {'0_1': 'avg_pool_3x3', '0_2': 'conv_1x1', '0_3': 'conv_1x1', '1_2': 'skip_connect', '1_3': 'skip_connect', '2_3': 'skip_connect'}, 'num_epochs': 200, 'num_channels': 16, 'num_cells': 5, 'dataset': 'imagenet16-120'}
Intermediates: 200
{'id': 8, 'arch': {'0_1': 'avg_pool_3x3', '0_2': 'conv_1x1', '0_3': 'conv_1x1', '1_2': 'skip_connect', '1_3': 'skip_connect', '2_3': 'skip_connect'}, 'num_epochs': 200, 'num_channels': 16, 'num_cells': 5, 'dataset': 'imagenet16-120'}
Intermediates: 200
{'id': 8, 'arch': {'0_1': 'avg_pool_3x3', '0_2': 'conv_1x1', '0_3': 'conv_1x1', '1_2': 'skip_connect', '1_3': 'skip_connect', '2_3': 'skip_connect'}, 'num_epochs': 200, 'num_channels': 16, 'num_cells': 5, 'dataset': 'imagenet16-120'}
Intermediates: 200
.. GENERATED FROM PYTHON SOURCE LINES 86-95
NDS
---
Use the following architecture as an example:
.. image:: ../../img/nas-bench-nds-example.png
Here, ``bot_muls``, ``ds``, ``num_gs``, ``ss`` and ``ws`` stand for "bottleneck multipliers",
"depths", "number of groups", "strides" and "widths" respectively.
.. GENERATED FROM PYTHON SOURCE LINES 97-105
.. code-block:: default
model_spec = {
'bot_muls': [0.0, 0.25, 0.25, 0.25],
'ds': [1, 16, 1, 4],
'num_gs': [1, 2, 1, 2],
'ss': [1, 1, 2, 2],
'ws': [16, 64, 128, 16]
}
.. GENERATED FROM PYTHON SOURCE LINES 106-107
Use ``None`` as a wildcard.
.. GENERATED FROM PYTHON SOURCE LINES 107-110
.. code-block:: default
for t in query_nds_trial_stats('residual_bottleneck', None, None, model_spec, None, 'cifar10'):
pprint.pprint(t)
.. rst-class:: sphx-glr-script-out
Out:
.. code-block:: none
[2022-02-22 18:52:47] INFO (nni.nas.benchmarks.utils/MainThread) "/home/yugzhan/.cache/nni/nasbenchmark/nds-5745c235.db" already exists. Checking hash.
{'best_test_acc': 90.48,
'best_train_acc': 96.356,
'best_train_loss': 0.116,
'config': {'base_lr': 0.1,
'cell_spec': {},
'dataset': 'cifar10',
'generator': 'random',
'id': 45505,
'model_family': 'residual_bottleneck',
'model_spec': {'bot_muls': [0.0, 0.25, 0.25, 0.25],
'ds': [1, 16, 1, 4],
'num_gs': [1, 2, 1, 2],
'ss': [1, 1, 2, 2],
'ws': [16, 64, 128, 16]},
'num_epochs': 100,
'proposer': 'resnext-a',
'weight_decay': 0.0005},
'final_test_acc': 90.39,
'final_train_acc': 96.298,
'final_train_loss': 0.116,
'flops': 69.890986,
'id': 45505,
'iter_time': 0.065,
'parameters': 0.083002,
'seed': 1}
.. GENERATED FROM PYTHON SOURCE LINES 111-121
.. code-block:: default
model_spec = {
'bot_muls': [0.0, 0.25, 0.25, 0.25],
'ds': [1, 16, 1, 4],
'num_gs': [1, 2, 1, 2],
'ss': [1, 1, 2, 2],
'ws': [16, 64, 128, 16]
}
for t in query_nds_trial_stats('residual_bottleneck', None, None, model_spec, None, 'cifar10', include_intermediates=True):
pprint.pprint(t['intermediates'][:10])
.. rst-class:: sphx-glr-script-out
Out:
.. code-block:: none
[{'current_epoch': 1,
'id': 4494501,
'test_acc': 41.76,
'train_acc': 30.421000000000006,
'train_loss': 1.793},
{'current_epoch': 2,
'id': 4494502,
'test_acc': 54.66,
'train_acc': 47.24,
'train_loss': 1.415},
{'current_epoch': 3,
'id': 4494503,
'test_acc': 59.97,
'train_acc': 56.983,
'train_loss': 1.179},
{'current_epoch': 4,
'id': 4494504,
'test_acc': 62.91,
'train_acc': 61.955,
'train_loss': 1.048},
{'current_epoch': 5,
'id': 4494505,
'test_acc': 66.16,
'train_acc': 64.493,
'train_loss': 0.983},
{'current_epoch': 6,
'id': 4494506,
'test_acc': 66.5,
'train_acc': 66.274,
'train_loss': 0.937},
{'current_epoch': 7,
'id': 4494507,
'test_acc': 67.55,
'train_acc': 67.426,
'train_loss': 0.907},
{'current_epoch': 8,
'id': 4494508,
'test_acc': 69.45,
'train_acc': 68.45400000000001,
'train_loss': 0.878},
{'current_epoch': 9,
'id': 4494509,
'test_acc': 70.14,
'train_acc': 69.295,
'train_loss': 0.857},
{'current_epoch': 10,
'id': 4494510,
'test_acc': 69.47,
'train_acc': 70.304,
'train_loss': 0.832}]
.. GENERATED FROM PYTHON SOURCE LINES 122-126
.. code-block:: default
model_spec = {'ds': [1, 12, 12, 12], 'ss': [1, 1, 2, 2], 'ws': [16, 24, 24, 40]}
for t in query_nds_trial_stats('residual_basic', 'resnet', 'random', model_spec, {}, 'cifar10'):
pprint.pprint(t)
.. rst-class:: sphx-glr-script-out
Out:
.. code-block:: none
{'best_test_acc': 93.58,
'best_train_acc': 99.772,
'best_train_loss': 0.011,
'config': {'base_lr': 0.1,
'cell_spec': {},
'dataset': 'cifar10',
'generator': 'random',
'id': 108998,
'model_family': 'residual_basic',
'model_spec': {'ds': [1, 12, 12, 12],
'ss': [1, 1, 2, 2],
'ws': [16, 24, 24, 40]},
'num_epochs': 100,
'proposer': 'resnet',
'weight_decay': 0.0005},
'final_test_acc': 93.49,
'final_train_acc': 99.772,
'final_train_loss': 0.011,
'flops': 184.519578,
'id': 108998,
'iter_time': 0.059,
'parameters': 0.594138,
'seed': 1}
.. GENERATED FROM PYTHON SOURCE LINES 127-128
Get only the first result.
.. GENERATED FROM PYTHON SOURCE LINES 128-130
.. code-block:: default
pprint.pprint(next(query_nds_trial_stats('vanilla', None, None, None, None, None)))
.. rst-class:: sphx-glr-script-out
Out:
.. code-block:: none
{'best_test_acc': 84.5,
'best_train_acc': 89.66499999999999,
'best_train_loss': 0.302,
'config': {'base_lr': 0.1,
'cell_spec': {},
'dataset': 'cifar10',
'generator': 'random',
'id': 139492,
'model_family': 'vanilla',
'model_spec': {'ds': [1, 12, 12, 12],
'ss': [1, 1, 2, 2],
'ws': [16, 24, 32, 40]},
'num_epochs': 100,
'proposer': 'vanilla',
'weight_decay': 0.0005},
'final_test_acc': 84.35,
'final_train_acc': 89.633,
'final_train_loss': 0.303,
'flops': 208.36393,
'id': 154692,
'iter_time': 0.058,
'parameters': 0.68977,
'seed': 1}
.. GENERATED FROM PYTHON SOURCE LINES 131-132
Query a ``nas_cell`` architecture with a full cell specification.
.. GENERATED FROM PYTHON SOURCE LINES 132-183
.. code-block:: default
model_spec = {'num_nodes_normal': 5, 'num_nodes_reduce': 5, 'depth': 12, 'width': 32, 'aux': False, 'drop_prob': 0.0}
cell_spec = {
'normal_0_op_x': 'avg_pool_3x3',
'normal_0_input_x': 0,
'normal_0_op_y': 'conv_7x1_1x7',
'normal_0_input_y': 1,
'normal_1_op_x': 'sep_conv_3x3',
'normal_1_input_x': 2,
'normal_1_op_y': 'sep_conv_5x5',
'normal_1_input_y': 0,
'normal_2_op_x': 'dil_sep_conv_3x3',
'normal_2_input_x': 2,
'normal_2_op_y': 'dil_sep_conv_3x3',
'normal_2_input_y': 2,
'normal_3_op_x': 'skip_connect',
'normal_3_input_x': 4,
'normal_3_op_y': 'dil_sep_conv_3x3',
'normal_3_input_y': 4,
'normal_4_op_x': 'conv_7x1_1x7',
'normal_4_input_x': 2,
'normal_4_op_y': 'sep_conv_3x3',
'normal_4_input_y': 4,
'normal_concat': [3, 5, 6],
'reduce_0_op_x': 'avg_pool_3x3',
'reduce_0_input_x': 0,
'reduce_0_op_y': 'dil_sep_conv_3x3',
'reduce_0_input_y': 1,
'reduce_1_op_x': 'sep_conv_3x3',
'reduce_1_input_x': 0,
'reduce_1_op_y': 'sep_conv_3x3',
'reduce_1_input_y': 0,
'reduce_2_op_x': 'skip_connect',
'reduce_2_input_x': 2,
'reduce_2_op_y': 'sep_conv_7x7',
'reduce_2_input_y': 0,
'reduce_3_op_x': 'conv_7x1_1x7',
'reduce_3_input_x': 4,
'reduce_3_op_y': 'skip_connect',
'reduce_3_input_y': 4,
'reduce_4_op_x': 'conv_7x1_1x7',
'reduce_4_input_x': 0,
'reduce_4_op_y': 'conv_7x1_1x7',
'reduce_4_input_y': 5,
'reduce_concat': [3, 6]
}
for t in query_nds_trial_stats('nas_cell', None, None, model_spec, cell_spec, 'cifar10'):
assert t['config']['model_spec'] == model_spec
assert t['config']['cell_spec'] == cell_spec
pprint.pprint(t)
.. rst-class:: sphx-glr-script-out
Out:
.. code-block:: none
{'best_test_acc': 93.37,
'best_train_acc': 99.91,
'best_train_loss': 0.006,
'config': {'base_lr': 0.1,
'cell_spec': {'normal_0_input_x': 0,
'normal_0_input_y': 1,
'normal_0_op_x': 'avg_pool_3x3',
'normal_0_op_y': 'conv_7x1_1x7',
'normal_1_input_x': 2,
'normal_1_input_y': 0,
'normal_1_op_x': 'sep_conv_3x3',
'normal_1_op_y': 'sep_conv_5x5',
'normal_2_input_x': 2,
'normal_2_input_y': 2,
'normal_2_op_x': 'dil_sep_conv_3x3',
'normal_2_op_y': 'dil_sep_conv_3x3',
'normal_3_input_x': 4,
'normal_3_input_y': 4,
'normal_3_op_x': 'skip_connect',
'normal_3_op_y': 'dil_sep_conv_3x3',
'normal_4_input_x': 2,
'normal_4_input_y': 4,
'normal_4_op_x': 'conv_7x1_1x7',
'normal_4_op_y': 'sep_conv_3x3',
'normal_concat': [3, 5, 6],
'reduce_0_input_x': 0,
'reduce_0_input_y': 1,
'reduce_0_op_x': 'avg_pool_3x3',
'reduce_0_op_y': 'dil_sep_conv_3x3',
'reduce_1_input_x': 0,
'reduce_1_input_y': 0,
'reduce_1_op_x': 'sep_conv_3x3',
'reduce_1_op_y': 'sep_conv_3x3',
'reduce_2_input_x': 2,
'reduce_2_input_y': 0,
'reduce_2_op_x': 'skip_connect',
'reduce_2_op_y': 'sep_conv_7x7',
'reduce_3_input_x': 4,
'reduce_3_input_y': 4,
'reduce_3_op_x': 'conv_7x1_1x7',
'reduce_3_op_y': 'skip_connect',
'reduce_4_input_x': 0,
'reduce_4_input_y': 5,
'reduce_4_op_x': 'conv_7x1_1x7',
'reduce_4_op_y': 'conv_7x1_1x7',
'reduce_concat': [3, 6]},
'dataset': 'cifar10',
'generator': 'random',
'id': 1,
'model_family': 'nas_cell',
'model_spec': {'aux': False,
'depth': 12,
'drop_prob': 0.0,
'num_nodes_normal': 5,
'num_nodes_reduce': 5,
'width': 32},
'num_epochs': 100,
'proposer': 'amoeba',
'weight_decay': 0.0005},
'final_test_acc': 93.27,
'final_train_acc': 99.91,
'final_train_loss': 0.006,
'flops': 664.400586,
'id': 1,
'iter_time': 0.281,
'parameters': 4.190314,
'seed': 1}
.. GENERATED FROM PYTHON SOURCE LINES 184-185
Count the number of matched records.
.. GENERATED FROM PYTHON SOURCE LINES 185-186
.. code-block:: default
print('NDS (amoeba) count:', len(list(query_nds_trial_stats(None, 'amoeba', None, None, None, None, None))))
.. rst-class:: sphx-glr-script-out
Out:
.. code-block:: none
NDS (amoeba) count: 5107
.. rst-class:: sphx-glr-timing
**Total running time of the script:** ( 0 minutes 25.047 seconds)
.. _sphx_glr_download_tutorials_nasbench_as_dataset.py:
.. only :: html
.. container:: sphx-glr-footer
:class: sphx-glr-footer-example
.. container:: sphx-glr-download sphx-glr-download-python
:download:`Download Python source code: nasbench_as_dataset.py <nasbench_as_dataset.py>`
.. container:: sphx-glr-download sphx-glr-download-jupyter
:download:`Download Jupyter notebook: nasbench_as_dataset.ipynb <nasbench_as_dataset.ipynb>`
.. only:: html
.. rst-class:: sphx-glr-signature
`Gallery generated by Sphinx-Gallery <https://sphinx-gallery.github.io>`_