More improvements of NAS documents (#4741)

84b9c9b2 · Yuge Zhang · GitHub · a31d37e5 · 84b9c9b2 · 84b9c9b2
Unverified Commit 84b9c9b2 authored Apr 07, 2022 by Yuge Zhang Committed by GitHub Apr 07, 2022
20 changed files
--- a/docs/img/nasnet_cell.png
+++ b/docs/img/nasnet_cell.png
--- a/docs/source/conf.py
+++ b/docs/source/conf.py
@@ -91,6 +91,9 @@ autodoc_typehints = 'description'
 autodoc_typehints_description_target = 'documented'
 autodoc_inherit_docstrings = False
+# Sphinx will warn about all references where the target cannot be found.
+nitpicky = False  # disabled for now
 # Bibliography files
 bibtex_bibfiles = ['refs.bib']
@@ -145,7 +148,6 @@ tutorials_copy_list = [
    ('tutorials/pruning_quick_start_mnist.rst', 'tutorials/cp_global_quickstart_compression.rst'),
    # Others in full-scale materials
-    ('tutorials/hello_nas.rst', 'tutorials/cp_hello_nas_quickstart.rst'),
    ('tutorials/pruning_quick_start_mnist.rst', 'tutorials/cp_pruning_quick_start_mnist.rst'),
    ('tutorials/pruning_speedup.rst', 'tutorials/cp_pruning_speedup.rst'),
    ('tutorials/quantization_quick_start_mnist.rst', 'tutorials/cp_quantization_quick_start_mnist.rst'),

--- a/docs/source/examples.rst
+++ b/docs/source/examples.rst
@@ -7,8 +7,6 @@ Examples
   :maxdepth: 2
   :hidden:
-   tutorials/hello_nas
-   tutorials/nasbench_as_dataset
   tutorials/pruning_quick_start_mnist
   tutorials/pruning_speedup
   tutorials/quantization_quick_start_mnist

--- a/docs/source/nas/benchmarks.rst
+++ b/docs/source/nas/benchmarks.rst
-NAS Benchmarks
+NAS Benchmark
-==============
+=============
 .. toctree::
   :hidden:
@@ -8,15 +8,11 @@ NAS Benchmarks
 .. note:: :doc:`Example usage of NAS benchmarks </tutorials/nasbench_as_dataset>`.
-Introduction
------------
 To improve the reproducibility of NAS algorithms as well as reducing computing resource requirements, researchers proposed a series of NAS benchmarks such as `NAS-Bench-101 <https://arxiv.org/abs/1902.09635>`__, `NAS-Bench-201 <https://arxiv.org/abs/2001.00326>`__, `NDS <https://arxiv.org/abs/1905.13214>`__, etc. NNI provides a query interface for users to acquire these benchmarks. Within just a few lines of code, researcher are able to evaluate their NAS algorithms easily and fairly by utilizing these benchmarks.
 Prerequisites
 -------------
 * Please prepare a folder to household all the benchmark databases. By default, it can be found at ``${HOME}/.cache/nni/nasbenchmark``. Or you can place it anywhere you like, and specify it in ``NASBENCHMARK_DIR`` via ``export NASBENCHMARK_DIR=/path/to/your/nasbenchmark`` before importing NNI.
 * Please install ``peewee`` via ``pip3 install peewee``, which NNI uses to connect to database.
@@ -51,7 +47,7 @@ Please make sure there is at least 10GB free disk space and note that the conver
 Example Usages
 --------------
-Please refer to `examples usages of Benchmarks API <../BenchmarksExample.rst>`__.
+Please refer to :doc:`examples usages of Benchmarks API </tutorials/nasbench_as_dataset>`.
 NAS-Bench-101
 -------------
@@ -63,12 +59,7 @@ NAS-Bench-101 contains 423,624 unique neural networks, combined with 4 variation
 Notably, NAS-Bench-101 eliminates invalid cells (e.g., there is no path from input to output, or there is redundant computation). Furthermore, isomorphic cells are de-duplicated, i.e., all the remaining cells are computationally unique.
-API Documentation
+See :doc:`example usages </tutorials/nasbench_as_dataset>` and :ref:`API references <nas-bench-101-reference>`.
-^^^^^^^^^^^^^^^^^
-.. automodule:: nni.nas.benchmarks.nasbench101
-   :members:
-   :imported-members:
 NAS-Bench-201
 -------------
@@ -79,12 +70,7 @@ NAS-Bench-201
 NAS-Bench-201 is a cell-wise search space that views nodes as tensors and edges as operators. The search space contains all possible densely-connected DAGs with 4 nodes, resulting in 15,625 candidates in total. Each operator (i.e., edge) is selected from a pre-defined operator set (\ ``NONE``, ``SKIP_CONNECT``, ``CONV_1X1``, ``CONV_3X3`` and ``AVG_POOL_3X3``\ ). Training appraoches vary in the dataset used (CIFAR-10, CIFAR-100, ImageNet) and number of epochs scheduled (12 and 200). Each combination of architecture and training approach is repeated 1 - 3 times with different random seeds.
-API Documentation
+See :doc:`example usages </tutorials/nasbench_as_dataset>` and :ref:`API references <nas-bench-201-reference>`.
-^^^^^^^^^^^^^^^^^
-.. automodule:: nni.nas.benchmarks.nasbench201
-   :members:
-   :imported-members:
 NDS
 ---
@@ -96,17 +82,9 @@ NDS
 Instead of storing results obtained with different configurations in separate files, we dump them into one single database to enable comparison in multiple dimensions. Specifically, we use ``model_family`` to distinguish model types, ``model_spec`` for all hyper-parameters needed to build this model, ``cell_spec`` for detailed information on operators and connections if it is a NAS cell, ``generator`` to denote the sampling policy through which this configuration is generated. Refer to API documentation for details.
-Available Operators
-------------------
 Here is a list of available operators used in NDS.
 .. automodule:: nni.nas.benchmarks.nds.constants
   :noindex:
-API Documentation
+See :doc:`example usages </tutorials/nasbench_as_dataset>` and :ref:`API references <nds-reference>`.
-^^^^^^^^^^^^^^^^^
-.. automodule:: nni.nas.benchmarks.nds
-   :members:
-   :imported-members:
--- a/docs/source/nas/construct_space.rst
+++ b/docs/source/nas/construct_space.rst
 Construct Model Space
 =====================
-NNI provides powerful APIs for users to easily express model space (or search space).
+NNI provides powerful (and multi-level) APIs for users to easily express model space (or search space).
-Firstly, users can use high-level APIs (e.g., ValueChoice, LayerChoice) which are building blocks / skeletons of building blocks to construct their search space.
-For advanced cases, NNI also provides interface to customize new mutators for expressing more complicated model spaces.
+* *Mutation Primitives*: high-level APIs (e.g., ValueChoice, LayerChoice) that are utilities to build blocks in search space. In most cases, mutation pritimives should be straightforward yet expressive enough. **We strongly recommend users to try them first,** and report issues if those APIs are not satisfying.
+* *Hyper-module Library*: plug-and-play modules that are proved useful. They are usually well studied in research, and comes with pre-searched results. (For example, the optimal activation function in `AutoActivation <https://arxiv.org/abs/1710.05941>`__ is reported to be `Swish <https://pytorch.org/docs/stable/generated/torch.nn.SiLU.html>`__).
-.. tip:: In most cases, this should be simple but expressive enough. We strongly recommend users to try them first, and report issues if those APIs are not satisfying.
+* *Mutator*: for advanced users only. NNI provides interface to customize new mutators for expressing more complicated model spaces.
-.. _mutation-primitives:
+The following table summarizes all the APIs we have provided for constructing search space.
-Mutation Primitives
+.. list-table::
-------------------
+   :header-rows: 1
+   :widths: auto
-To make users easily express a model space within their PyTorch/TensorFlow model, NNI provides some inline mutation APIs as shown below.
+   * - Name
-.. note:: We can actively adding more mutation primitives. If you have any suggestions, feel free to `ask here <https://github.com/microsoft/nni/issues>`__.
+     - Category
+     - Brief Description
-.. _nas-layer-choice:
+   * - :class:`LayerChoice <nni.retiarii.nn.pytorch.LayerChoice>`
+     - :ref:`Mutation Primitives <mutation-primitives>`
-LayerChoice
+     - Select from some PyTorch modules
-^^^^^^^^^^^
+   * - :class:`InputChoice <nni.retiarii.nn.pytorch.InputChoice>`
+     - :ref:`Mutation Primitives <mutation-primitives>`
-.. autoclass:: nni.retiarii.nn.pytorch.LayerChoice
+     - Select from some inputs (tensors)
-   :members:
+   * - :class:`ValueChoice <nni.retiarii.nn.pytorch.ValueChoice>`
+     - :ref:`Mutation Primitives <mutation-primitives>`
-.. _nas-input-choice:
+     - Select from some candidate values
+   * - :class:`Repeat <nni.retiarii.nn.pytorch.Repeat>`
-InputChoice
+     - :ref:`Mutation Primitives <mutation-primitives>`
-^^^^^^^^^^^
+     - Repeat a block by a variable number of times
+   * - :class:`Cell <nni.retiarii.nn.pytorch.Cell>`
-.. autoclass:: nni.retiarii.nn.pytorch.InputChoice
+     - :ref:`Mutation Primitives <mutation-primitives>`
-   :members:
+     - Cell structure popularly used in literature
+   * - :class:`NasBench101Cell <nni.retiarii.nn.pytorch.NasBench101Cell>`
-.. autoclass:: nni.retiarii.nn.pytorch.ChosenInputs
+     - :ref:`Mutation Primitives <mutation-primitives>`
-   :members:
+     - Cell structure (variant) proposed by NAS-Bench-101
+   * - :class:`NasBench201Cell <nni.retiarii.nn.pytorch.NasBench201Cell>`
-.. _nas-value-choice:
+     - :ref:`Mutation Primitives <mutation-primitives>`
+     - Cell structure (variant) proposed by NAS-Bench-201
-ValueChoice
+   * - :class:`AutoActivation <nni.retiarii.nn.pytorch.AutoActivation>`
-^^^^^^^^^^^
+     - :ref:`Hyper-modules Library <hyper-modules>`
+     - Searching for activation functions
-.. autoclass:: nni.retiarii.nn.pytorch.ValueChoice
+   * - :class:`Mutator <nni.retiarii.Mutator>`
-   :members:
+     - :doc:`Mutator <mutator>`
-   :inherited-members: Module
+     - Flexible mutations on graphs. :doc:`See tutorial here <mutator>`
-.. _nas-model-parameter-choice:
-ModelParameterChoice
-^^^^^^^^^^^^^^^^^^^^
-.. autoclass:: nni.retiarii.nn.pytorch.ModelParameterChoice
-   :members:
-   :inherited-members: Module
-.. _nas-repeat:
-Repeat
-^^^^^^
-.. autoclass:: nni.retiarii.nn.pytorch.Repeat
-   :members:
-.. _nas-cell:
-Cell
-^^^^
-.. autoclass:: nni.retiarii.nn.pytorch.Cell
-   :members:
-.. _nas-cell-101:
-NasBench101Cell
-^^^^^^^^^^^^^^^
-.. autoclass:: nni.retiarii.nn.pytorch.NasBench101Cell
-   :members:
-.. _nas-cell-201:
-NasBench201Cell
-^^^^^^^^^^^^^^^
-.. autoclass:: nni.retiarii.nn.pytorch.NasBench201Cell
-   :members:
-.. _hyper-modules:
-Hyper-module Library (experimental)
-----------------------------------
-Hyper-module is a (PyTorch) module which contains many architecture/hyperparameter candidates for this module. By using hypermodule in user defined model, NNI will help users automatically find the best architecture/hyperparameter of the hyper-modules for this model. This follows the design philosophy of Retiarii that users write DNN model as a space.
-We are planning to support some of the hyper-modules commonly used in the community, such as AutoDropout, AutoActivation. These are considered complementary to :ref:`mutation-primitives`, as they are often more concrete, specific, and tailored for particular needs.
-.. _nas-autoactivation:
-AutoActivation
-^^^^^^^^^^^^^^
-..  autoclass:: nni.retiarii.nn.pytorch.AutoActivation
-    :members:
-Mutators (advanced)
-------------------
-Besides the inline mutation APIs demonstrated :ref:`above <mutation-primitives>`, NNI provides a more general approach to express a model space, i.e., *Mutator*, to cover more complex model spaces. Those inline mutation APIs are also implemented with mutator in the underlying system, which can be seen as a special case of model mutation. Please read :doc:`./mutator` for details.
--- a/docs/source/nas/customize_strategy.rst
+++ b/docs/source/nas/customize_strategy.rst
@@ -4,7 +4,7 @@ Customize Exploration Strategy
 Customize Multi-trial Strategy
 ------------------------------
-If users want to innovate a new exploration strategy, they can easily customize a new one following the interface provided by NNI. Specifically, users should inherit the base strategy class ``BaseStrategy``, then implement the member function ``run``. This member function takes ``base_model`` and ``applied_mutators`` as its input arguments. It can simply apply the user specified mutators in ``applied_mutators`` onto ``base_model`` to generate a new model. When a mutator is applied, it should be bound with a sampler (e.g., ``RandomSampler``). Every sampler implements the ``choice`` function which chooses value(s) from candidate values. The ``choice`` functions invoked in mutators are executed with the sampler.
+If users want to innovate a new exploration strategy, they can easily customize a new one following the interface provided by NNI. Specifically, users should inherit the base strategy class :class:`nni.retiarii.strategy.BaseStrategy`, then implement the member function ``run``. This member function takes ``base_model`` and ``applied_mutators`` as its input arguments. It can simply apply the user specified mutators in ``applied_mutators`` onto ``base_model`` to generate a new model. When a mutator is applied, it should be bound with a sampler (e.g., ``RandomSampler``). Every sampler implements the ``choice`` function which chooses value(s) from candidate values. The ``choice`` functions invoked in mutators are executed with the sampler.
 Below is a very simple random strategy, which makes the choices completely random.
@@ -38,18 +38,7 @@ Below is a very simple random strategy, which makes the choices completely rando
 You can find that this strategy does not know the search space beforehand, it passively makes decisions every time ``choice`` is invoked from mutators. If a strategy wants to know the whole search space before making any decision (e.g., TPE, SMAC), it can use ``dry_run`` function provided by ``Mutator`` to obtain the space. An example strategy can be found :githublink:`here <nni/retiarii/strategy/tpe_strategy.py>`.
-After generating a new model, the strategy can use our provided APIs (e.g., ``submit_models``, ``is_stopped_exec``) to submit the model and get its reported results.
+After generating a new model, the strategy can use our provided APIs (e.g., :func:`nni.retiarii.execution.submit_models`, :func:`nni.retiarii.execution.is_stopped_exec`) to submit the model and get its reported results.
-References
-^^^^^^^^^^
-..  autoclass:: nni.retiarii.Sampler
-    :members:
-    :noindex:
-..  autoclass:: nni.retiarii.strategy.BaseStrategy
-    :members:
-    :noindex:
 Customize a New One-shot Trainer (legacy)
 -----------------------------------------
@@ -58,12 +47,12 @@ One-shot trainers should inherit :class:`nni.retiarii.oneshot.BaseOneShotTrainer
 Writing a one-shot trainer is very different to single-arch evaluator. First of all, there are no more restrictions on init method arguments, any Python arguments are acceptable. Secondly, the model fed into one-shot trainers might be a model with Retiarii-specific modules, such as LayerChoice and InputChoice. Such model cannot directly forward-propagate and trainers need to decide how to handle those modules.
-A typical example is DartsTrainer, where learnable-parameters are used to combine multiple choices in LayerChoice. Retiarii provides ease-to-use utility functions for module-replace purposes, namely ``replace_layer_choice``, ``replace_input_choice``. A simplified example is as follows: 
+A typical example is DartsTrainer, where learnable-parameters are used to combine multiple choices in LayerChoice. Retiarii provides ease-to-use utility functions for module-replace purposes, namely :meth:`nni.retiarii.oneshot.pytorch.utils.replace_layer_choice`, :meth:`nni.retiarii.oneshot.pytorch.utils.replace_input_choice`. A simplified example is as follows: 
 .. code-block:: python
    from nni.retiarii.oneshot import BaseOneShotTrainer
-    from nni.retiarii.oneshot.pytorch import replace_layer_choice, replace_input_choice
+    from nni.retiarii.oneshot.pytorch.utils import replace_layer_choice, replace_input_choice
    class DartsLayerChoice(nn.Module):
@@ -107,10 +96,3 @@ A typical example is DartsTrainer, where learnable-parameters are used to combin
            return result
 The full code of DartsTrainer is available to Retiarii source code. Please have a check at :githublink:`DartsTrainer <nni/retiarii/oneshot/pytorch/darts.py>`.
-References
-^^^^^^^^^^
-..  autoclass:: nni.retiarii.oneshot.BaseOneShotTrainer
-    :members:
-    :noindex:
--- a/docs/source/nas/evaluator.rst
+++ b/docs/source/nas/evaluator.rst
-Model Evaluators
+Model Evaluator
-================
+===============
 A model evaluator is for training and validating each generated model. They are necessary to evaluate the performance of new explored models.
@@ -8,7 +8,7 @@ A model evaluator is for training and validating each generated model. They are
 Customize Evaluator with Any Function
 -------------------------------------
-The simplest way to customize a new evaluator is with functional APIs, which is very easy when training code is already available. Users only need to write a fit function that wraps everything, which usually includes training, validating and testing of a single model. This function takes one positional arguments (``model_cls``) and possible keyword arguments. The keyword arguments (other than ``model_cls``) are fed to FunctionEvaluator as its initialization parameters (note that they will be :doc:`serialized <./serialization>`). In this way, users get everything under their control, but expose less information to the framework and as a result, further optimizations like :ref:`CGO <cgo-execution-engine>` might be not feasible. An example is as belows:
+The simplest way to customize a new evaluator is with :class:`FunctionalEvaluator <nni.retiarii.evaluator.FunctionalEvaluator>`, which is very easy when training code is already available. Users only need to write a fit function that wraps everything, which usually includes training, validating and testing of a single model. This function takes one positional arguments (``model_cls``) and possible keyword arguments. The keyword arguments (other than ``model_cls``) are fed to :class:`FunctionalEvaluator <nni.retiarii.evaluator.FunctionalEvaluator>` as its initialization parameters (note that they will be :doc:`serialized <./serialization>`). In this way, users get everything under their control, but expose less information to the framework and as a result, further optimizations like :ref:`CGO <cgo-execution-engine>` might be not feasible. An example is as belows:
 .. code-block:: python
@@ -48,11 +48,14 @@ Evaluators with PyTorch-Lightning
 Use Built-in Evaluators
 ^^^^^^^^^^^^^^^^^^^^^^^
-NNI provides some commonly used model evaluators for users' convenience. These evaluators are built upon the awesome library PyTorch-Lightning.
+NNI provides some commonly used model evaluators for users' convenience. These evaluators are built upon the awesome library PyTorch-Lightning. Read the :doc:`reference </reference/nas/evaluator>` for their detailed usages.
-We recommend to read the `serialization tutorial <./Serialization.rst>`__ before using these evaluators. A few notes to summarize the tutorial:
+* :class:`nni.retiarii.evaluator.pytorch.Classification`: for classification tasks.
+* :class:`nni.retiarii.evaluator.pytorch.Regression`: for regression tasks.
-1. :class:`nni.retarii.evaluator.pytorch.DataLoader`` should be used in place of ``torch.utils.data.DataLoader``.
+We recommend to read the :doc:`serialization tutorial <serialization>` before using these evaluators. A few notes to summarize the tutorial:
+1. :class:`nni.retiarii.evaluator.pytorch.DataLoader` should be used in place of ``torch.utils.data.DataLoader``.
 2. The datasets used in data-loader should be decorated with :meth:`nni.trace` recursively.
 For example,
@@ -141,39 +144,3 @@ Then, users need to wrap everything (including LightningModule, trainer and data
                             train_dataloader=pl.DataLoader(train_dataset, batch_size=100),
                             val_dataloaders=pl.DataLoader(test_dataset, batch_size=100))
    experiment = RetiariiExperiment(base_model, lightning, mutators, strategy)
-References
----------
-FunctionalEvaluator
-^^^^^^^^^^^^^^^^^^^
-..  autoclass:: nni.retiarii.evaluator.FunctionalEvaluator
-    :members:
-    :noindex:
-.. _classification-evaluator:
-Classification
-^^^^^^^^^^^^^^
-..  autoclass:: nni.retiarii.evaluator.pytorch.lightning.Classification
-    :members:
-    :noindex:
-.. _regression-evaluator:
-Regression
-^^^^^^^^^^
-..  autoclass:: nni.retiarii.evaluator.pytorch.lightning.Regression
-    :members:
-    :noindex:
-Lightning
-^^^^^^^^^
-..  autoclass:: nni.retiarii.evaluator.pytorch.lightning.Lightning
-    :members:
-    :noindex:
--- a/docs/source/nas/execution_engine.rst
+++ b/docs/source/nas/execution_engine.rst
@@ -3,9 +3,9 @@ Execution Engines
 Execution engine is for running Retiarii Experiment. NNI supports three execution engines, users can choose a specific engine according to the type of their model mutation definition and their requirements for cross-model optimizations. 
-* **Pure-python execution engine** is the default engine, it supports the model space expressed by `inline mutation API <./MutationPrimitives.rst>`__. 
+* **Pure-python execution engine** is the default engine, it supports the model space expressed by :doc:`mutation primitives <construct_space>`.
-* **Graph-based execution engine** supports the use of `inline mutation APIs <./MutationPrimitives.rst>`__ and model spaces represented by `mutators <./Mutators.rst>`__. It requires the user's model to be parsed by `TorchScript <https://pytorch.org/docs/stable/jit.html>`__.
+* **Graph-based execution engine** supports the use of :doc:`mutation primitives <construct_space>` and model spaces represented by :doc:`mutators <mutator>`. It requires the user's model to be parsed by `TorchScript <https://pytorch.org/docs/stable/jit.html>`__.
 * **CGO execution engine** has the same requirements and capabilities as the **Graph-based execution engine**. But further enables cross-model optimizations, which makes model space exploration faster.
@@ -14,9 +14,7 @@ Pure-python Execution Engine
 Pure-python Execution Engine is the default engine, we recommend users to keep using this execution engine, if they are new to NNI NAS. Pure-python execution engine plays magic within the scope of inline mutation APIs, while does not touch the rest of user model. Thus, it has minimal requirement on user model. 
-One steps are needed to use this engine now.
+Rememeber to add :meth:`nni.retiarii.model_wrapper` decorator outside the whole PyTorch model before using this engine.
-1. Add ``@nni.retiarii.model_wrapper`` decorator outside the whole PyTorch model.
 .. note:: You should always use ``super().__init__()`` instead of ``super(MyNetwork, self).__init__()`` in the PyTorch model, because the latter one has issues with model wrapper.
@@ -25,11 +23,11 @@ Graph-based Execution Engine
 For graph-based execution engine, it converts user-defined model to a graph representation (called graph IR) using `TorchScript <https://pytorch.org/docs/stable/jit.html>`__, each instantiated module in the model is converted to a subgraph. Then mutations are applied to the graph to generate new graphs. Each new graph is then converted back to PyTorch code and executed on the user specified training service.
-Users may find ``@basic_unit`` helpful in some cases. ``@basic_unit`` here means the module will not be converted to a subgraph, instead, it is converted to a single graph node as a basic unit.
+Users may find ``@basic_unit`` helpful in some cases. :meth:`nni.retiarii.basic_unit` here means the module will not be converted to a subgraph, instead, it is converted to a single graph node as a basic unit.
 ``@basic_unit`` is usually used in the following cases:
-* When users want to tune initialization parameters of a module using ``ValueChoice``, then decorate the module with ``@basic_unit``. For example, ``self.conv = MyConv(kernel_size=nn.ValueChoice([1, 3, 5]))``, here ``MyConv`` should be decorated.
+* When users want to tune initialization parameters of a module using :class:`nni.retiarii.nn.pytorch.ValueChoice`, then decorate the module with ``@basic_unit``. For example, ``self.conv = MyConv(kernel_size=nn.ValueChoice([1, 3, 5]))``, here ``MyConv`` should be decorated.
 * When a module cannot be successfully parsed to a subgraph, decorate the module with ``@basic_unit``. The parse failure could be due to complex control flow. Currently Retiarii does not support adhoc loop, if there is adhoc loop in a module's forward, this class should be decorated as serializable module. For example, the following ``MyModule`` should be decorated.
@@ -43,12 +41,12 @@ Users may find ``@basic_unit`` helpful in some cases. ``@basic_unit`` here means
        for i in range(10): # <- adhoc loop
          ...
-* Some inline mutation APIs require their handled module to be decorated with ``@basic_unit``. For example, user-defined module that is provided to ``LayerChoice`` as a candidate op should be decorated.
+* Some inline mutation APIs require their handled module to be decorated with ``@basic_unit``. For example, user-defined module that is provided to :class:`nni.retiarii.nn.pytorch.LayerChoice` as a candidate op should be decorated.
 Three steps are need to use graph-based execution engine.
 1. Remove ``@nni.retiarii.model_wrapper`` if there is any in your model.
-2. Add ``config.execution_engine = 'base'`` to ``RetiariiExeConfig``. The default value of ``execution_engine`` is 'py', which means pure-python execution engine.
+2. Add ``config.execution_engine = 'base'`` to :class:`nni.retiarii.experiment.pytorch.RetiariiExeConfig`. The default value of ``execution_engine`` is 'py', which means pure-python execution engine.
 3. Add ``@basic_unit`` when necessary following the above guidelines.
 For exporting top models, graph-based execution engine supports exporting source code for top models by running ``exp.export_top_models(formatter='code')``.
@@ -106,15 +104,3 @@ Advanced users can also implement their own trainers by inheriting ``MultiModelS
 Sometimes, a mutated model cannot be executed (e.g., due to shape mismatch). When a trial running multiple models contains 
 a bad model, CGO execution engine will re-run each model independently in separate trials without cross-model optimizations.
-References
-^^^^^^^^^^
-..  autoclass:: nni.retiarii.evaluator.pytorch.cgo.evaluator.MultiModelSupervisedLearningModule
-    :members:
-..  autoclass:: nni.retiarii.evaluator.pytorch.cgo.evaluator.Classification
-    :members:
-..  autoclass:: nni.retiarii.evaluator.pytorch.cgo.evaluator.Regression
-    :members:
--- a/docs/source/nas/exploration_strategy.rst
+++ b/docs/source/nas/exploration_strategy.rst
--- a/docs/source/nas/hardware_aware_nas.rst
+++ b/docs/source/nas/hardware_aware_nas.rst
@@ -51,7 +51,7 @@ In ``exp_config``, ``dummy_input`` is required for tracing shape info.
 End-to-end ProxylessNAS with Latency Constraints
 ------------------------------------------------
-`ProxylessNAS <https://arxiv.org/pdf/1812.00332.pdf>`__ is a hardware-aware one-shot NAS algorithm. ProxylessNAS applies the expected latency of the model to build a differentiable metric and design efficient neural network architectures for hardware. The latency loss is added as a regularization term for architecture parameter optimization. In this example, nn-Meter provides a latency estimator to predict expected latency for the mixed operation on other types of mobile and edge hardware. 
+`ProxylessNAS <https://arxiv.org/abs/1812.00332>`__ is a hardware-aware one-shot NAS algorithm. ProxylessNAS applies the expected latency of the model to build a differentiable metric and design efficient neural network architectures for hardware. The latency loss is added as a regularization term for architecture parameter optimization. In this example, nn-Meter provides a latency estimator to predict expected latency for the mixed operation on other types of mobile and edge hardware. 
 To run the one-shot ProxylessNAS demo, first install nn-Meter by running:

--- a/docs/source/nas/index.rst
+++ b/docs/source/nas/index.rst
-Retiarii for Neural Architecture Search
+Neural Architecture Search
-=======================================
+==========================
 .. toctree::
   :hidden:
-   :titlesonly:
-   Quick Start <../tutorials/cp_hello_nas_quickstart>
+   Quickstart </tutorials/hello_nas>
   construct_space
   exploration_strategy
   evaluator
@@ -15,138 +14,74 @@ Retiarii for Neural Architecture Search
 .. note:: PyTorch is the **only supported framework on Retiarii**. Inquiries of NAS support on Tensorflow is in `this discussion <https://github.com/microsoft/nni/discussions/4605>`__. If you intend to run NAS with DL frameworks other than PyTorch and Tensorflow, please `open new issues <https://github.com/microsoft/nni/issues>`__ to let us know.
-.. Using rubric to prevent the section heading to be include into toc
+Basics
+------
-.. rubric:: Motivation
 Automatic neural architecture search is playing an increasingly important role in finding better models. Recent research has proven the feasibility of automatic NAS and has led to models that beat many manually designed and tuned models. Representative works include `NASNet <https://arxiv.org/abs/1707.07012>`__, `ENAS <https://arxiv.org/abs/1802.03268>`__, `DARTS <https://arxiv.org/abs/1806.09055>`__, `Network Morphism <https://arxiv.org/abs/1806.10282>`__, and `Evolution <https://arxiv.org/abs/1703.01041>`__. In addition, new innovations continue to emerge.
-However, it is pretty hard to use existing NAS work to help develop common DNN models. Therefore, we designed `Retiarii <https://www.usenix.org/system/files/osdi20-zhang_quanlu.pdf>`__, a novel NAS/HPO framework, and implemented it in NNI. It helps users easily construct a model space (or search space, tuning space), and utilize existing NAS algorithms. The framework also facilitates NAS innovation and is used to design new NAS algorithms.
+High-level speaking, aiming to solve any particular task with neural architecture search typically requires: search space design, search strategy selection, and performance evaluation. The three components work together with the following loop (from the famous `NAS survey <https://arxiv.org/abs/1808.05377>`__):
-In summary, we highlight the following features for Retiarii:
+.. image:: ../../img/nas_abstract_illustration.png
+   :align: center
+   :width: 700
-* Simple APIs are provided for defining model search space within a deep learning model.
+In this figure:
-* SOTA NAS algorithms are built-in to be used for exploring model search space.
-* System-level optimizations are implemented for speeding up the exploration.
-.. rubric:: Overview
+* *Model search space*  means a set of models from which the best model is explored/searched. Sometimes we use *search space* or *model space* in short.
+* *Exploration strategy* is the algorithm that is used to explore a model search space. Sometimes we also call it *search strategy*.
+* *Model evaluator* is responsible for training a model and evaluating its performance.
-High-level speaking, aiming to solve any particular task with neural architecture search typically requires: search space design, search strategy selection, and performance evaluation. The three components work together with the following loop (the figure is from the famous `NAS survey <https://arxiv.org/abs/1808.05377>`__):
+The process is similar to :doc:`Hyperparameter Optimization </hpo/index>`, except that the target is the best architecture rather than hyperparameter. Concretely, an exploration strategy selects an architecture from a predefined search space. The architecture is passed to a performance evaluation to get a score, which represents how well this architecture performs on a particular task. This process is repeated until the search process is able to find the best architecture.
-.. image:: ../../img/nas_abstract_illustration.png
+Key Features
+------------
+The current NAS framework in NNI is powered by the research of `Retiarii: A Deep Learning Exploratory-Training Framework <https://www.usenix.org/system/files/osdi20-zhang_quanlu.pdf>`__, where we highlight the following features:
+* :doc:`Simple APIs to construct search space easily <construct_space>`
+* :doc:`SOTA NAS algorithms to explore search space <exploration_strategy>`
+* :doc:`Experiment backend support to scale up experiments on large-scale AI platforms </experiment/overview>`
+Why NAS with NNI
+----------------
+We list out the three perspectives where NAS can be particularly challegning without NNI. NNI provides solutions to relieve users' engineering effort when they want to try NAS techniques in their own scenario.
+Search Space Design
+^^^^^^^^^^^^^^^^^^^
+The search space defines which architectures can be represented in principle. Incorporating prior knowledge about typical properties of architectures well-suited for a task can reduce the size of the search space and simplify the search. However, this also introduces a human bias, which may prevent finding novel architectural building blocks that go beyond the current human knowledge. Search space design can be very challenging for beginners, who might not possess the experience to balance the richness and simplicity.
+In NNI, we provide a wide range of APIs to build the search space. There are :doc:`high-level APIs <construct_space>`, that enables incorporating human knowledge about what makes a good architecture or search space. There are also :doc:`low-level APIs <mutator>`, that is a list of primitives to construct a network from operator to operator.
+Exploration strategy
+^^^^^^^^^^^^^^^^^^^^
+The exploration strategy details how to explore the search space (which is often exponentially large). It encompasses the classical exploration-exploitation trade-off since, on the one hand, it is desirable to find well-performing architectures quickly, while on the other hand, premature convergence to a region of suboptimal architectures should be avoided. The "best" exploration strategy for a particular scenario is usually found via trial-and-error. As many state-of-the-art strategies are implemented with their own code-base, it becomes very troublesome to switch from one to another.
+In NNI, we have also provided :doc:`a list of strategies <exploration_strategy>`. Some of them are powerful yet time consuming, while others might be suboptimal but really efficient. Given that all strategies are implemented with a unified interface, users can always find one that matches their need.
+Performance estimation
+^^^^^^^^^^^^^^^^^^^^^^
+The objective of NAS is typically to find architectures that achieve high predictive performance on unseen data. Performance estimation refers to the process of estimating this performance. The problem with performance estimation is mostly its scalability, i.e., how can I run and manage multiple trials simultaneously.
+In NNI, we standardize this process is implemented with :doc:`evaluator <evaluator>`, which is responsible of estimating a model's performance. The choices of evaluators also range from the simplest option, e.g., to perform a standard training and validation of the architecture on data, to complex configurations and implementations. Evaluators are run in *trials*, where trials can be spawn onto distributed platforms with our powerful :doc:`training service </experiment/training_service>`.
+Tutorials
+---------
+To start using NNI NAS framework, we recommend at least going through the following tutorials:
+* :doc:`Quickstart </tutorials/hello_nas>`
+* :doc:`construct_space`
+* :doc:`exploration_strategy`
+* :doc:`evaluator`
+Resources
+---------
+The following articles will help with a better understanding of the current arts of NAS:
-To be consistent, we will use the following terminologies throughout our documentation:
+* `Neural Architecture Search: A Survey <https://arxiv.org/abs/1808.05377>`__
+* `A Comprehensive Survey of Neural Architecture Search: Challenges and Solutions <https://arxiv.org/abs/2006.02903>`__
-* *Model search space*: it means a set of models from which the best model is explored/searched. Sometimes we use *search space* or *model space* in short.
-* *Exploration strategy*: the algorithm that is used to explore a model search space. Sometimes we also call it *search strategy*.
-* *Model evaluator*: it is used to train a model and evaluate the model's performance.
-Concretely, an exploration strategy selects an architecture from a predefined search space. The architecture is passed to a performance evaluation to get a score, which represents how well this architecture performs on a particular task. This process is repeated until the search process is able to find the best architecture.
-During such process, we list out the core engineering challenges (which are also pointed out by the famous `NAS survey <https://arxiv.org/abs/1808.05377>`__) and the solutions NNI has provided to address them:
-* **Search space design:** The search space defines which architectures can be represented in principle. Incorporating prior knowledge about typical properties of architectures well-suited for a task can reduce the size of the search space and simplify the search. However, this also introduces a human bias, which may prevent finding novel architectural building blocks that go beyond the current human knowledge. In NNI, we provide a wide range of APIs to build the search space. There are :doc:`high-level APIs <construct_space>`, that enables incorporating human knowledge about what makes a good architecture or search space. There are also :doc:`low-level APIs <mutator>`, that is a list of primitives to construct a network from operator to operator.
-* **Exploration strategy:** The exploration strategy details how to explore the search space (which is often exponentially large). It encompasses the classical exploration-exploitation trade-off since, on the one hand, it is desirable to find well-performing architectures quickly, while on the other hand, premature convergence to a region of suboptimal architectures should be avoided. In NNI, we have also provided :doc:`a list of strategies <exploration_strategy>`. Some of them are powerful, but time consuming, while others might be suboptimal but really efficient. Users can always find one that matches their need.
-* **Performance estimation / evaluator:** The objective of NAS is typically to find architectures that achieve high predictive performance on unseen data. Performance estimation refers to the process of estimating this performance. In NNI, this process is implemented with :doc:`evaluator <evaluator>`, which is responsible of estimating a model's performance. The choices of evaluators also range from the simplest option, e.g., to perform a standard training and validation of the architecture on data, to complex configurations and implementations.
-.. rubric:: Writing Model Space
-The following APIs are provided to ease the engineering effort of writing a new search space.
-.. list-table::
-   :header-rows: 1
-   :widths: auto
-   * - Name
-     - Category
-     - Brief Description
-   * - :ref:`nas-layer-choice`
-     - :ref:`Mutation Primitives <mutation-primitives>`
-     - Select from some PyTorch modules
-   * - :ref:`nas-input-choice`
-     - :ref:`Mutation Primitives <mutation-primitives>`
-     - Select from some inputs (tensors)
-   * - :ref:`nas-value-choice`
-     - :ref:`Mutation Primitives <mutation-primitives>`
-     - Select from some candidate values
-   * - :ref:`nas-repeat`
-     - :ref:`Mutation Primitives <mutation-primitives>`
-     - Repeat a block by a variable number of times
-   * - :ref:`nas-cell`
-     - :ref:`Mutation Primitives <mutation-primitives>`
-     - Cell structure popularly used in literature
-   * - :ref:`nas-cell-101`
-     - :ref:`Mutation Primitives <mutation-primitives>`
-     - Cell structure (variant) proposed by NAS-Bench-101
-   * - :ref:`nas-cell-201`
-     - :ref:`Mutation Primitives <mutation-primitives>`
-     - Cell structure (variant) proposed by NAS-Bench-201
-   * - :ref:`nas-autoactivation`
-     - :ref:`Hyper-modules <hyper-modules>`
-     - Searching for activation functions
-   * - :doc:`Mutator <mutator>`
-     - :doc:`mutator`
-     - Flexible mutations on graphs
-.. rubric:: Exploring the Search Space
-We provide the following (built-in) algorithms to explore the user-defined search space.
-.. list-table::
-   :header-rows: 1
-   :widths: auto
-   * - Name
-     - Category
-     - Brief Description
-   * - :ref:`random-strategy`
-     - :ref:`Multi-trial <multi-trial-nas>`
-     - Randomly sample an architecture each time
-   * - :ref:`grid-search-strategy`
-     - :ref:`Multi-trial <multi-trial-nas>`
-     - Traverse the search space and try all possibilities
-   * - :ref:`regularized-evolution-strategy`
-     - :ref:`Multi-trial <multi-trial-nas>`
-     - Evolution algorithm for NAS. `Reference <https://arxiv.org/abs/1802.01548>`__
-   * - :ref:`tpe-strategy`
-     - :ref:`Multi-trial <multi-trial-nas>`
-     - Tree-structured Parzen Estimator (TPE). `Reference <https://papers.nips.cc/paper/4443-algorithms-for-hyper-parameter-optimization.pdf>`__
-   * - :ref:`policy-based-rl-strategy`
-     - :ref:`Multi-trial <multi-trial-nas>`
-     - Policy-based reinforcement learning, based on implementation of tianshou. `Reference <https://arxiv.org/abs/1611.01578>`__
-   * - :ref:`darts-strategy`
-     - :ref:`One-shot <one-shot-nas>`
-     - Continuous relaxation of the architecture representation, allowing efficient search of the architecture using gradient descent. `Reference <https://arxiv.org/abs/1806.09055>`__
-   * - :ref:`enas-strategy`
-     - :ref:`One-shot <one-shot-nas>`
-     - RL controller learns to generate the best network on a super-net. `Reference <https://arxiv.org/abs/1802.03268>`__
-   * - :ref:`fbnet-strategy`
-     - :ref:`One-shot <one-shot-nas>`
-     - Choose the best block by using Gumbel Softmax random sampling and differentiable training. `Reference <https://arxiv.org/abs/1812.03443>`__
-   * - :ref:`spos-strategy`
-     - :ref:`One-shot <one-shot-nas>`
-     - Train a super-net with uniform path sampling. `Reference <https://arxiv.org/abs/1904.00420>`__
-   * - :ref:`proxylessnas-strategy`
-     - :ref:`One-shot <one-shot-nas>`
-     - A low-memory-consuming optimized version of differentiable architecture search. `Reference <https://arxiv.org/abs/1812.00332>`__
-.. rubric:: Evaluators
-The evaluator APIs can be used to build performance assessment component of your neural architecture search process.
-.. list-table::
-   :header-rows: 1
-   :widths: auto
-   * - Name
-     - Type
-     - Brief Description
-   * - :ref:`functional-evaluator`
-     - General
-     - Evaluate with any Python function
-   * - :ref:`classification-evaluator`
-     - Built upon `PyTorch Lightning <https://www.pytorchlightning.ai/>`__
-     - For classification tasks
-   * - :ref:`regression-evaluator`
-     - Built upon `PyTorch Lightning <https://www.pytorchlightning.ai/>`__
-     - For regression tasks
--- a/docs/source/nas/mutator.rst
+++ b/docs/source/nas/mutator.rst
-Construct Space with Mutators
+Construct Space with Mutator
-=============================
+============================
-Besides the inline mutation APIs demonstrated :ref:`above <mutation-primitives>`, NNI provides a more general approach to express a model space, i.e., *Mutator*, to cover more complex model spaces. Those inline mutation APIs are also implemented with mutator in the underlying system, which can be seen as a special case of model mutation.
+Besides the mutation primitives demonstrated in the :doc:`basic tutorial <construct_space>`, NNI provides a more general approach to express a model space, i.e., *Mutator*, to cover more complex model spaces. The high-level APIs are also implemented with mutator in the underlying system, which can be seen as a special case of model mutation.
-.. note:: Mutator and inline mutation APIs cannot be used together.
+.. warning:: Mutator and inline mutation APIs can NOT be used together.
 A mutator is a piece of logic to express how to mutate a given model. Users are free to write their own mutators. Then a model space is expressed with a base model and a list of mutators. A model in the model space is sampled by applying the mutators on the base model one after another. An example is shown below.
@@ -18,7 +18,7 @@ A mutator is a piece of logic to express how to mutate a given model. Users are
 Write a mutator
 ---------------
-User-defined mutator should inherit ``Mutator`` class, and implement mutation logic in the member function ``mutate``.
+User-defined mutator should inherit :class:`nni.retiarii.Mutator` class, and implement mutation logic in the member function :meth:`nni.retiarii.Mutator.mutate`.
 .. code-block:: python
@@ -35,9 +35,9 @@ User-defined mutator should inherit ``Mutator`` class, and implement mutation lo
        chosen_op = self.choice(self.candidate_op_list)
        node.update_operation(chosen_op.type, chosen_op.params)
-The input of ``mutate`` is graph IR (Intermediate Representation) of the base model (please refer to `here <./ApiReference.rst>`__ for the format and APIs of the IR), users can mutate the graph using the graph's member functions (e.g., ``get_nodes_by_label``, ``update_operation``). The mutation operations can be combined with the API ``self.choice``, in order to express a set of possible mutations. In the above example, the node's operation can be changed to any operation from ``candidate_op_list``.
+The input of :meth:`nni.retiarii.Mutator.mutate` is graph IR (Intermediate Representation) of the base model, users can mutate the graph using the graph's member functions (e.g., :meth:`nni.retiarii.Model.get_nodes_by_label`). The mutation operations can be combined with the API ``self.choice``, in order to express a set of possible mutations. In the above example, the node's operation can be changed to any operation from ``candidate_op_list``.
-Use placeholder to make mutation easier: ``nn.Placeholder``. If you want to mutate a subgraph or node of your model, you can define a placeholder in this model to represent the subgraph or node. Then, use mutator to mutate this placeholder to make it real modules.
+Use placeholder to make mutation easier: :class:`nni.retiarii.nn.pytorch.Placeholder`. If you want to mutate a subgraph or node of your model, you can define a placeholder in this model to represent the subgraph or node. Then, use mutator to mutate this placeholder to make it real modules.
 .. code-block:: python
@@ -62,51 +62,3 @@ Starting an experiment is almost the same as using inline mutation APIs. The onl
  exp_config.max_trial_number = 10
  exp_config.training_service.use_active_gpu = False
  exp.run(exp_config, 8081)
-References
----------
-Placeholder
-^^^^^^^^^^^
-..  autoclass:: nni.retiarii.nn.pytorch.Placeholder
-    :members:
-    :noindex:
-Mutator
-^^^^^^^
-..  autoclass:: nni.retiarii.Mutator
-    :members:
-    :noindex:
-..  autoclass:: nni.retiarii.Sampler
-    :members:
-    :noindex:
-..  autoclass:: nni.retiarii.InvalidMutation
-    :members:
-    :noindex:
-Graph
-^^^^^
-..  autoclass:: nni.retiarii.Model
-    :members:
-    :noindex:
-..  autoclass:: nni.retiarii.Graph
-    :members:
-    :noindex:
-..  autoclass:: nni.retiarii.Node
-    :members:
-    :noindex:
-..  autoclass:: nni.retiarii.Edge
-    :members:
-    :noindex:
-..  autoclass:: nni.retiarii.Operation
-    :members:
-    :noindex:
--- a/docs/source/nas/serialization.rst
+++ b/docs/source/nas/serialization.rst
 Serialization
 =============
-In multi-trial NAS, a sampled model should be able to be executed on a remote machine or a training platform (e.g., AzureML, OpenPAI). "Serialization" enables re-instantiation of model evaluator in another process or machine, such that, both the model and its model evaluator should be correctly serialized. To make NNI correctly serialize model evaluator, users should apply ``nni.trace`` on some of their functions and objects. API references can be found in :func:`nni.trace`.
+In multi-trial NAS, a sampled model should be able to be executed on a remote machine or a training platform (e.g., AzureML, OpenPAI). "Serialization" enables re-instantiation of model evaluator in another process or machine, such that, both the model and its model evaluator should be correctly serialized. To make NNI correctly serialize model evaluator, users should apply :func:`nni.trace <nni.common.serializer.trace>` on some of their functions and objects. API references can be found in :func:`nni.trace <nni.common.serializer.trace>`.
 Serialization is implemented as a combination of `json-tricks <https://json-tricks.readthedocs.io/en/latest/>`_ and `cloudpickle <https://github.com/cloudpipe/cloudpickle>`_. Essentially, it is json-tricks, that is a enhanced version of Python JSON, enabling handling of serialization of numpy arrays, date/times, decimal, fraction and etc. The difference lies in the handling of class instances. Json-tricks deals with class instances with ``__dict__`` and ``__class__``, which in most of our cases are not reliable (e.g., datasets, dataloaders). Rather, our serialization deals with class instances with two methods:
-1. If the class / factory that creates the object is decorated with ``nni.trace``, we can serialize the class / factory function, along with the parameters, such that the instance can be re-instantiated.
+1. If the class / factory that creates the object is decorated with :func:`nni.trace <nni.common.serializer.trace>`, we can serialize the class / factory function, along with the parameters, such that the instance can be re-instantiated.
 2. Otherwise, cloudpickle is used to serialize the object into a binary.
-The recommendation is, unless you are absolutely certain that there is no problem and extra burden to serialize the object into binary, always add ``nni.trace``. In most cases, it will be more clean and neat, and enables possibilities such as mutation of parameters (will be supported in future).
+The recommendation is, unless you are absolutely certain that there is no problem and extra burden to serialize the object into binary, always add :func:`nni.trace <nni.common.serializer.trace>`. In most cases, it will be more clean and neat, and enables possibilities such as mutation of parameters (will be supported in future).
 .. warning::
    **What will happen if I forget to "trace" my objects?**
-    It is likely that the program can still run. NNI will try to serialize the untraced object into a binary. It might fail in complex cases. For example, when the object is too large. Even if it succeeds, the result might be a substantially large object. For example, if you forgot to add ``nni.trace`` on ``MNIST``, the MNIST dataset object wil be serialized into binary, which will be dozens of megabytes because the object has the whole 60k images stored inside. You might see warnings and even errors when running experiments. To avoid such issues, the easiest way is to always remember to add ``nni.trace`` to non-primitive objects.
+    It is likely that the program can still run. NNI will try to serialize the untraced object into a binary. It might fail in complex cases. For example, when the object is too large. Even if it succeeds, the result might be a substantially large object. For example, if you forgot to add :func:`nni.trace <nni.common.serializer.trace>` on ``MNIST``, the MNIST dataset object wil be serialized into binary, which will be dozens of megabytes because the object has the whole 60k images stored inside. You might see warnings and even errors when running experiments. To avoid such issues, the easiest way is to always remember to add :func:`nni.trace <nni.common.serializer.trace>` to non-primitive objects.
-.. note:: In Retiarii, serializer will throw exception when one of an single object in the recursive serialization is larger than 64 KB when binary serialized. This indicates that such object needs to be wrapped by ``nni.trace``. In rare cases, if you insist on pickling large data, the limit can be overridden by setting an environment variable ``PICKLE_SIZE_LIMIT``, whose unit is byte. Please note that even if the experiment might be able to run, this can still cause performance issues and even the crash of NNI experiment.
+.. note:: In Retiarii, serializer will throw exception when one of an single object in the recursive serialization is larger than 64 KB when binary serialized. This indicates that such object needs to be wrapped by :func:`nni.trace <nni.common.serializer.trace>`. In rare cases, if you insist on pickling large data, the limit can be overridden by setting an environment variable ``PICKLE_SIZE_LIMIT``, whose unit is byte. Please note that even if the experiment might be able to run, this can still cause performance issues and even the crash of NNI experiment.
 To trace a function or class, users can use decorator like,
@@ -26,11 +26,15 @@ To trace a function or class, users can use decorator like,
    class MyClass:
        ...
-Inline trace that traces instantly on the object instantiation or function invoke is also acceptable: ``nni.trace(MyClass)(parameters)``.
+Inline trace that traces instantly on the object instantiation or function invoke is also acceptable:
-Assuming a class ``cls`` is already traced, when it is serialized, its class type along with initialization parameters will be dumped. As the parameters are possibly class instances (if not primitive types like ``int`` and ``str``), their serialization will be a similar problem. We recommend decorate them with ``nni.trace`` as well. In other words, ``nni.trace`` should be applied recursively if necessary.
+.. code-block:: python
+   nni.trace(MyClass)(parameters)
+Assuming a class ``cls`` is already traced, when it is serialized, its class type along with initialization parameters will be dumped. As the parameters are possibly class instances (if not primitive types like ``int`` and ``str``), their serialization will be a similar problem. We recommend decorate them with :func:`nni.trace <nni.common.serializer.trace>` as well. In other words, :func:`nni.trace <nni.common.serializer.trace>` should be applied recursively if necessary.
-Below is an example, ``transforms.Compose``, ``transforms.Normalize``, and ``MNIST`` are serialized manually using ``nni.trace``. ``nni.trace`` takes a class / function as its argument, and returns a wrapped class and function that has the same behavior with the original class / function. The usage of the wrapped class / function is also identical to the original one, except that the arguments are recorded. No need to apply ``nni.trace`` to ``pl.Classification`` and ``pl.DataLoader`` because they are already traced.
+Below is an example, ``transforms.Compose``, ``transforms.Normalize``, and ``MNIST`` are serialized manually using :func:`nni.trace <nni.common.serializer.trace>`. :func:`nni.trace <nni.common.serializer.trace>` takes a class / function as its argument, and returns a wrapped class and function that has the same behavior with the original class / function. The usage of the wrapped class / function is also identical to the original one, except that the arguments are recorded. No need to apply :func:`nni.trace <nni.common.serializer.trace>` to :class:`pl.Classification <nni.retiarii.evaluator.pytorch.Classification>` and :class:`pl.DataLoader <nni.retiarii.evaluator.pytorch.DataLoader>` because they are already traced.
 .. code-block:: python
@@ -57,6 +61,6 @@ Below is an example, ``transforms.Compose``, ``transforms.Normalize``, and ``MNI
    **What's the relationship between model_wrapper, basic_unit and nni.trace?**
-    They are fundamentally different. ``model_wrapper`` is used to wrap a base model (search space), ``basic_unit`` to annotate a module as primitive. ``nni.trace`` is to enable serialization of general objects. Though they share similar underlying implementations, but do keep in mind that you will experience errors if you mix them up.
+    They are fundamentally different. :func:`model_wrapper <nni.retiarii.model_wrapper>` is used to wrap a base model (search space), :func:`basic_unit <nni.retiarii.basic_unit>` to annotate a module as primitive. :func:`nni.trace <nni.common.serializer.trace>` is to enable serialization of general objects. Though they share similar underlying implementations, but do keep in mind that you will experience errors if you mix them up.
-    .. seealso:: Please refer to API reference of :meth:`nni.retiarii.model_wrapper`, :meth:`nni.retiarii.basic_unit`, and :meth:`nni.trace`.
+    Please refer to API reference of :meth:`nni.retiarii.model_wrapper`, :meth:`nni.retiarii.basic_unit`, and :func:`nni.trace <nni.common.serializer.trace>`.
--- a/docs/source/reference/nas.rst
+++ b/docs/source/reference/nas.rst
-Neural Architecture Search
-==========================
-nni.retiarii
------------
-..  automodule:: nni.retiarii
-    :imported-members:
-    :members:
-nni.retiarii.codegen
--------------------
-..  automodule:: nni.retiarii.codegen
-    :imported-members:
-    :members:
-nni.retiarii.converter
----------------------
-..  automodule:: nni.retiarii.converter
-    :imported-members:
-    :members:
-nni.retiarii.evaluator
----------------------
-..  automodule:: nni.retiarii.evaluator
-    :imported-members:
-    :members:
-..  automodule:: nni.retiarii.evaluator.pytorch
-    :imported-members:
-    :members:
-    :exclude-members: Trainer, DataLoader
-..  autoclass:: nni.retiarii.evaluator.pytorch.Trainer
-..  autoclass:: nni.retiarii.evaluator.pytorch.DataLoader
-nni.retiarii.execution
----------------------
-..  automodule:: nni.retiarii.execution
-    :imported-members:
-    :members:
-    :undoc-members:
-nni.retiarii.experiment.pytorch
-------------------------------
-..  automodule:: nni.retiarii.experiment.pytorch
-    :members:
-nni.retiarii.nn.pytorch
-----------------------
-..  automodule:: nni.retiarii.nn.pytorch.api
-    :imported-members:
-    :members:
-    :noindex:
-..  automodule:: nni.retiarii.nn.pytorch.component
-    :imported-members:
-    :members:
-    :noindex:
-..  automodule:: nni.retiarii.nn.pytorch.hypermodule
-    :imported-members:
-    :members:
-    :noindex:
-..  automodule:: nni.retiarii.nn.pytorch.mutation_utils
-    :imported-members:
-    :members:
-nni.retiarii.oneshot
--------------------
-..  automodule:: nni.retiarii.oneshot
-    :imported-members:
-    :members:
-nni.retiarii.operation_def
--------------------------
-..  automodule:: nni.retiarii.operation_def
-    :imported-members:
-    :members:
-nni.retiarii.strategy
---------------------
-..  automodule:: nni.retiarii.strategy
-    :imported-members:
-    :members:
-nni.retiarii.utils
------------------
-..  automodule:: nni.retiarii.utils
-    :members:
--- a/docs/source/reference/nas/evaluator.rst
+++ b/docs/source/reference/nas/evaluator.rst
+Evaluator
+=========
+FunctionalEvaluator
+-------------------
+..  autoclass:: nni.retiarii.evaluator.FunctionalEvaluator
+    :members:
+Classification
+--------------
+..  autoclass:: nni.retiarii.evaluator.pytorch.Classification
+    :members:
+Regression
+----------
+..  autoclass:: nni.retiarii.evaluator.pytorch.Regression
+    :members:
+Utilities
+---------
+..  autoclass:: nni.retiarii.evaluator.pytorch.Trainer
+..  autoclass:: nni.retiarii.evaluator.pytorch.DataLoader
+Customization
+-------------
+..  autoclass:: nni.retiarii.evaluator.pytorch.Lightning
+..  autoclass:: nni.retiarii.evaluator.pytorch.LightningModule
+Cross-graph Optimization (experimental)
+---------------------------------------
+..  autoclass:: nni.retiarii.evaluator.pytorch.cgo.evaluator.MultiModelSupervisedLearningModule
+    :members:
+..  autoclass:: nni.retiarii.evaluator.pytorch.cgo.evaluator.Classification
+    :members:
+..  autoclass:: nni.retiarii.evaluator.pytorch.cgo.evaluator.Regression
+    :members:
--- a/docs/source/reference/nas/index.rst
+++ b/docs/source/reference/nas/index.rst
+Neural Architecture Search
+==========================
+.. toctree::
+   :maxdepth: 2
+   search_space
+   strategy
+   evaluator
+   Others <others>
--- a/docs/source/reference/nas/others.rst
+++ b/docs/source/reference/nas/others.rst
+Uncategorized Modules
+=====================
+Experiment
+----------
+..  autoclass:: nni.retiarii.experiment.pytorch.RetiariiExeConfig
+    :members:
+..  autoclass:: nni.retiarii.experiment.pytorch.RetiariiExperiment
+    :members:
+NAS Benchmarks
+--------------
+.. _nas-bench-101-reference:
+NAS-Bench-101
+^^^^^^^^^^^^^
+.. automodule:: nni.nas.benchmarks.nasbench101
+   :members:
+   :imported-members:
+.. _nas-bench-201-reference:
+NAS-Bench-201
+^^^^^^^^^^^^^
+.. automodule:: nni.nas.benchmarks.nasbench201
+   :members:
+   :imported-members:
+.. _nds-reference:
+NDS
+^^^
+.. automodule:: nni.nas.benchmarks.nds
+   :members:
+   :imported-members:
+Retrain (Architecture Evaluation)
+---------------------------------
+..  autofunction:: nni.retiarii.fixed_arch
+Utilities
+---------
+..  autofunction:: nni.retiarii.basic_unit
+..  autofunction:: nni.retiarii.model_wrapper
+..  automodule:: nni.retiarii.nn.pytorch.mutation_utils
+    :imported-members:
+    :members:
+..  automodule:: nni.retiarii.utils
+    :members:
--- a/docs/source/reference/nas/search_space.rst
+++ b/docs/source/reference/nas/search_space.rst
+Search Space
+============
+.. _mutation-primitives:
+Mutation Pritimives
+-------------------
+LayerChoice
+^^^^^^^^^^^
+.. autoclass:: nni.retiarii.nn.pytorch.LayerChoice
+   :members:
+InputChoice
+^^^^^^^^^^^
+.. autoclass:: nni.retiarii.nn.pytorch.InputChoice
+   :members:
+.. autoclass:: nni.retiarii.nn.pytorch.ChosenInputs
+   :members:
+ValueChoice
+^^^^^^^^^^^
+.. autoclass:: nni.retiarii.nn.pytorch.ValueChoice
+   :members:
+   :inherited-members: Module
+ModelParameterChoice
+^^^^^^^^^^^^^^^^^^^^
+.. autoclass:: nni.retiarii.nn.pytorch.ModelParameterChoice
+   :members:
+   :inherited-members: Module
+Repeat
+^^^^^^
+.. autoclass:: nni.retiarii.nn.pytorch.Repeat
+   :members:
+Cell
+^^^^
+.. autoclass:: nni.retiarii.nn.pytorch.Cell
+   :members:
+NasBench101Cell
+^^^^^^^^^^^^^^^
+.. autoclass:: nni.retiarii.nn.pytorch.NasBench101Cell
+   :members:
+NasBench201Cell
+^^^^^^^^^^^^^^^
+.. autoclass:: nni.retiarii.nn.pytorch.NasBench201Cell
+   :members:
+.. _hyper-modules:
+Hyper-module Library (experimental)
+-----------------------------------
+AutoActivation
+^^^^^^^^^^^^^^
+..  autoclass:: nni.retiarii.nn.pytorch.AutoActivation
+    :members:
+Mutators (advanced)
+-------------------
+Mutator
+^^^^^^^
+..  autoclass:: nni.retiarii.Mutator
+    :members:
+..  autoclass:: nni.retiarii.Sampler
+    :members:
+..  autoclass:: nni.retiarii.InvalidMutation
+    :members:
+Placeholder
+^^^^^^^^^^^
+..  autoclass:: nni.retiarii.nn.pytorch.Placeholder
+    :members:
+Graph
+^^^^^
+..  autoclass:: nni.retiarii.Model
+    :members:
+..  autoclass:: nni.retiarii.Graph
+    :members:
+..  autoclass:: nni.retiarii.Node
+    :members:
+..  autoclass:: nni.retiarii.Edge
+    :members:
+..  autoclass:: nni.retiarii.Operation
+    :members:
--- a/docs/source/reference/nas/strategy.rst
+++ b/docs/source/reference/nas/strategy.rst
--- a/docs/source/reference/others.rst
+++ b/docs/source/reference/others.rst
 Uncategorized Modules
 =====================
+nni.common.serializer
+---------------------
+.. automodule:: nni.common.serializer
+    :members:
 nni.typehint
 ------------