Update Chinese Translation (#2871)

* New Crowdin translations by Github Action * remove removed files in en_US Co-authored-by: Crowdin Bot <support+bot@crowdin.com>

Update Chinese Translation (#2871)
* New Crowdin translations by Github Action * remove removed files in en_US Co-authored-by: Crowdin Bot <support+bot@crowdin.com>
0e49e659 · Junwei Sun · GitHub · 171400ee · 0e49e659 · 0e49e659
Unverified Commit 0e49e659 authored Sep 18, 2020 by Junwei Sun Committed by GitHub Sep 18, 2020
20 changed files
--- a/README_zh_CN.md
+++ b/README_zh_CN.md
@@ -10,7 +10,7 @@

 **NNI (Neural Network Intelligence)** 是一个轻量但强大的工具包，帮助用户**自动**的进行[特征工程](docs/zh_CN/FeatureEngineering/Overview.md)，[神经网络架构搜索](docs/zh_CN/NAS/Overview.md)，[超参调优](docs/zh_CN/Tuner/BuiltinTuner.md)以及[模型压缩](docs/zh_CN/Compressor/Overview.md)。

-NNI 管理自动机器学习 (AutoML) 的 Experiment，**调度运行**由调优算法生成的 Trial 任务来找到最好的神经网络架构和/或超参，支持**各种训练环境**，如[本机](docs/zh_CN/TrainingService/LocalMode.md)，[远程服务器](docs/zh_CN/TrainingService/RemoteMachineMode.md)，[OpenPAI](docs/zh_CN/TrainingService/PaiMode.md)，[Kubeflow](docs/zh_CN/TrainingService/KubeflowMode.md)，[基于 K8S 的 FrameworkController（如，AKS 等)](docs/zh_CN/TrainingService/FrameworkControllerMode.md)， [DLWorkspace (又称 DLTS)](docs/zh_CN/TrainingService/DLTSMode.md) 和其它云服务。
+NNI 管理自动机器学习 (AutoML) 的 Experiment，**调度运行**由调优算法生成的 Trial 任务来找到最好的神经网络架构和/或超参，支持**各种训练环境**，如[本机](docs/zh_CN/TrainingService/LocalMode. md)，[远程服务器](docs/zh_CN/TrainingService/RemoteMachineMode. md)，[OpenPAI](docs/zh_CN/TrainingService/PaiMode. md)，[Kubeflow](docs/zh_CN/TrainingService/KubeflowMode. md)，[基于 K8S 的 FrameworkController（如，AKS 等)](docs/zh_CN/TrainingService/FrameworkControllerMode. md)， [DLWorkspace](docs/zh_CN/TrainingService/DLTSMode. md) (又称 DLTS)</a>, [AML](docs/zh_CN/TrainingService/AMLMode.md) (Azure Machine Learning) 以及其它环境。

 ## **使用场景**

@@ -19,7 +19,7 @@ NNI 管理自动机器学习 (AutoML) 的 Experiment，**调度运行**由调优
 * 想要更容易**实现或试验新的自动机器学习算法**的研究员或数据科学家，包括：超参调优算法，神经网络搜索算法以及模型压缩算法。
 * 在机器学习平台中**支持自动机器学习**。

-### **[NNI v1.6 已发布！](https://github.com/microsoft/nni/releases) &nbsp;[<img width="48" src="docs/img/release_icon.png" />](#nni-released-reminder)**
+### **[NNI v1.8 已发布！](https://github.com/microsoft/nni/releases) &nbsp;[<img width="48" src="docs/img/release_icon.png" />](#nni-released-reminder)**

 ## **NNI 功能一览**

@@ -164,6 +164,7 @@ NNI 提供命令行工具以及友好的 WebUI 来管理训练的 Experiment。
      <ul>
        <li><a href="docs/zh_CN/TrainingService/LocalMode.md">本机</a></li>
        <li><a href="docs/zh_CN/TrainingService/RemoteMachineMode.md">远程计算机</a></li>
+        <li><a href="docs/zh_CN/TrainingService/AMLMode.md">AML(Azure Machine Learning)</a></li>
        <li><b>基于 Kubernetes 的平台</b></li>
            <ul><li><a href="docs/zh_CN/TrainingService/PaiMode.md">OpenPAI</a></li>
            <li><a href="docs/zh_CN/TrainingService/KubeflowMode.md">Kubeflow</a></li>
@@ -208,7 +209,7 @@ NNI 提供命令行工具以及友好的 WebUI 来管理训练的 Experiment。

 ### **安装**

-NNI 支持并在 Ubuntu >= 16.04, macOS >= 10.14.1, 和 Windows 10 >= 1809 通过了测试。 在 `python 64-bit >= 3.5` 的环境中，只需要运行 `pip install` 即可完成安装。
+NNI 支持并在 Ubuntu >= 16.04, macOS >= 10.14.1, 和 Windows 10 >= 1809 通过了测试。 在 `python 64-bit >= 3.6` 的环境中，只需要运行 `pip install` 即可完成安装。

 Linux 或 macOS

@@ -239,7 +240,7 @@ Linux 和 macOS 下 NNI 系统需求[参考这里](https://nni.readthedocs.io/zh
 * 通过克隆源代码下载示例。
   
   ```bash
-   git clone -b v1.6 https://github.com/Microsoft/nni.git
+   git clone -b v1.8 https://github.com/Microsoft/nni.git
   ```

 * 运行 MNIST 示例。

--- a/deployment/docker/README_zh_CN.md
+++ b/deployment/docker/README_zh_CN.md
@@ -4,15 +4,17 @@

 这是 NNI 项目的 Dockerfile 文件。 其中包含了 NNI 以及多个流行的深度学习框架。 在 `Ubuntu 16.04 LTS` 上进行过测试：

-    CUDA 9.0, CuDNN 7.0
-    numpy 1.14.3,scipy 1.1.0
-    TensorFlow-gpu 1.10.0
-    Keras 2.1.6
-    PyTorch 0.4.1
-    scikit-learn 0.20.0
+    CUDA 9.0
+    CuDNN 7.0
+    numpy 1.14.3
+    scipy 1.1.0
+    tensorflow-gpu 1.15.0
+    keras 2.1.6
+    torch 1.4.0
+    scikit-learn 0.23.2
    pandas 0.23.4
    lightgbm 2.2.2
-    NNI v0.7
+    nni
    

 此 Dockerfile 可作为定制的参考。

--- a/docs/zh_CN/Assessor/CustomizeAssessor.md
+++ b/docs/zh_CN/Assessor/CustomizeAssessor.md
@@ -54,7 +54,7 @@ assessor:

 注意在 **2** 中， `trial_history` 对象与 Trial 通过 `report_intermediate_result` 函数返回给 Assessor 的对象完全一致。

-Assessor 的工作目录是`<home>/nni/experiments/<experiment_id>/log` 可从环境变量 `NNI_LOG_DIRECTORY` 中获取。
+Assessor 的工作目录是`<home>/nni-experiments/<experiment_id>/log` 可从环境变量 `NNI_LOG_DIRECTORY` 中获取。

 更多示例，可参考：


--- a/docs/zh_CN/CommunitySharings/HpoComparision.md
+++ b/docs/zh_CN/CommunitySharings/HpoComparision.md
 # 超参数优化的对比
-
 *匿名作者*

 超参优化算法（HPO）在几个问题上的对比。

 超参数优化算法如下：

- [Random Search（随机搜索）](../Tuner/BuiltinTuner.md)
- [Grid Search（遍历搜索）](../Tuner/BuiltinTuner.md)
+- [Random Search](../Tuner/BuiltinTuner.md)
+- [Grid Search](../Tuner/BuiltinTuner.md)
 - [Evolution](../Tuner/BuiltinTuner.md)
- [Anneal（退火算法）](../Tuner/BuiltinTuner.md)
+- [Anneal](../Tuner/BuiltinTuner.md)
 - [Metis](../Tuner/BuiltinTuner.md)
 - [TPE](../Tuner/BuiltinTuner.md)
 - [SMAC](../Tuner/BuiltinTuner.md)
@@ -20,15 +19,16 @@

 环境：

-    OS: Linux Ubuntu 16.04 LTS
-    CPU: Intel(R) Xeon(R) CPU E5-2690 v3 @ 2.60GHz 2600 MHz
-    Memory: 112 GB
-    NNI Version: v0.7
-    NNI 模式(local|pai|remote): local
-    Python 版本: 3.6
-    使用的虚拟环境: Conda
-    是否在 Docker 中运行: no
-    
+```
+OS: Linux Ubuntu 16.04 LTS
+CPU: Intel(R) Xeon(R) CPU E5-2690 v3 @ 2.60GHz 2600 MHz
+Memory: 112 GB
+NNI Version: v0.7
+NNI 模式(local|pai|remote): local
+Python 版本: 3.6
+使用的虚拟环境: Conda
+是否在 Docker 中运行: no
+```

 ## AutoGBDT 示例

@@ -67,12 +67,12 @@
 }
 ```

-总搜索空间为 1, 204, 224 次，将最大 Trial 次数设置为1000。 时间限制为 48 小时。
+总搜索空间为 1, 204, 224 次，将最大 Trial 次数设置为 1000。 时间限制为 48 小时。

 ### 结果

-| 算法            | 最好的损失值       | 最好的 5 次损失的平均值 | 最好的 10 次损失的平均 |
-| ------------- | ------------ | ------------- | ------------- |
+| 算法            | 最好的损失值       | 最好的 5 次损失的平均值 | 最好的 10 次损失的平均值 |
+| ------------- | ------------ | ------------- | -------------- |
 | Random Search | 0.418854     | 0.420352      | 0.421553       |
 | Random Search | 0.417364     | 0.420024      | 0.420997       |
 | Random Search | 0.417861     | 0.419744      | 0.420642       |
@@ -114,21 +114,22 @@

 #### 计算机配置

-    RocksDB:    version 6.1
-    CPU:        6 * Intel(R) Xeon(R) CPU E5-2690 v4 @ 2.60GHz
-    CPUCache:   35840 KB
-    Keys:       16 bytes each
-    Values:     100 bytes each (50 bytes after compression)
-    Entries:    1000000
-    
+```
+RocksDB:    version 6.1
+CPU:        6 * Intel(R) Xeon(R) CPU E5-2690 v4 @ 2.60GHz
+CPUCache:   35840 KB
+Keys:       16 bytes each
+Values:     100 bytes each (50 bytes after compression)
+Entries:    1000000
+```

 #### 存储性能

 **延迟**：每个 IO 请求都需要一些时间才能完成，这称为平均延迟。 有几个因素会影响此时间，包括网络连接质量和硬盘IO性能。

-**IOPS**： **每秒的 IO 操作数量**，这意味着可以在一秒钟内完成的*读取或写入操作次数*。
+**IOPS**：**每秒的 IO 操作数量**，这意味着可以在一秒钟内完成的_读取或写入操作次数_。

-**IO 大小**： **每个 IO 请求的大小**。 根据操作系统和需要磁盘访问的应用程序、服务，它将同时发出读取或写入一定数量数据的请求。
+**IO 大小**：**每个 IO 请求的大小**。 根据操作系统和需要磁盘访问的应用程序、服务，它将同时发出读取或写入一定数量数据的请求。

 **吞吐量（以 MB/s 为单位）= 平均 IO 大小 x IOPS **

@@ -200,7 +201,7 @@ IOPS 与在线处理能力有关，我们在实验中使用 IOPS 作为指标。
 | SMAC      | 491067          | 490472          | **491136**      |
 | Metis     | 444920          | 457060          | 454438          |

-Figure:
+图：

 ![](../../img/hpo_rocksdb_fillrandom.png)

@@ -215,6 +216,6 @@ Figure:
 | SMAC      | 2270874         | 2284904         | 2282266         |
 | Metis     | **2287696**     | 2283496         | 2277701         |

-Figure:
+图：

 ![](../../img/hpo_rocksdb_readrandom.png)
--- a/docs/zh_CN/CommunitySharings/ModelCompressionComparison.md
+++ b/docs/zh_CN/CommunitySharings/ModelCompressionComparison.md
+# 滤波器剪枝算法比较
+
+为了初步了解各种滤波器剪枝算法的性能，在一些基准模型和数据集上使用各种剪枝算法进行了广泛的实验。 此文档中展示了实验结果。 此外，还对这些实验的复现提供了友好的说明，以促进对这项工作的进一步贡献。
+
+## 实验设置
+
+实验使用以下剪枝器/数据集/模型进行:
+
+* 模型：[VGG16, ResNet18, ResNet50](https://github.com/microsoft/nni/tree/master/examples/model_compress/models/cifar10)
+
+* 数据集：CIFAR-10
+
+* 剪枝器：
+    - 剪枝器包括：
+        - 迭代式剪枝器 : `SimulatedAnnealing Pruner`, `NetAdapt Pruner`, `AutoCompress Pruner`。 给定总体稀疏度要求，这类剪枝器可以在不同层中自动分配稀疏度。
+        - 单轮剪枝器：`L1Filter Pruner`，`L2Filter Pruner`，`FPGM Pruner`。 每层的稀疏度与实验设置的总体稀疏度相同。
+    - 这里只比较 **filter pruning** 的剪枝效果。
+
+    对于迭代式剪枝器，使用 `L1Filter Pruner` 作为基础算法。 也就是说, 在迭代式剪枝器决定了稀疏度分布之后，使用 `L1Filter Pruner` 进行真正的剪枝。
+
+    - 上面列出来的所有的剪枝器都已经在 [NNI](https://github.com/microsoft/nni/tree/master/docs/zh_CN/Compressor/Overview.md) 中实现。
+
+## 实验结果
+
+对于每一个数据集/模型/剪枝器的组合，设置不同的目标稀疏度对模型进行剪枝。
+
+这里展示了**权重数量 - 性能**曲线，还展示了**FLOPs - 性能**曲线。 同时在图上画出论文 [AutoCompress: An Automatic DNN Structured Pruning Framework for Ultra-High Compression Rates](http://arxiv.org/abs/1907.03141) 中对 VGG16 和 ResNet18 在 CIFAR-10 上的剪枝结果作为对比。
+
+实验结果如下图所示：
+
+CIFAR-10, VGG16:
+
+![](../../../examples/model_compress/comparison_of_pruners/img/performance_comparison_vgg16.png)
+
+CIFAR-10, ResNet18:
+
+![](../../../examples/model_compress/comparison_of_pruners/img/performance_comparison_resnet18.png)
+
+CIFAR-10, ResNet50:
+
+![](../../../examples/model_compress/comparison_of_pruners/img/performance_comparison_resnet50.png)
+
+## 分析
+
+从实验结果中，得到以下结论：
+
+* 如果稀疏度是通过限制参数量，那么迭代式剪枝器 ( `AutoCompress Pruner` , `SimualatedAnnealing Pruner` ) 比单轮剪枝器表现好。 但是在以 FLOPs 稀疏度为标准的情况下，它们相比于单轮剪枝器就没有了优势，因为当前的这些剪枝算法都是根据参数稀疏度来剪枝的。
+* 在上述实验中，简单的单轮剪枝器 `L1Filter Pruner` , `L2Filter Pruner` , `FPGM Pruner` 表现比较相近。
+* `NetAdapt Pruner` 无法达到比较高的压缩率。 因为它的机制是一次迭代只剪枝一层。 这就导致如果每次迭代剪掉的稀疏度远小于指定的总的稀疏度的话，会导致不可接受的剪枝复杂度。
+
+## 实验复现
+
+### 实现细节
+
+* 实验结果都是在 NNI 中使用剪枝器的默认配置收集的，这意味着当我们在 NNI 中调用一个剪枝器类时，我们不会更改任何默认的类参数。
+
+* FLOPs 和 参数数量均通过 [模型 FLOPs 和参数计数器](https://github.com/microsoft/nni/blob/master/docs/zh_CN/Compressor/CompressionUtils.md#model-flopsparameters-counter)在[模型加速](https://github.com/microsoft/nni/blob/master/docs/zh_CN/Compressor/ModelSpeedup.md)后计算。 这避免了依据掩码模型计算的潜在问题。
+
+* 实验代码在[这里](https://github.com/microsoft/nni/tree/master/examples/model_compress/auto_pruners_torch.py)。
+
+### 实验结果展示
+
+* 如果遵循[示例](https://github.com/microsoft/nni/tree/master/examples/model_compress/auto_pruners_torch.py)的做法，对于每一次剪枝实验，实验结果将以JSON格式保存如下：
+    ``` json
+    {
+        "performance": {"original": 0.9298, "pruned": 0.1, "speedup": 0.1, "finetuned": 0.7746}, 
+        "params": {"original": 14987722.0, "speedup": 167089.0}, 
+        "flops": {"original": 314018314.0, "speedup": 38589922.0}
+    }
+    ```
+
+* 实验结果保存在[这里](https://github.com/microsoft/nni/tree/master/examples/model_compress/comparison_of_pruners)。 可以参考[分析](https://github.com/microsoft/nni/blob/master/examples/model_compress/comparison_of_pruners/analyze.py)来绘制新的性能比较图。
+
+## 贡献
+
+### 待办事项
+
+* 有 FLOPS/延迟 限制的剪枝器
+* 更多剪枝算法/数据集/模型
+
+### 问题
+关于算法实现及实验问题，请[发起 issue](https://github.com/microsoft/nni/issues/new/)。
--- a/docs/zh_CN/CommunitySharings/NNI_colab_support.md
+++ b/docs/zh_CN/CommunitySharings/NNI_colab_support.md
+
+# 在 Google Colab 上使用 NNI
+在 Google Colab 上轻松使用 NNI。 Colab 没有暴露它的公网 IP 及端口，因此默认情况下无法在 Colab 中访问 NNI 的 Web 界面。 为解决此问题，需要使用反向代理软件，例如 `ngrok` 或 `frp`。 此教程将展示如何使用 ngrok 在 Colab 上访问 NNI 的Web 界面。
+
+## 如何在 Google Colab 上打开 NNI 的 Web 界面
+
+1. 安装需要的包和软件。
+
+
+```
+! pip install nni # install nni
+! wget https://bin.equinox.io/c/4VmDzA7iaHb/ngrok-stable-linux-amd64.zip # download ngrok and unzip it
+! unzip ngrok-stable-linux-amd64.zip
+! mkdir -p nni_repo
+! git clone https://github.com/microsoft/nni.git nni_repo/nni # clone NNI's offical repo to get examples
+```
+
+2. 在[此处](https://ngrok.com/)注册 ngrok 账号，然后通过 authtoken 来连接。
+
+
+```
+! ./ngrok authtoken <your-authtoken>
+```
+
+3. 在大于 1024 的端口号上启动 NNI 样例，之后在相同端口上启动 ngrok。 如果希望使用 GPU，确保 config.yml 中 gpuNum >= 1 。 因为使用 `! ngrok http 5000 &` 会停止响应，要使用 </0> get_ipython()</code> 来启动 ngrok。
+
+
+```
+! nnictl create --config nni_repo/nni/examples/trials/mnist-pytorch/config.yml --port 5000 &
+get_ipython().system_raw('./ngrok http 5000 &')
+```
+
+4. 查看公网 url 。
+
+
+```
+! curl -s http://localhost:4040/api/tunnels # don't change the port number 4040
+```
+
+在步骤 4 后将会看到类似 http://xxxx.ngrok.io 的 url，打开此url即可看到 NNI 的Web 界面。 玩得开心 :)
+
+## 使用 frp 访问 Web 界面
+
+frp 是另一款提供了相似功能的反向代理软件。 但 frp 不提供免费的公网 url，因此可能需要一台拥有公网 IP 的服务器来作为 frp 的服务器端。 参考[这里](https://github.com/fatedier/frp)来了解如何部署 frp。
--- a/docs/zh_CN/CommunitySharings/NasComparision.md
+++ b/docs/zh_CN/CommunitySharings/NasComparision.md
 # 神经网络结构搜索的对比
-
 *匿名作者*

 训练和比较 NAS（神经网络架构搜索）的模型，包括 Autokeras，DARTS，ENAS 和 NAO。

 源码链接如下：

- Autokeras: <https://github.com/jhfjhfj1/autokeras>
+- Autokeras: [https://github.com/jhfjhfj1/autokeras](https://github.com/jhfjhfj1/autokeras)

- DARTS: <https://github.com/quark0/darts>
+- DARTS: [https://github.com/quark0/darts](https://github.com/quark0/darts)

- ENAS: <https://github.com/melodyguan/enas>
+- ENAS: [https://github.com/melodyguan/enas](https://github.com/melodyguan/enas)

- NAO: <https://github.com/renqianluo/NAO>
+- NAO: [https://github.com/renqianluo/NAO](https://github.com/renqianluo/NAO)

 ## 实验说明

@@ -27,6 +26,8 @@
 | [ImageNet-10-1](http://www.image-net.org/)                                              | 9,750   | 10    | 咖啡杯、电脑键盘、餐桌、衣柜、割草机、麦克风、秋千、缝纫机、里程表和燃气泵。                      |
 | [ImageNet-10-2](http://www.image-net.org/)                                              | 9,750   | 10    | 鼓，班吉，口哨，三角钢琴，小提琴，管风琴，原声吉他，长号，长笛和萨克斯。                        |

+
+
 没有改变源码中的 Fine-tuning 方法。 为了匹配每个任务，改变了源码中模型的输入图片大小和输出类别数目的部分。

 所有 NAS 方法模型搜索时间和重训练时间都是**两天**。  所有结果都是基于**三次重复实验**。 评估计算机有一块 Nvidia Tesla P100 GPU、112GB 内存和 2.60GHz CPU (Intel E5-2690)。
@@ -54,15 +55,15 @@ NAO 需要太多的计算资源，因此只使用提供 Pipeline 脚本的 NAO-W
 | --------- | ------------ |:----------------:|:----------------:|:--------------:|:-----------:|
 | CIFAR- 10 | 88.56(best)  |   96.13(best)    |   97.11(best)    | 97.17(average) | 96.47(best) |

-AutoKeras，由于其算法中的随机因素，它在所有数据集中的表现相对较差。
+对于 AutoKeras，由于其算法中的随机因素，它在所有数据集中的表现相对较差。

-ENAS，ENAS（macro）在 OUI-Adience-Age 数据集中表现较好，并且 ENAS（micro）在 CIFAR-10 数据集中表现较好。
+对于 ENAS，ENAS（macro）在 OUI-Adience-Age 数据集中表现较好，并且 ENAS（micro）在 CIFAR-10 数据集中表现较好。

-对于DARTS，在某些数据集上具有良好的结果，但在某些数据集中具有比较大的方差。 DARTS 三次实验中的差异在 OUI-Audience-Age 数据集上可达 5.37％（绝对值），在 ImageNet-10-1 数据集上可达4.36％（绝对值）。
+对于 DARTS，在某些数据集上具有良好的结果，但在某些数据集中具有比较大的方差。 DARTS 三次实验中的差异在 OUI-Audience-Age 数据集上可达 5.37％（绝对值），在 ImageNet-10-1 数据集上可达4.36％（绝对值）。

 NAO-WS 在 ImageNet-10-2 中表现良好，但在 OUI-Adience-Age 中表现非常差。

-## 参考文献
+## 参考

 1. Jin, Haifeng, Qingquan Song, and Xia Hu. "Efficient neural architecture search with network morphism." *arXiv preprint arXiv:1806.10282* (2018).


--- a/docs/zh_CN/CommunitySharings/RecommendersSvd.md
+++ b/docs/zh_CN/CommunitySharings/RecommendersSvd.md
-# 在 NNI 上自动调优 SVD
+# 自动调优 SVD（在推荐系统中使用 NNI）

 本教程中，会首先介绍 GitHub 存储库：[Recommenders](https://github.com/Microsoft/Recommenders)。 它使用 Jupyter Notebook 提供了构建推荐系统的一些示例和实践技巧。 其中大量的模型被广泛的应用于推荐系统中。 为了提供完整的体验，每个示例都通过以下五个关键任务中展示：


--- a/docs/zh_CN/CommunitySharings/TuningSystems.md
+++ b/docs/zh_CN/CommunitySharings/TuningSystems.md
-# 使用 NNI 自动调优系统
-
-随着计算机系统和网络变得越来越复杂，通过显式的规则和启发式的方法来手工优化已经越来越难，甚至不可能了。 下面是使用 NNI 来优化系统的两个示例。 可根据这些示例来调优自己的系统。
-
-* [在 NNI 上调优 RocksDB](../TrialExample/RocksdbExamples.md)
-* [使用 NNI 调优 SPTAG (Space Partition Tree And Graph) 参数](SptagAutoTune.md)
-
-参考[论文](https://dl.acm.org/citation.cfm?id=3352031)了解详情：
-
-Mike Liang, Chieh-Jan, et al. "The Case for Learning-and-System Co-design." ACM SIGOPS Operating Systems Review 53.1 (2019): 68-74.
--- a/docs/zh_CN/CommunitySharings/automodel.rst
+++ b/docs/zh_CN/CommunitySharings/automodel.rst
+######################
+自动模型调优
+######################
+
+NNI 可以应用于各种模型调优任务。 一些最先进的模型搜索算法，如EfficientNet，可以很容易地在NNI上构建。 流行的模型，例如，推荐模型，可以使用 NNI 进行调优。 下面是一些用例，展示了如何在您的模型调优任务中使用 NNI，以及如何使用 NNI 构建您自己的流水线。
+
+..  toctree::
+    :maxdepth: 1
+
+    SVD 自动调优 <RecommendersSvd>
+    NNI 中的 EfficientNet <./TrialExample/EfficientNet>
+    用于阅读理解的自动模型架构搜索<../TrialExample/SquadEvolutionExamples>
+    TPE 的并行优化<ParallelizingTpeSearch>
\ No newline at end of file
--- a/docs/zh_CN/CommunitySharings/autosys.rst
+++ b/docs/zh_CN/CommunitySharings/autosys.rst
+#######################
+自动系统调优
+#######################
+
+数据库、张量算子实现等系统的性能往往需要进行调优，以适应特定的硬件配置、目标工作负载等。 手动调优系统非常复杂，并且通常需要对硬件和工作负载有详细的了解。 NNI 可以使这些任务变得更容易，并帮助系统所有者自动找到系统的最佳配置。 自动系统调优的详细设计思想可以在[这篇文章](https://dl.acm.org/doi/10.1145/3352020.3352031)中找到。 以下是 NNI 可以发挥作用的一些典型案例。
+
+..  toctree::
+    :maxdepth: 1
+
+    自动调优 SPTAG（Space Partition Tree And Graph）<SptagAutoTune>
+    调优 RocksDB 的性能<../TrialExample/RocksdbExamples>
+    自动调优张量算子<../TrialExample/OpEvoExamples>
\ No newline at end of file
--- a/docs/zh_CN/CommunitySharings/community_sharings.rst
+++ b/docs/zh_CN/CommunitySharings/community_sharings.rst
-######################
-社区分享
-######################
+#######################
+用例与解决方案
+#######################

-除了官方的教程和示例之外，也支持社区贡献者分享自己的自动机器学习实践经验，特别是使用 NNI 的实践经验。
+与文档其他部分中展示功能用法的教程和示例不同，本部分主要介绍端到端方案和用例，以帮助用户进一步了解NNI如何为他们提供帮助。 NNI 可广泛应用于各种场景。 除了官方的教程和示例之外，也支持社区贡献者分享自己的自动机器学习实践经验，特别是使用 NNI 的实践经验。

 ..  toctree::
    :maxdepth: 2

-    Recommenders 中使用 NNI<RecommendersSvd>
-    使用 NNI 为 SPTAG 自动调参 <SptagAutoTune>
-    神经网络结构搜索（NAS）的对比<NasComparision>
-    超参调优算法的对比<HpoComparision>
-    TPE 的并行优化<ParallelizingTpeSearch>
-    使用 NNI 自动调优系统 <TuningSystems>
-    来自知乎的评论：作者 Garvin Li <NNI_AutoFeatureEng>
+    自动模型调优（HPO/NAS）<automodel>
+    自动系统调优（AutoSys）<autosys>
+    模型压缩<model_compression>
+    特征工程<feature_engineering>
+    性能测量，比较和分析<perf_compare>
+    在 Google Colab 中使用 NNI <NNI_colab_support>
--- a/docs/zh_CN/CommunitySharings/feature_engineering.rst
+++ b/docs/zh_CN/CommunitySharings/feature_engineering.rst
+###################
+特征工程
+###################
+
+以下是关于 NNI 如何助力特征工程的文章，由社区贡献者分享。 将来会添加更多用例和解决方案。
+
+..  toctree::
+    :maxdepth: 1
+
+    来自知乎的评论：作者 Garvin Li <NNI_AutoFeatureEng>
\ No newline at end of file
--- a/docs/zh_CN/CommunitySharings/model_compression.rst
+++ b/docs/zh_CN/CommunitySharings/model_compression.rst
+#################
+模型压缩
+#################
+
+以下介绍了如何将知识蒸馏应用于 NNI 模型压缩。 将来会添加更多用例和解决方案。
+
+..  toctree::
+    :maxdepth: 1
+
+    使用 NNI 模型压缩进行知识蒸馏<../TrialExample/KDExample>
\ No newline at end of file
--- a/docs/zh_CN/CommunitySharings/perf_compare.rst
+++ b/docs/zh_CN/CommunitySharings/perf_compare.rst
+################################################
+性能测量，比较和分析
+################################################
+
+性能比较和分析可以帮助用户在他们的场景中选择适合的算法（例如 Tuner，NAS 算法）。 以下是一些供用户参考的测量和比较数据。
+
+..  toctree::
+    :maxdepth: 1
+
+    神经网络结构搜索（NAS）的对比<NasComparison>
+    超参调优算法的对比<HpoComparison>
+    模型压缩算法的对比<ModelCompressionComparison>
\ No newline at end of file
--- a/docs/zh_CN/Compressor/AutoCompression.md
+++ b/docs/zh_CN/Compressor/AutoCompression.md
@@ -84,7 +84,7 @@ config_list_agp = [{'initial_sparsity': 0, 'final_sparsity': conv0_sparsity,
                   {'initial_sparsity': 0, 'final_sparsity': conv1_sparsity,
                    'start_epoch': 0, 'end_epoch': 3,
                    'frequency': 1,'op_name': 'conv1' },]
-PRUNERS = {'level':LevelPruner(model, config_list_level), 'agp':AGP_Pruner(model, config_list_agp)}
+PRUNERS = {'level':LevelPruner(model, config_list_level), 'agp':AGPPruner(model, config_list_agp)}
 pruner = PRUNERS(params['prune_method']['_name'])
 pruner.compress()
 ... # fine tuning

--- a/docs/zh_CN/Compressor/CompressionUtils.md
+++ b/docs/zh_CN/Compressor/CompressionUtils.md
@@ -127,5 +127,6 @@ from nni.compression.torch.utils.counter import count_flops_params

 # 给定输入大小 (1, 1, 28, 28) 
 flops, params = count_flops_params(model, (1, 1, 28, 28))
+# 将输出大小格式化为 M (例如, 10^6)
 print(f'FLOPs: {flops/1e6:.3f}M,  Params: {params/1e6:.3f}M)
 ```
\ No newline at end of file
--- a/docs/zh_CN/Compressor/LotteryTicketHypothesis.md
+++ b/docs/zh_CN/Compressor/LotteryTicketHypothesis.md
-Lottery Ticket 假设
-===
-
-## 介绍
-
-[The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks](https://arxiv.org/abs/1803.03635) 是主要衡量和分析的论文，它提供了非常有意思的见解。 为了在 NNI 上支持此算法，主要实现了找到*获奖彩票*的训练方法。
-
-本文中，作者使用叫做*迭代*修剪的方法：
-> 1. 随机初始化一个神经网络 f(x;theta_0) (其中 theta_0 为 D_{theta}).
-> 2. 将网络训练 j 次，得出参数 theta_j。
-> 3. 在 theta_j 修剪参数的 p%，创建掩码 m。
-> 4. 将其余参数重置为 theta_0 的值，创建获胜彩票 f(x;m*theta_0)。
-> 5. 重复步骤 2、3 和 4。
-
-如果配置的最终稀疏度为 P (e.g., 0.8) 并且有 n 次修建迭代，每次迭代修剪前一轮中剩余权重的 1-(1-P)^(1/n)。
-
-## 重现结果
-
-在重现时，在 MNIST 使用了与论文相同的配置。 [此处](https://github.com/microsoft/nni/tree/master/examples/model_compress/lottery_torch_mnist_fc.py)为实现代码。 在次实验中，修剪了10次，在每次修剪后，训练了 50 个 epoch。
-
-![](../../img/lottery_ticket_mnist_fc.png)
-
-上图展示了全连接网络的结果。 `round0-sparsity-0.0` 是没有剪枝的性能。 与论文一致，修剪约 80% 也能获得与不修剪时相似的性能，收敛速度也会更快。 如果修剪过多（例如，大于 94%），则精度会降低，收敛速度会稍慢。 与本文稍有不同，论文中数据的趋势比较明显。
--- a/docs/zh_CN/Compressor/Overview.md
+++ b/docs/zh_CN/Compressor/Overview.md
@@ -42,6 +42,7 @@ NNI 的模型压缩工具包，提供了最先进的模型压缩算法和策略
 | [SimulatedAnnealing Pruner](https://nni.readthedocs.io/zh/latest/Compressor/Pruner.html#simulatedannealing-pruner)           | 通过启发式的模拟退火算法进行自动剪枝 [参考论文](https://arxiv.org/abs/1907.03141)                                                                                   |
 | [AutoCompress Pruner](https://nni.readthedocs.io/zh/latest/Compressor/Pruner.html#autocompress-pruner)                       | 通过迭代调用 SimulatedAnnealing Pruner 和 ADMM Pruner 进行自动剪枝 [参考论文](https://arxiv.org/abs/1907.03141)                                                |

+参考此[基准测试](https://github.com/microsoft/nni/blob/master/docs/zh_CN/CommunitySharings/ModelCompressionComparison.md)来查看这些剪枝器在一些基准问题上的表现。

 ### 量化算法

@@ -64,7 +65,7 @@ NNI 的模型压缩工具包，提供了最先进的模型压缩算法和策略

 ## 压缩工具

-压缩工具包括了一些有用的工具，能帮助用户理解并分析要压缩的模型。 例如，可检查每层对剪枝的敏感度。 可很容易的计算模型的 FLOPs 和参数数量。 [点击这里](./CompressionUtils.md)，查看压缩工具的完整列表。
+压缩工具包括了一些有用的工具，能帮助用户理解并分析要压缩的模型。 例如，可检查每层对剪枝的敏感度。 可很容易地计算模型的 FLOPs 和参数数量。 点击[这里](./CompressionUtils.md)，查看压缩工具的完整列表。

 ## 自定义压缩算法


--- a/docs/zh_CN/Compressor/Pruner.md
+++ b/docs/zh_CN/Compressor/Pruner.md