Commit 8bc01c63 authored by Zihao Ye, committed by Minjie Wang

[Tutorial] Tutorial fix (SSE link & Transformer) (#285)

* upd tutorial

* upd tutorial
parent 322bd713
@@ -32,8 +32,8 @@ Graph Neural Network and its variant
   operation, sparse-matrix tensor operations, along with message-passing with
   DGL.
-* **SSE** `[paper] <http://proceedings.mlr.press/v80/dai18a/dai18a.pdf>`__
-  `[tutorial <1_gnn/8_sse_mx.html>]` `[code]
+* **SSE** `[paper] <http://proceedings.mlr.press/v80/dai18a/dai18a.pdf>`__ `[tutorial]
+  <1_gnn/8_sse_mx.html>`__ `[code]
   <https://github.com/jermainewang/dgl/blob/master/examples/mxnet/sse>`__:
   the emphasize here is *giant* graph that cannot fit comfortably on one GPU
   card. SSE is an example to illustrate the co-design of both algorithm and
@@ -44,7 +44,7 @@ Transformer Tutorial
 #    v_i = W_v\cdot x_i\\
 #    \textrm{score} = q_j^T k_i
 #
-# where :math:`W_q, W_k, W_v \in \mathbb{R}^{n\times n}` map the
+# where :math:`W_q, W_k, W_v \in \mathbb{R}^{n\times d_k}` map the
 # representations :math:`x` to “query”, “key”, and “value” space
 # respectively.
 #
@@ -472,8 +472,8 @@ Transformer Tutorial
 #
 # By calling ``update_graph`` function, we can “DIY our own
 # Transformer” on any subgraphs with nearly the same code. This
-# flexibility enables us to discover new, sparse structures (c.f.
-# `here <https://arxiv.org/pdf/1508.04025.pdf>`__). Note in our
+# flexibility enables us to discover new, sparse structures (c.f. local attention
+# mentioned `here <https://arxiv.org/pdf/1508.04025.pdf>`__). Note in our
 # implementation we does not use mask or padding, which makes the logic
 # more clear and saves memory. The trade-off is that the implementation is
 # slower; we will improve with future DGL optimizations.
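To illustrate the idea behind this paragraph (scoring only the edges that exist in a graph, instead of building a dense score matrix and masking it), here is a small plain-PyTorch sketch; the graph, sizes, and variable names are made up for illustration, and this is not the tutorial's DGL implementation:

    import torch

    num_nodes, d_k = 5, 8
    q = torch.randn(num_nodes, d_k)   # per-node queries
    k = torch.randn(num_nodes, d_k)   # per-node keys
    v = torch.randn(num_nodes, d_k)   # per-node values

    # a made-up sparse attention structure: each pair (src[e], dst[e]) is one edge
    src = torch.tensor([0, 1, 2, 3, 4, 0])
    dst = torch.tensor([1, 2, 3, 4, 0, 2])

    # scores are computed only on existing edges, shape (num_edges,); no mask, no padding
    scores = (q[dst] * k[src]).sum(dim=-1) / d_k ** 0.5

    # softmax over the incoming edges of each destination node, then aggregate values
    out = torch.zeros(num_nodes, d_k)
    for node in dst.unique():
        mask = dst == node
        attn = torch.softmax(scores[mask], dim=0)              # weights over this node's in-edges
        out[node] = (attn.unsqueeze(1) * v[src[mask]]).sum(dim=0)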
@@ -874,3 +874,8 @@ Transformer Tutorial
 # .. |image9| image:: https://s1.ax1x.com/2018/12/06/F1sGod.png
 # .. |image10| image:: https://s1.ax1x.com/2018/12/06/F1r8Cq.gif
 #
+# .. note::
+#    We apologize that this notebook itself is not runnable due to many dependencies,
+#    please download the `7_transformer.py <https://s3.us-east-2.amazonaws.com/dgl.ai/tutorial/7_transformer.py>`__,
+#    and copy the python script to directory ``examples/pytorch/transformer``
+#    then run ``python 7_transformer.py`` to see how it works.