Commit 269a3ed1 authored by Davis King

fix incorrect docs about what gradient is computed

parent 92106718
@@ -1639,18 +1639,19 @@ namespace dlib { namespace tt
     void gelu_gradient (
         tensor& grad,
-        const tensor& dest,
+        const tensor& src,
         const tensor& gradient_input
     );
     /*!
         requires
-            - have_same_dimensions(dest,gradient_input) == true
-            - have_same_dimensions(dest,grad) == true
+            - have_same_dimensions(src,gradient_input) == true
+            - have_same_dimensions(src,grad) == true
         ensures
-            - This function computes the gradient of f() with respect to SRC and stores
-              it to grad.  Moreover, if is_same_object(grad,gradient_input)==true then
-              the output is assigned to grad, replacing its previous contents.
-              Otherwise the output is added to grad.
+            - Recalling that dest is the output of gelu(dest,src), let f(src) ==
+              dot(gradient_input,dest).  Then this function computes the gradient of f() with respect
+              to src and stores it to grad.  Moreover, if is_same_object(grad,gradient_input)==true
+              then the output is assigned to grad, replacing its previous contents.  Otherwise the
+              output is added to grad.
         - This function supports in-place operation, i.e. having
           is_same_object(grad, gradient_input)==true
     !*/