1. 08 Sep, 2023 2 commits
    • Thien Tran's avatar
      Update missing docs on `activation_dropout` and fix DropOut docs for SEW-D (#26031) · 18ee1fe7
      Thien Tran authored
      * add missing doc for activation dropout
      
      * fix doc for SEW-D dropout
      
      * deprecate hidden_dropout for SEW-D
      18ee1fe7
    • Alexander Krauck's avatar
      Fix Dropout Implementation in Graphormer (#24817) · 0c67a72c
      Alexander Krauck authored
      This commit corrects the dropout implementation in Graphormer, aligning it with the original implementation and improving performance. Specifically:
      
      1. The `attention_dropout` variable, intended for use in GraphormerMultiheadAttention, was defined but not used. This has been corrected to use `attention_dropout` instead of the regular `dropout`.
      2. The `activation_dropout` for the activations in the feed-forward layers was missing. Instead, the regular `dropout` was used. This commit adds `activation_dropout` to the feed-forward layers.
      
      These changes ensure the dropout implementation matches the original Graphormer and delivers empirically better performance.
      0c67a72c
  2. 07 Sep, 2023 9 commits
  3. 06 Sep, 2023 7 commits
  4. 05 Sep, 2023 22 commits