1. 02 Aug, 2022 1 commit
    • Piotr Dabkowski's avatar
      Fix uninitialized parameter in conformer relative attention. (#18368) · 68a894a5
      Piotr Dabkowski authored
      `torch.Tensor` creates an unitialized tensor (as via `torch.empty`), this leads to undeterministic behavior, poor initialization, and nans if you have unlucky init. The paper does not specify the initialization for bias terms, so I guess zero seems like a good choice - no bias initially. `torch.Tensor` is usually populated with zeros, so this fix will be close to the intended behavior:
      
      ```
      >>> torch.Tensor(100, 100).sum()
      tensor(0.)
      >>> torch.Tensor(100, 100).sum()
      tensor(nan)
      >>> torch.Tensor(100, 100).sum()
      tensor(0.)
      ```
      68a894a5
  2. 01 Aug, 2022 19 commits
  3. 30 Jul, 2022 1 commit
  4. 29 Jul, 2022 6 commits
  5. 28 Jul, 2022 10 commits
  6. 27 Jul, 2022 3 commits