Unverified Commit 1a868ff3 authored by Przemyslaw Tredak's avatar Przemyslaw Tredak Committed by GitHub
Remove the nonexistent parameter from fused attention documentation (#181)



* Remove the nonexistent parameter from fused attention documentation

Signed-off-by: Przemek Tredak <ptredak@nvidia.com>

* Remove the second instance

Signed-off-by: Przemek Tredak <ptredak@nvidia.com>

---------

Signed-off-by: Przemek Tredak <ptredak@nvidia.com>
parent e5a69d92
@@ -133,7 +133,6 @@ void nvte_fused_attn_fwd_qkvpacked(
  * \param[in] Aux_CTX_Tensors Auxiliary tensors from forward when in training mode.
  * \param[out] dQKV The gradient of the QKV tensor.
  * \param[in] cu_seqlens Accumulative sequence lengths, [batch_size + 1].
- * \param[in] rng_state Seed and offset of CUDA random number generator.
  * \param[in] max_seqlen Max sequence length used for computing,
  * it may be >= max(cu_seqlens).
  * \param[in] attn_scale Scaling factor for Q * K.T.
@@ -222,7 +221,6 @@ void nvte_fused_attn_fwd_kvpacked(
  * \param[out] dKV The gradient of the KV tensor.
  * \param[in] cu_seqlens_q Accumulative sequence lengths for Q, [batch_size + 1].
  * \param[in] cu_seqlens_kv Accumulative sequence lengths for KV, [batch_size + 1].
- * \param[in] rng_state Seed and offset of CUDA random number generator.
  * \param[in] max_seqlen_q Max sequence length used for computing for Q,
  * it may be >= max(cu_seqlens_q).
  * \param[in] max_seqlen_kv Max sequence length used for computing for KV.
......