Unverified Commit 9e74d9d0 authored by Charles Riggins's avatar Charles Riggins Committed by GitHub
Browse files

Correct alignment in the seq_len diagram. (#5592)


Co-authored-by: default avatarLiqian Chen <liqian.chen@deeplang.ai>
parent 9333fb8e
...@@ -83,7 +83,7 @@ class FlashAttentionMetadata(AttentionMetadata): ...@@ -83,7 +83,7 @@ class FlashAttentionMetadata(AttentionMetadata):
# |---------------- N iteration ---------------------| # |---------------- N iteration ---------------------|
# |- tokenA -|......................|-- newTokens ---| # |- tokenA -|......................|-- newTokens ---|
# |---------- context_len ----------| # |---------- context_len ----------|
# |-------------------- seq_len ----------------------| # |-------------------- seq_len ---------------------|
# |-- query_len ---| # |-- query_len ---|
# Maximum query length in the batch. None for decoding. # Maximum query length in the batch. None for decoding.
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment