[Models][GDN] Remove GPU/CPU syncs in `GDNAttentionMetadata.build` during...
[Models][GDN] Remove GPU/CPU syncs in `GDNAttentionMetadata.build` during speculative decoding (#38047)
Signed-off-by:
Lukas Geiger <lukas.geiger94@gmail.com>
Showing
Please register or sign in to comment