-
Tao He authored
[bugifx] QWen-1M context support[2/3] using current cuda stream in the DCA's kernel for bugfix. (#8611) Signed-off-by:
Tao He <linzhu.ht@alibaba-inc.com> Co-authored-by:
sa-buc <linzhu.ht@w32d09270.cloud.sqa.na131>
5d15fb8c
[bugifx] QWen-1M context support[2/3] using current cuda stream in the DCA's kernel for bugfix. (#8611) Signed-off-by:Tao He <linzhu.ht@alibaba-inc.com> Co-authored-by:
sa-buc <linzhu.ht@w32d09270.cloud.sqa.na131>