"git@developer.sourcefind.cn:chenpangpang/transformers.git" did not exist on "f5c2a122e34836b87abb6042cf641b040e790e1c"
Ignore non-causal mask in more cases with SDPA (#30138)
* update non-causal mask for sdpa * add test * update docstrings * add one more test * fix cross attention bug * gentler atol/rtol
Showing
Please register or sign in to comment