attention_test:ö T
input weights bias
mask_indexresultAttention_0" Attention* num_heads attention_testZ
input € €Z weights € €Z bias €Z mask_index €b result € €B