You need to sign in or sign up before continuing.
[CPU] support the case where num_attention_heads or intermediate_size is not...
[CPU] support the case where num_attention_heads or intermediate_size is not divisible by the TP size (#6771)
Showing
Please register or sign in to comment