[CPU] support the case where num_attention_heads or intermediate_size is not...
[CPU] support the case where num_attention_heads or intermediate_size is not divisible by the TP size (#6771)
Showing
Please register or sign in to comment
[CPU] support the case where num_attention_heads or intermediate_size is not divisible by the TP size (#6771)