"src/include/blockwise_2d_tensor_op.hpp" did not exist on "39775d484c4d15a5b895edfc9d2323f05ab2d3d4"
[Feature] Support Llama-2 with GQA (#147)
* add GQA for llama2 * fix model conversion * fix lint & remove dev log * update news * minor * fix allocation size * fix split_dim for w_qkv.bias
Showing
Please register or sign in to comment