Support precomputed_embeddings for Llama 4 (#8156)

Signed-off-by: Xinyuan Tong <xinyuantong.cs@gmail.com>
Co-authored-by: Xiang (Kevin) Li <lik@nvidia.com>
Co-authored-by: Xinyuan Tong <115166877+JustinTong0323@users.noreply.github.com>
Co-authored-by: Xinyuan Tong <xinyuantong.cs@gmail.com>