Support precomputed_embeddings for Llama 4 (#8156)
Signed-off-by:Xinyuan Tong <xinyuantong.cs@gmail.com> Co-authored-by:
Xiang (Kevin) Li <lik@nvidia.com> Co-authored-by:
Xinyuan Tong <115166877+JustinTong0323@users.noreply.github.com> Co-authored-by:
Xinyuan Tong <xinyuantong.cs@gmail.com>
Showing
This diff is collapsed.
Please register or sign in to comment