[torch.compile] Don't do the fast moe cold start optimization if there is...
[torch.compile] Don't do the fast moe cold start optimization if there is speculative decoding (#33624) Signed-off-by:Richard Zou <zou3519@gmail.com> Co-authored-by:
Robert Shaw <114415538+robertgshaw2-redhat@users.noreply.github.com> (cherry picked from commit 5eac9a1b)
Showing
Please register or sign in to comment