[Llama4] Enable attention temperature tuning by default for long context (>32k) (#16439)
Signed-off-by:Ye (Charlotte) Qi <yeq@meta.com> Co-authored-by:
Ye (Charlotte) Qi <yeq@meta.com>
Showing
Please register or sign in to comment
Signed-off-by:Ye (Charlotte) Qi <yeq@meta.com> Co-authored-by:
Ye (Charlotte) Qi <yeq@meta.com>