[Models][Qwen3 ViT] Keep `max_seqlen` on CPU to prevent D2H sync (#37139)
Signed-off-by:Lukas Geiger <lukas.geiger94@gmail.com> Co-authored-by:
Isotr0py <mozf@mail2.sysu.edu.cn>
Showing
Please register or sign in to comment
Signed-off-by:Lukas Geiger <lukas.geiger94@gmail.com> Co-authored-by:
Isotr0py <mozf@mail2.sysu.edu.cn>