Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Repository
4f0f844b1675419fd2171bc5e981a82386ec552b
Switch branch/tag
vllm_cscc
vllm
model_executor
models
llama4.py
Find file
Blame
History
Permalink
Fix cuda illegal mem access with Llama4 TP8 + rms_norm custom op (#22701)
· 4f0f844b
Po-Han Huang (NVIDIA)
authored
Aug 13, 2025
Signed-off-by:
Po-Han Huang
<
pohanh@nvidia.com
>
4f0f844b
llama4.py
30.8 KB
Edit
Web IDE
Replace llama4.py
×
Attach a file by drag & drop or
click to upload
Commit message
Replace llama4.py
Replace file
Cancel
A new branch will be created in your fork and a new merge request will be started.