kecinstone
2024pra-vllm
Repository
84e4e37d1427bd254bea6ff366026773d36f3982
2024-pra-vllm/vllm/model_executor/models/llama.py
support sharding llama2-70b on more than 8 GPUs (#1209)
a60b3530 · Zhuohan Li authored Oct 02, 2023
Co-authored-by: JiCheng <247153481@qq.com>
llama.py (16.7 KB)