- 01 Aug, 2024 3 commits
- 31 Jul, 2024 1 commit
-
-
zhuwenwen authored
-
- 29 Jul, 2024 1 commit
-
-
zhangshao authored
-
- 25 Jul, 2024 6 commits
- 24 Jul, 2024 6 commits
-
-
zhuwenwen authored
-
zhuwenwen authored
# Conflicts: # vllm/model_executor/layers/linear.py # vllm/model_executor/models/baichuan.py
-
zhuwenwen authored
-
zhuwenwen authored
-
zhuwenwen authored
-
zhuwenwen authored
# Conflicts: # csrc/attention/attention_kernels.cu # csrc/attention/attention_utils.cuh # csrc/layernorm_kernels.cu # vllm/model_executor/layers/linear.py # vllm/model_executor/models/baichuan.py # vllm/model_executor/models/llama.py
-
- 23 Jul, 2024 20 commits
-
-
Simon Mo authored
-
Woosuk Kwon authored
-
Woosuk Kwon authored
-
youkaichao authored
-
Woosuk Kwon authored
-
zhuwenwen authored
-
Simon Mo authored
-
Simon Mo authored
-
Roger Wang authored
-
youkaichao authored
-
Cyrus Leung authored
-
youkaichao authored
[doc][distributed] add more doc for setting up multi-node environment (#6529)
-
Michael Goin authored
-
zhaotyer authored
Co-authored-by:
tianyi.zhao <tianyi.zhao@transwarp.io> Co-authored-by:
youkaichao <youkaichao@126.com>
-
youkaichao authored
-
Woosuk Kwon authored
-
Cheng Li authored
-
zhuwenwen authored
-
Cody Yu authored
-
Woosuk Kwon authored
-
- 22 Jul, 2024 3 commits
-
-
youkaichao authored
-
Jiaxin Shan authored
Co-authored-by:Antoni Baum <antoni.baum@protonmail.com>
-
Kevin H. Luu authored
Signed-off-by:kevin <kevin@anyscale.com>
-