- 24 Jul, 2024 5 commits
-
-
zhuwenwen authored
# Conflicts: # vllm/model_executor/layers/linear.py # vllm/model_executor/models/baichuan.py
-
zhuwenwen authored
-
zhuwenwen authored
-
zhuwenwen authored
-
zhuwenwen authored
# Conflicts: # csrc/attention/attention_kernels.cu # csrc/attention/attention_utils.cuh # csrc/layernorm_kernels.cu # vllm/model_executor/layers/linear.py # vllm/model_executor/models/baichuan.py # vllm/model_executor/models/llama.py
-
- 23 Jul, 2024 2 commits
- 22 Jul, 2024 4 commits
- 20 Jul, 2024 7 commits
- 18 Jul, 2024 2 commits
- 17 Jul, 2024 3 commits
- 16 Jul, 2024 3 commits
- 15 Jul, 2024 14 commits
-
-
youkaichao authored
-
Simon Mo authored
-
Thomas Parnell authored
Signed-off-by:Thomas Parnell <tpa@zurich.ibm.com>
-
Thomas Parnell authored
Signed-off-by:
Thomas Parnell <tpa@zurich.ibm.com> Co-authored-by:
Chih-Chieh-Yang <chih.chieh.yang@ibm.com>
-
Pernekhan Utemuratov authored
Co-authored-by:Pernekhan Utemuratov <pernekhan@deepinfra.com>
-
Tyler Michael Smith authored
-
youkaichao authored
-
Roger Wang authored
-
youkaichao authored
-
Cyrus Leung authored
-
youkaichao authored
-
DefTruth authored
-
zifeitong authored
Co-authored-by:Cyrus Leung <cyrus.tl.leung@gmail.com>
-
Simon Mo authored
-