Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
kecinstone
2024pra-vllm
Commits
3b7178cfa4a317922d4aef9dd3b2647b8d950e7d
Switch branch/tag
2024-pra-vllm
vllm
model_executor
sampling_metadata.py
28 Feb, 2024
1 commit
[Neuron] Support inference with transformers-neuronx (#2569)
· 3b7178cf
Liangfu Chen
authored
Feb 28, 2024
3b7178cf
21 Feb, 2024
1 commit
Support per-request seed (#2514)
· 7d2dcce1
Nick Hill
authored
Feb 21, 2024
7d2dcce1
03 Jan, 2024
1 commit
Use NCCL instead of ray for control-plane communication to remove serialization overhead (#2221)
· fd4ea8ef
Zhuohan Li
authored
Jan 04, 2024
fd4ea8ef
17 Dec, 2023
1 commit
Make sampler less blocking (#1889)
· a7347d9a
Antoni Baum
authored
Dec 17, 2023
a7347d9a
30 Nov, 2023
1 commit
Refactor Worker & InputMetadata (#1843)
· 27feead2
Woosuk Kwon
authored
Nov 29, 2023
27feead2