Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
norm
vllm
Commits
ee88a7e5f3acc1f81c52dfc45d2bdd542b4cd9ed
Switch branch/tag
vllm
benchmark
benchmark_latency.py
09 Apr, 2023
1 commit
Add an option to use dummy model weights (#33)
· ee88a7e5
Woosuk Kwon
authored
Apr 08, 2023
ee88a7e5
08 Apr, 2023
1 commit
Implement block copy kernel to optimize beam search (#32)
· 0f40557a
Woosuk Kwon
authored
Apr 07, 2023
0f40557a
05 Apr, 2023
1 commit
Add CUDA graph-based all reduce launcher (#26)
· 12659a0b
Woosuk Kwon
authored
Apr 05, 2023
12659a0b
31 Mar, 2023
1 commit
Optimize tensor parallel execution speed (#17)
· c45f3c3a
Zhuohan Li
authored
Apr 01, 2023
c45f3c3a
29 Mar, 2023
1 commit
FastAPI-based working frontend (#10)
· 721fa3df
Zhuohan Li
authored
Mar 29, 2023
721fa3df