Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
f136da15e154b25c7eb3221772a85a15811f0318
Switch branch/tag
vllm_cscc
vllm
worker
tpu_worker.py
28 Jun, 2024
1 commit
[Hardware][TPU] Optimize KV cache swapping (#5878)
· f136da15
Woosuk Kwon
authored
Jun 27, 2024
f136da15
26 Jun, 2024
3 commits
[Bugfix][TPU] Fix CPU cache allocation (#5869)
· f5c8628f
Woosuk Kwon
authored
Jun 26, 2024
f5c8628f
[Hardware][TPU] Support parallel sampling & Swapping (#5855)
· cbc53b6b
Woosuk Kwon
authored
Jun 26, 2024
cbc53b6b
[Bugfix][TPU] Fix KV cache size calculation (#5860)
· 3439c5a8
Woosuk Kwon
authored
Jun 26, 2024
3439c5a8
25 Jun, 2024
1 commit
[Hardware][TPU] Refactor TPU backend (#5831)
· bc34937d
Woosuk Kwon
authored
Jun 25, 2024
bc34937d
12 Jun, 2024
1 commit
[Hardware] Initial TPU integration (#5292)
· 1a8bfd92
Woosuk Kwon
authored
Jun 12, 2024
1a8bfd92