Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
c3dde367f16111b8968948a1f8e1a26bdac6ffdd
Switch branch/tag
vllm_cscc
vllm
worker
tpu_worker.py
26 Jun, 2024
3 commits
[Bugfix][TPU] Fix CPU cache allocation (#5869)
· f5c8628f
Woosuk Kwon
authored
Jun 26, 2024
f5c8628f
[Hardware][TPU] Support parallel sampling & Swapping (#5855)
· cbc53b6b
Woosuk Kwon
authored
Jun 26, 2024
cbc53b6b
[Bugfix][TPU] Fix KV cache size calculation (#5860)
· 3439c5a8
Woosuk Kwon
authored
Jun 26, 2024
3439c5a8
25 Jun, 2024
1 commit
[Hardware][TPU] Refactor TPU backend (#5831)
· bc34937d
Woosuk Kwon
authored
Jun 25, 2024
bc34937d
12 Jun, 2024
1 commit
[Hardware] Initial TPU integration (#5292)
· 1a8bfd92
Woosuk Kwon
authored
Jun 12, 2024
1a8bfd92