Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
b1eb4ca1
Unverified
Commit
b1eb4ca1
authored
Apr 08, 2025
by
Chengji Yao
Committed by
GitHub
Apr 09, 2025
Browse files
[TPU] Update PyTorch/XLA (#16288)
Signed-off-by:
Chengji Yao
<
chengjiyao@google.com
>
parent
87b4ac56
Changes
2
Hide whitespace changes
Inline
Side-by-side
Showing
2 changed files
with
10 additions
and
14 deletions
+10
-14
requirements/tpu.txt
requirements/tpu.txt
+6
-6
tests/v1/tpu/test_pallas.py
tests/v1/tpu/test_pallas.py
+4
-8
No files found.
requirements/tpu.txt
View file @
b1eb4ca1
...
...
@@ -17,10 +17,10 @@ ray[data]
--find-links https://storage.googleapis.com/libtpu-releases/index.html
--find-links https://storage.googleapis.com/jax-releases/jax_nightly_releases.html
--find-links https://storage.googleapis.com/jax-releases/jaxlib_nightly_releases.html
torch @ https://storage.googleapis.com/pytorch-xla-releases/wheels/tpuvm/torch-2.8.0.dev2025040
6
-cp39-cp39-linux_x86_64.whl ; python_version == "3.9"
torch @ https://storage.googleapis.com/pytorch-xla-releases/wheels/tpuvm/torch-2.8.0.dev2025040
6
-cp310-cp310-linux_x86_64.whl ; python_version == "3.10"
torch @ https://storage.googleapis.com/pytorch-xla-releases/wheels/tpuvm/torch-2.8.0.dev2025040
6
-cp311-cp311-linux_x86_64.whl ; python_version == "3.11"
torch_xla[tpu, pallas] @ https://storage.googleapis.com/pytorch-xla-releases/wheels/tpuvm/torch_xla-2.8.0.dev2025040
6
-cp39-cp39-linux_x86_64.whl ; python_version == "3.9"
torch_xla[tpu, pallas] @ https://storage.googleapis.com/pytorch-xla-releases/wheels/tpuvm/torch_xla-2.8.0.dev2025040
6
-cp310-cp310-linux_x86_64.whl ; python_version == "3.10"
torch_xla[tpu, pallas] @ https://storage.googleapis.com/pytorch-xla-releases/wheels/tpuvm/torch_xla-2.8.0.dev2025040
6
-cp311-cp311-linux_x86_64.whl ; python_version == "3.11"
torch @ https://storage.googleapis.com/pytorch-xla-releases/wheels/tpuvm/torch-2.8.0.dev2025040
8
-cp39-cp39-linux_x86_64.whl ; python_version == "3.9"
torch @ https://storage.googleapis.com/pytorch-xla-releases/wheels/tpuvm/torch-2.8.0.dev2025040
8
-cp310-cp310-linux_x86_64.whl ; python_version == "3.10"
torch @ https://storage.googleapis.com/pytorch-xla-releases/wheels/tpuvm/torch-2.8.0.dev2025040
8
-cp311-cp311-linux_x86_64.whl ; python_version == "3.11"
torch_xla[tpu, pallas] @ https://storage.googleapis.com/pytorch-xla-releases/wheels/tpuvm/torch_xla-2.8.0.dev2025040
8
-cp39-cp39-linux_x86_64.whl ; python_version == "3.9"
torch_xla[tpu, pallas] @ https://storage.googleapis.com/pytorch-xla-releases/wheels/tpuvm/torch_xla-2.8.0.dev2025040
8
-cp310-cp310-linux_x86_64.whl ; python_version == "3.10"
torch_xla[tpu, pallas] @ https://storage.googleapis.com/pytorch-xla-releases/wheels/tpuvm/torch_xla-2.8.0.dev2025040
8
-cp311-cp311-linux_x86_64.whl ; python_version == "3.11"
tests/v1/tpu/test_pallas.py
View file @
b1eb4ca1
...
...
@@ -4,9 +4,7 @@ from unittest.mock import ANY, patch
import
torch
from
vllm.attention.backends.abstract
import
AttentionType
from
vllm.v1.attention.backends.pallas
import
(
NUM_KV_PAGES_PER_BLOCK
,
NUM_QUERIES_PER_BLOCK
,
PallasAttentionBackendImpl
,
from
vllm.v1.attention.backends.pallas
import
(
PallasAttentionBackendImpl
,
PallasMetadata
)
...
...
@@ -32,8 +30,6 @@ def test_ragged_paged_attention():
logits_soft_cap
=
logits_soft_cap
,
attn_type
=
AttentionType
.
DECODER
,
)
mock_vmem_limit_bytes
=
1024
attn_impl
.
vmem_limit_bytes
=
mock_vmem_limit_bytes
class
FakeAttentionLayer
:
_k_scale_float
:
float
...
...
@@ -88,9 +84,9 @@ def test_ragged_paged_attention():
ANY
,
# block_tables
ANY
,
# query_start_loc
ANY
,
# num_seqs
num_kv_pages_per_block
=
N
UM_KV_PAGES_PER_BLOCK
,
num_queries_per_block
=
N
UM_QUERIES_PER_BLOCK
,
vmem_limit_bytes
=
mock_vmem_limit_bytes
,
num_kv_pages_per_block
=
N
one
,
num_queries_per_block
=
N
one
,
vmem_limit_bytes
=
None
,
use_kernel
=
True
,
sm_scale
=
scale
,
sliding_window
=
sliding_window
,
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment