Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
96d4afdc
Commit
96d4afdc
authored
Oct 31, 2025
by
zhuwenwen
Browse files
update moe configs name
parent
2b90ecd3
Changes
61
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
3 additions
and
3 deletions
+3
-3
vllm/utils/__init__.py
vllm/utils/__init__.py
+3
-3
No files found.
vllm/utils/__init__.py
View file @
96d4afdc
...
@@ -84,9 +84,9 @@ DEFAULT_MAX_NUM_BATCHED_TOKENS = 2048
...
@@ -84,9 +84,9 @@ DEFAULT_MAX_NUM_BATCHED_TOKENS = 2048
POOLING_MODEL_MAX_NUM_BATCHED_TOKENS
=
32768
POOLING_MODEL_MAX_NUM_BATCHED_TOKENS
=
32768
MULTIMODAL_MODEL_MAX_NUM_BATCHED_TOKENS
=
5120
MULTIMODAL_MODEL_MAX_NUM_BATCHED_TOKENS
=
5120
gpuname
=
torch
.
cuda
.
get_device_properties
(
torch
.
cuda
.
current_device
()).
n
ame
GPU_ARCH
=
torch
.
cuda
.
get_device_properties
(
"cuda"
).
gcnArchN
ame
is_kme
=
gpuname
.
startswith
(
'K100_AI'
)
or
gpuname
.
startswith
(
'K500SM_AI'
)
is_kme
=
any
(
arch
in
GPU_ARCH
for
arch
in
[
"gfx928"
]
)
SUPPORT_TC
=
gpuname
.
startswith
(
'K100_AI'
)
or
gpuname
.
startswith
(
'K500SM_AI'
)
or
gpuname
.
startswith
(
'BW'
)
SUPPORT_TC
=
any
(
arch
in
GPU_ARCH
for
arch
in
[
"gfx928"
,
"gfx936"
]
)
def
_generate_random_int8
(
def
_generate_random_int8
(
tensor
:
torch
.
Tensor
,
tensor
:
torch
.
Tensor
,
...
...
Prev
1
2
3
4
Next
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment