ox696c / ktransformers
Commit 0564ac84, authored Feb 12, 2025 by Azure
update marlin expert example
parent a2fc2a86
Changes: 1
Showing 1 changed file with 20 additions and 2 deletions

ktransformers/optimize/optimize_rules/DeepSeek-V3-Chat-multi-gpu-marlin.yaml (+20 -2)
@@ -79,6 +79,24 @@
       generate_device: "cuda:1"
       prefill_device: "cuda:1"
 
+- match:
+    name: "^model\\.layers\\.(0|[1-4])\\.mlp\\.experts$" # inject experts in layer 0~4 as marlin expert
+  replace:
+    class: ktransformers.operators.experts.KTransformersExperts
+    kwargs:
+      generate_device: "cuda:0" # run in cuda:0
+      generate_op: "KExpertsMarlin"
+  recursive: False
+
+- match:
+    name: "^model\\.layers\\.([3][0])\\.mlp\\.experts$" # inject experts in layer 30~31 as marlin expert
+  replace:
+    class: ktransformers.operators.experts.KTransformersExperts
+    kwargs:
+      generate_device: "cuda:1"
+      generate_op: "KExpertsMarlin"
+  recursive: False
+
 - match:
     name: "^model\\.layers\\.(0|[1-9]|[12][0-9])\\.mlp\\.experts$"
   replace:
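The two rules added in the hunk above select layers by regular expression, so it is worth checking which layer indices each pattern actually covers. The following is a minimal sketch using only Python's `re` module. It assumes the patterns are applied to module names of the form `model.layers.<i>.mlp.experts`. The helper `matched_layers` and the bound of 61 layers are illustrative assumptions, not part of ktransformers, and the names it generates are synthetic and may not all exist in the real model.

```python
import re

# Patterns as they read once YAML has unescaped "\\." to "\." inside the
# double-quoted strings of the rule file.
MARLIN_GPU0 = r"^model\.layers\.(0|[1-4])\.mlp\.experts$"
MARLIN_GPU1 = r"^model\.layers\.([3][0])\.mlp\.experts$"
EXISTING = r"^model\.layers\.(0|[1-9]|[12][0-9])\.mlp\.experts$"

NUM_LAYERS = 61  # assumption: only used to bound the scan


def matched_layers(pattern, num_layers=NUM_LAYERS):
    """Return the layer indices whose (synthetic) experts module name matches."""
    rx = re.compile(pattern)
    return [
        i for i in range(num_layers)
        if rx.search(f"model.layers.{i}.mlp.experts")
    ]


print(matched_layers(MARLIN_GPU0))  # [0, 1, 2, 3, 4]
print(matched_layers(MARLIN_GPU1))  # [30]
print(matched_layers(EXISTING))     # 0 through 29
```

Under these assumptions, `([3][0])` matches layer 30 only, although the inline comment in the diff says "layer 30~31"; a pattern such as `(3[01])` would be needed to cover both. Layers 0 to 4 match both the first new pattern and the existing broader pattern, so which rule applies depends on how the rule list is resolved. The diff places the new rules ahead of the existing one.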
@@ -139,5 +157,5 @@
   replace:
     class: "default"
     kwargs:
-      generate_device: "cuda:1"
-      prefill_device: "cuda:1"
+      generate_device: "cuda:0"
+      prefill_device: "cuda:0"