Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
767c727a
Unverified
Commit
767c727a
authored
Jun 08, 2024
by
Calvinn Ng
Committed by
GitHub
Jun 07, 2024
Browse files
fix DbrxFusedNormAttention missing cache_config (#5340)
Co-authored-by:
team
<
calvinn.ng@ahrefs.com
>
parent
6840a716
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
2 additions
and
1 deletion
+2
-1
vllm/model_executor/models/dbrx.py
vllm/model_executor/models/dbrx.py
+2
-1
No files found.
vllm/model_executor/models/dbrx.py
View file @
767c727a
...
...
@@ -247,11 +247,12 @@ class DbrxFusedNormAttention(nn.Module):
def
__init__
(
self
,
config
:
DbrxConfig
,
cache_config
:
Optional
[
CacheConfig
]
=
None
,
quant_config
:
Optional
[
QuantizationConfig
]
=
None
,
):
super
().
__init__
()
self
.
d_model
=
config
.
d_model
self
.
attn
=
DbrxAttention
(
config
,
quant_config
)
self
.
attn
=
DbrxAttention
(
config
,
cache_config
,
quant_config
)
self
.
norm_1
=
nn
.
LayerNorm
(
self
.
d_model
)
self
.
norm_2
=
nn
.
LayerNorm
(
self
.
d_model
)
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment