Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
FlashMLA
Commits
98c9821f
Commit
98c9821f
authored
Jun 05, 2026
by
shenzhe
Committed by
zhanghj2
Jun 06, 2026
Browse files
Fix CUTLASS submodule setup
parent
2c35de66
Changes
5
Hide whitespace changes
Inline
Side-by-side
Showing
5 changed files
with
22 additions
and
6 deletions
+22
-6
.gitmodules
.gitmodules
+4
-3
README.md
README.md
+4
-0
csrc/cutlass
csrc/cutlass
+0
-1
csrc/cutlass/cutlass_3.2.1
csrc/cutlass/cutlass_3.2.1
+1
-0
setup.py
setup.py
+13
-2
No files found.
.gitmodules
View file @
98c9821f
[submodule "csrc/cutlass"]
[submodule "csrc/cutlass/cutlass_3.2.1"]
path = csrc/cutlass
path = csrc/cutlass/cutlass_3.2.1
url = https://github.com/NVIDIA/cutlass.git
url = ssh://git@10.16.1.204:10022/dcutoolkit/deeplearing/cutlass_3.2.1.git
branch = feature/16x64-mmac
README.md
View file @
98c9821f
...
@@ -78,6 +78,10 @@ git submodule update --init --recursive
...
@@ -78,6 +78,10 @@ git submodule update --init --recursive
pip
install
-v
.
pip
install
-v
.
```
```
The CUTLASS dependency is pinned as
`csrc/cutlass/cutlass_3.2.1`
on branch
`feature/16x64-mmac`
. If the submodule is missing,
`setup.py`
will try to
initialize it before compiling.
## Usage
## Usage
### MLA Decoding
### MLA Decoding
...
...
cutlass
@
147f5673
Compare
147f5673
...
147f5673
Subproject commit 147f5673d0c1c3dcf66f78d677fd647e4a020219
cutlass_3.2.1
@
a46390fe
Subproject commit a46390fe2220776773f08eaeca8a13f9a76055b6
setup.py
View file @
98c9821f
...
@@ -38,11 +38,22 @@ def get_arch_flags():
...
@@ -38,11 +38,22 @@ def get_arch_flags():
# # nvcc_threads = os.getenv("NVCC_THREADS") or "32"
# # nvcc_threads = os.getenv("NVCC_THREADS") or "32"
# return ["--threads", nvcc_threads]
# return ["--threads", nvcc_threads]
# subprocess.run(["git", "submodule", "update", "--init", "csrc/cutlass"])
this_dir
=
os
.
path
.
dirname
(
os
.
path
.
abspath
(
__file__
))
this_dir
=
os
.
path
.
dirname
(
os
.
path
.
abspath
(
__file__
))
os
.
environ
[
'PYTORCH_NVCC'
]
=
'aicc'
os
.
environ
[
'PYTORCH_NVCC'
]
=
'aicc'
cutlass_dir
=
Path
(
this_dir
)
/
"csrc"
/
"cutlass"
/
"cutlass_3.2.1"
if
not
(
cutlass_dir
/
"include"
).
exists
():
subprocess
.
run
(
[
"git"
,
"submodule"
,
"update"
,
"--init"
,
"--recursive"
,
"csrc/cutlass/cutlass_3.2.1"
],
cwd
=
this_dir
,
check
=
True
,
)
if
not
(
cutlass_dir
/
"include"
).
exists
():
raise
RuntimeError
(
"CUTLASS dependency is missing. Run "
"`git submodule update --init --recursive csrc/cutlass/cutlass_3.2.1` first."
)
if
False
:
if
False
:
cxx_args
=
[
"/O2"
,
"/std:c++20"
,
"/DNDEBUG"
,
"/W0"
]
cxx_args
=
[
"/O2"
,
"/std:c++20"
,
"/DNDEBUG"
,
"/W0"
]
else
:
else
:
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment