Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
OpenDAS
TransformerEngine
Repository
fad3044bde1547eae9543a6a3f80401e59bb629e
Switch branch/tag
TransformerEngine
transformer_engine
common
transpose
cast_transpose.cu
Find file
Blame
History
Permalink
Use 4B vector loads/stores in cast-transpose kernel for small matrices (#101)
· 30632f31
Tim Moon
authored
Mar 17, 2023
Signed-off-by:
Tim Moon
<
tmoon@nvidia.com
>
30632f31
cast_transpose.cu
19.7 KB
Edit
Web IDE
Replace cast_transpose.cu
×
Attach a file by drag & drop or
click to upload
Commit message
Replace cast_transpose.cu
Replace file
Cancel
A new branch will be created in your fork and a new merge request will be started.