"vscode:/vscode.git/clone" did not exist on "e3ecfeda1055d0299f1a46674fbd93b7e6a3353f"
Conv3d new (#94)
* conv3d compiles but has memory error
* conv3d works
* fix performance issue by using __builtin_amdgc_readfirstlane
* change MakeBlock2CTileMap to MakeDefaultBlock2CTileMap; change c_blockid_to* to cblockid_to*
* clang-format
* remove CK_EXPERIMENTAL_PASS_TENSOR_DECRIPTOR_BY_*; moved wrapper into DeviceConv3d
* format
* remove useless marc
* add comment
Co-authored-by:
Chao Liu <chao.liu2@amd.com>
Showing
Please register or sign in to comment