Jianfeng Yan 8592df2211 Conv3d new (#94)
* conv3d compiles but has memory error

* conv3d works

* fix performance issue by using __builtin_amdgc_readfirstlane

* change MakeBlock2CTileMap to MakeDefaultBlock2CTileMap; change c_blockid_to* to cblockid_to*

* clang-format

* remove CK_EXPERIMENTAL_PASS_TENSOR_DECRIPTOR_BY_*; moved wrapper into DeviceConv3d

* format

* remove useless marc

* add comment

Co-authored-by: Chao Liu <chao.liu2@amd.com>

[ROCm/composable_kernel commit: 6dfb92bbef]
2022-02-22 22:45:28 -06:00
2022-02-18 21:44:11 -06:00
2022-02-22 22:45:28 -06:00
2022-02-22 22:45:28 -06:00
2022-02-22 22:45:28 -06:00
2022-02-22 22:45:28 -06:00
2022-02-06 22:32:47 -06:00
2022-02-22 22:45:28 -06:00
2018-10-08 22:49:58 -05:00
2021-08-08 17:41:54 +00:00
2022-02-18 21:44:11 -06:00
2022-02-18 21:44:11 -06:00
2022-02-18 21:44:11 -06:00
2022-02-18 21:44:11 -06:00
2022-02-18 21:44:11 -06:00
Description
[DEPRECATED] Moved to ROCm/rocm-libraries repo. NOTE: develop branch is maintained as a read-only mirror
Readme MIT 234 MiB
Languages
C++ 93.1%
Python 4.5%
CMake 1.5%
Shell 0.5%
Pawn 0.2%