Chao Liu
818e5cbfe7
magic division use __umulhi() ( #19 )
...
[ROCm/composable_kernel commit: c6f26bb480 ]
2021-08-23 10:40:27 -05:00
Chao Liu
e02d6a0f21
[MIOpen Downstream] Initial MIOpen integration ( #52 )
...
* update online kernel wrapper bundle all descriptors in a tuple
* change __CONSTANT__ to CONSTANT
* rename
* adding tuning
* added IsValidCompileParameter
* reorginze
* adding tunable for fp16 and int8
* fix kernel compile warning and bug fixes
* suppress warning about cast CONSTANT (address space 4) pointer
* fix building issue
[ROCm/composable_kernel commit: f63a23acb1 ]
2021-07-27 00:02:27 -05:00
Chao Liu
f4acec502e
Restructure gridwise and blockwise GEMM, add tensor contraction and FWD-v4r5 ( #36 )
...
* experimenting magic number division
* overhauling fwd-v4r4 to clearly reflect transformation graph
* added fwd-v4r5
* bug fix for make_dynamic_naive_tensor_descriptor_aligned_v2
* bug fix and added sanity-check in transform_dynamic_tensor_descriptor
* added conv_driver_v2
[ROCm/composable_kernel commit: 30072aec37 ]
2021-06-09 23:53:08 -05:00
Chao Liu
e8dee31b22
Initial implementation of magic number division and "Merge" transformation that use it ( #28 )
...
* initial implementation for magic number division and DynamicMerge_v2_magic_division that uses it
* turn off DynamicMerge_v2_magic_division that use magic number division by default
[ROCm/composable_kernel commit: 3bf52e60c5 ]
2021-04-12 21:32:55 -05:00