Aviral Goel
a535de0f75
chore(copyright): update copyright header for example directory ( #3273 )
...
* chore(copyright): update copyright header for codegen directory
* chore(copyright): update copyright header for example directory
[ROCm/composable_kernel commit: d85f065b15 ]
2025-11-24 18:02:41 -08:00
yinglu
3f44e675e4
TF32 POC in Conv3d on MI30x platform #2763 (second attempt) ( #2852 )
...
* Revert "Revert "feature:tf32:add initial conv3d fwd kernel support (#2763 )" (#2848 )"
This reverts commit 82da15ffa430a297fb072d0a15b3ada5753f69b1.
* fix compile error on gf12x
* only run tf32 example on gfx942
* only build tf32 instance on gfx942
* ckProfiler:only support tf32 in gfx942
* delete unuseful messages
[ROCm/composable_kernel commit: dd7af118d7 ]
2025-09-17 14:50:15 -07:00
Illia Silin
8cbf571d53
Revert "feature:tf32:add initial conv3d fwd kernel support ( #2763 )" ( #2848 )
...
This reverts commit 1a97bde100db0b7b5def711082bd2ea0e0aafc03.
[ROCm/composable_kernel commit: 03b59f8c76 ]
2025-09-15 08:27:04 -07:00
lym
5c712f856f
feature:tf32:add initial conv3d fwd kernel support ( #2763 )
...
[ROCm/composable_kernel commit: c51102144f ]
2025-09-15 21:03:00 +08:00
linqunAMD
ba922fdf80
Extend XDL kernel to Support RDNA3/4 - Part 4 ( #2724 )
...
* Fix example
* fix build error
* update pk_i4 & moe test case
* fix all instance build (examples)
* fix batched_gemm_gemm (example)
* disable example_gemm_bias_softmax_gemm_permute on gfx11
* remove unnecessary disable gfx11
* update tests
* update tests2
[ROCm/composable_kernel commit: 321627aec5 ]
2025-09-12 08:17:07 -07:00
Rostyslav Geyyer
591cd69bb2
Add instances for conv_scale with fp8 in/out ( #1193 )
...
* Add fp8 conv instances and client example
* Format
* Add example
* Update cmakelists
* Add profiler mode
* Format
* Fix copyright headers
[ROCm/composable_kernel commit: e626d5202a ]
2024-03-15 09:50:03 -07:00