Aviral Goel
91ffc9dd1e
chore(copyright): update copyright header for example directory (#3273)
* chore(copyright): update copyright header for codegen directory
* chore(copyright): update copyright header for example directory
[ROCm/composable_kernel commit: d85f065b15]
2025-11-24 18:02:41 -08:00
yinglu
19463895a8
TF32 POC in Conv3d on MI30x platform #2763 (second attempt) (#2852)
* Revert "Revert "feature:tf32:add initial conv3d fwd kernel support (#2763)" (#2848)"
This reverts commit 954db22b39.
* fix compile error on gfx12x
* only run tf32 example on gfx942
* only build tf32 instance on gfx942
* ckProfiler: only support tf32 on gfx942
* delete unneeded messages
[ROCm/composable_kernel commit: dd7af118d7]
2025-09-17 14:50:15 -07:00
Illia Silin
954db22b39
Revert "feature:tf32:add initial conv3d fwd kernel support (#2763)" (#2848)
This reverts commit d4dbf93119.
[ROCm/composable_kernel commit: 03b59f8c76]
2025-09-15 08:27:04 -07:00
lym
d4dbf93119
feature:tf32:add initial conv3d fwd kernel support (#2763)
[ROCm/composable_kernel commit: c51102144f]
2025-09-15 21:03:00 +08:00
linqunAMD
07def6b13d
Extend XDL kernel to Support RDNA3/4 - Part 4 (#2724)
* Fix example
* fix build error
* update pk_i4 & moe test case
* fix all instance build (examples)
* fix batched_gemm_gemm (example)
* disable example_gemm_bias_softmax_gemm_permute on gfx11
* remove unnecessary disable gfx11
* update tests
* update tests2
[ROCm/composable_kernel commit: 321627aec5]
2025-09-12 08:17:07 -07:00
Rostyslav Geyyer
17be33ccf9
Add instances for conv_scale with fp8 in/out (#1193)
* Add fp8 conv instances and client example
* Format
* Add example
* Update cmakelists
* Add profiler mode
* Format
* Fix copyright headers
[ROCm/composable_kernel commit: e626d5202a]
2024-03-15 09:50:03 -07:00