Aviral Goel
d85f065b15
chore(copyright): update copyright header for example directory ( #3273 )
...
* chore(copyright): update copyright header for codegen directory
* chore(copyright): update copyright header for example directory
2025-11-24 18:02:41 -08:00
yinglu
dd7af118d7
TF32 POC in Conv3d on MI30x platform #2763 (second attempt) ( #2852 )
...
* Revert "Revert "feature:tf32:add initial conv3d fwd kernel support (#2763 )" (#2848 )"
This reverts commit 03b59f8c76 .
* fix compile error on gf12x
* only run tf32 example on gfx942
* only build tf32 instance on gfx942
* ckProfiler:only support tf32 in gfx942
* delete unuseful messages
2025-09-17 14:50:15 -07:00
Illia Silin
03b59f8c76
Revert "feature:tf32:add initial conv3d fwd kernel support ( #2763 )" ( #2848 )
...
This reverts commit c51102144f .
2025-09-15 08:27:04 -07:00
lym
c51102144f
feature:tf32:add initial conv3d fwd kernel support ( #2763 )
2025-09-15 21:03:00 +08:00
linqunAMD
321627aec5
Extend XDL kernel to Support RDNA3/4 - Part 4 ( #2724 )
...
* Fix example
* fix build error
* update pk_i4 & moe test case
* fix all instance build (examples)
* fix batched_gemm_gemm (example)
* disable example_gemm_bias_softmax_gemm_permute on gfx11
* remove unnecessary disable gfx11
* update tests
* update tests2
2025-09-12 08:17:07 -07:00
Rostyslav Geyyer
e626d5202a
Add instances for conv_scale with fp8 in/out ( #1193 )
...
* Add fp8 conv instances and client example
* Format
* Add example
* Update cmakelists
* Add profiler mode
* Format
* Fix copyright headers
2024-03-15 09:50:03 -07:00