yinglu
ba897f8435
ck:tf32:complement CK_ENABLE_TF32 controls (#3426)
2025-12-19 09:17:29 +08:00
yinglu
8fec8054b2
ck: add tf32 in DTYPES to control instances build(#3317)
2025-12-08 16:24:20 +08:00
Aviral Goel
0aadb4b2c4
chore(copyright): update copyright header for profiler directory (#3205)
* chore(copyright): update copyright header for tile_engine directory
* chore(copyright): update copyright header for script directory
* chore(copyright): update copyright header for test_data directory
* chore(copyright): update copyright header for python directory
* chore(copyright): update copyright header for profiler directory
2025-11-14 11:19:25 -08:00
yinglu
fada1a3cae
Conv:TF32: add more instances - 2 (#2879)
* add instances of device_grouped_conv_fwd_xdl_f32_comp_instances
* add instances of device_grouped_conv_fwd_xdl_f32_tf32_mem_instances
* add instances of device_grouped_conv_fwd_xdl_large_tensor_f32_tf32_instances
* tf32:conv:add instances for base class DeviceConvFwd
* tf32:conv:add instances for base class DeviceGroupedConvBwdDataMultipleD
* tf32:conv:add instances for base class DeviceGroupedConvBwdWeight
* add tf32 in profiler
* remove gnhwc/ngchw/ngcdhw instances
* remove non-ndhwgc/nhwgc/nhwc instances
* add check in IsSupportedArgument()
2025-10-10 15:28:17 +08:00
Bartłomiej Kocot
4094ad158a
Integrate universal gemm with conv bwd data and add SplitK (#1315)
* Integrate universal gemm with conv bwd data
* Fix multi d kernel
* Add splitK support
* instances refactor
* instances refactor
* refactor
* fixes
* fixes
* 16x16 instances
* Fixes
* Fix
* Fix
* Fix
* Fix
* Fix
* Fixes
* fix
* fix
2025-04-28 23:54:49 +02:00
Bartłomiej Kocot
8c0ab61ece
Grouped conv backward data GKCYX support (#2029)
* Grouped conv backward data GKCYX support
* profiler
* Converter
* split instances
2025-04-01 13:24:38 -07:00
Bartłomiej Kocot
c2e4898b4b
Grouped conv bwd data NGCHW (#1967)
* Grouped conv bwd data NGCHW
* fixes
* fix
* Improvements
* Fix
* Fix
* add client example
2025-03-17 13:32:00 +01:00
Bartłomiej Kocot
49180fd60b
Grouped 3d conv backward data support (#799)
* Grouped 3d conv backward data support
* Fix comments
2023-07-18 11:01:33 -05:00
Bartłomiej Kocot
63388e84ab
Support bf16/f32/f16 and NHWGC conv2d_bwd_data (#757)
* Support bf16/f32/f16 and NHWGC conv2d_bwd_data
* Add interface test
* clang format
* Comment fixes
* Add more friendly error message
2023-06-21 08:20:31 -05:00