Files
composable_kernel/CHANGELOG.md
Rostyslav Geyyer 246ceee49e Add Grouped Conv Backward Weight on Navi21 for ResNet50. (#505)
* Add DeviceOp and examples

* Format DeviceOp template arguments

* Remove bf16 example

* Format

* Format

* Update MakeABCGridDescriptor_A_K0_M_K1_B_K0_N_K1_C_M_N

* Refactor argument preparation

* Update conv_bwd_weight_dl to grouped_conv_bwd_weight_dl

* Rename device op file

* Update include directive in the example file

* Update descriptor preparation for grouped op

* Update the argument

* Update batch handling

* Add gridwise gemm supporting batched input

* Update blockwise indexing, working version

* Update copyright year

* Update check if argument is supported

* Refactor and make consistent with xdl examples

* Update check if argument is supported

* Add changelog entry

* Added comments on Dl op split_k>1 support

---------

Co-authored-by: Rosty Geyyer <rosty.geyyer@amd.com>
Co-authored-by: zjing14 <zhangjing14@gmail.com>
2023-02-22 11:59:53 -06:00

703 B

Change Log for Composable Kernel

Full documentation for Composable Kernel is not yet available.

CK 0.1.1 for ROCm 5.5.0

Fixed

  • Fixed a bug in 6-dimensional kernels (#555).
  • Fixed grouped ConvBwdWeight test case failure (#524).

Optimizations

  • Improve proformance of normalization kernel

Added

  • Added user tutorial (#563).
  • Added more instances for irregular GEMM sizes (#560).
  • Added inter-wave consumer-producer programming model for GEMM kernels (#310).
  • Added multi-D GEMM client APIs (#534).
  • Added multi-embeddings support (#542).
  • Added Navi3x blockwise GEMM and real GEMM support (#541).
  • Added Navi grouped ConvBwdWeight support (#505).

Changed

  • Changed ...