Mirror of https://github.com/ROCm/composable_kernel.git (synced 2026-05-18 12:00:07 +00:00)
* Add maxpool f32 kernel and example
* Revise copyright
* Add device pool bwd device op
* Support f16 and bf16
* Add compute datatype for reference code.
  Prevent error in bf16
* Fix type error
* Remove layout
* Fix bf16 error
* Add f16 and bf16 example
* Add more operations
* Implement IsSupportedArgument
* Add changelog
* Add comment
* Add comment
* Remove useless header
* Move initialize of workspace to the run
* Move set din zero to the device operator
* Save din_length_raw
* Remove useless header
* Calculate gridsize according to the number of CU
* Calculate gridSize according to the number of CU.
  Remove useless header
* Add put example
* Remove useless header
* Fix CI fail
[ROCm/composable_kernel commit: 341ad95665]
28 lines
794 B
Markdown
# Change Log for Composable Kernel

Full documentation for Composable Kernel is not yet available.

## CK 0.2.0 for ROCm 5.5.0

### Fixed

- Fixed a bug in 6-dimensional kernels (#555).
- Fixed grouped ConvBwdWeight test case failure (#524).

### Optimizations

- Improved performance of the normalization kernel.

### Added

- Added support for NAVI3x.
- Added user tutorial (#563).
- Added more instances for irregular GEMM sizes (#560).
- Added inter-wave consumer-producer programming model for GEMM kernels (#310).
- Added multi-D GEMM client APIs (#534).
- Added multi-embeddings support (#542).
- Added Navi3x blockwise GEMM and real GEMM support (#541).
- Added Navi grouped ConvBwdWeight support (#505).
- Added pool3d forward (#697).
- Added maxpool backward (#750).

### Changed

- Changed ...