Files
composable_kernel/client_example
Zoltán Lakatos 301eb5cf08 Implement device grouped gemm fixed nk multi abd for rdna4 (#3619)
* device struct implementation

* added xdl grouped multi abd fixed nk testing

* wmma implementation fixed

* avoid unnecessary device mem allocation and code cleanups

* cleanup instances definitions

* wmma examples added

* code cleanups

* fix clang format

* typo and compilation fixes related to reference gemm

* fix compilation error due to std::remove_cvref_t

* added missing hip_check_error includes

* correction to example instances

* review commentes addressed

* removed split-k from testing

* code formatting

---------

Co-authored-by: Zoltán Lakatos <zoltan.lakatos@streamhpc.com>
Co-authored-by: illsilin_amdeng <Illia.Silin@amd.com>
2026-02-02 13:58:11 -08:00
..
2025-06-24 12:13:18 -04:00

Back to the main page

Composable Kernel client examples

Client application links to CK library, and therefore CK library needs to be installed before building client applications.

Build

mkdir -p client_example/build
cd client_example/build
cmake                                                                 \
-D CMAKE_CXX_COMPILER=/opt/rocm/bin/hipcc                             \
-D CMAKE_PREFIX_PATH="/opt/rocm;${PATH_TO_CK_INSTALL_DIRECTORY}"      \
-D GPU_TARGETS="gfx908;gfx90a"                                        \
..

You must set the GPU_TARGETS macro to specify the GPU target architecture(s).

Build client example

 make -j