mirror of
https://github.com/ROCm/composable_kernel.git
synced 2026-04-19 22:39:03 +00:00
docs: update ckProfiler readme with selective building option (#3140)
* docs: update ckProfiler readme with selective building option * docs: add list of operations for ckProfiler
This commit is contained in:
@@ -1,5 +1,23 @@
|
||||
[Back to the main page](../README.md)
|
||||
# Composable Kernel profiler
|
||||
|
||||
## Building Specific Profilers
|
||||
To reduce build time, filter which operations to compile using CMake options:
|
||||
|
||||
```bash
|
||||
# Build all grouped_gemm variants (grouped_gemm, grouped_gemm_fastgelu, grouped_gemm_tile_loop, etc.)
|
||||
cmake -DCK_PROFILER_OP_FILTER="grouped_gemm" <other options> ..
|
||||
|
||||
# Build ONLY base grouped_gemm (excludes variants - use exact regex match with ^ and $)
|
||||
cmake -DCK_PROFILER_OP_FILTER="^grouped_gemm$" <other options> ..
|
||||
```
|
||||
|
||||
Both `CK_PROFILER_OP_FILTER` and `CK_PROFILER_INSTANCE_FILTER` accept regex patterns. Default builds all operations.
|
||||
|
||||
To find the complete list of operations, run the following command:
|
||||
```bash
|
||||
find profiler/src -name "profile_*.cpp" | sed 's|profiler/src/profile_||' | sed 's|.cpp||' | sort
|
||||
```
|
||||
## Profiler GEMM UNIVERSAL kernels
|
||||
```bash
|
||||
# arg1: tensor operation (gemm_universal: Universal GEMM)
|
||||
|
||||
Reference in New Issue
Block a user