docs: update ckProfiler readme with selective building option (#3140)

* docs: update ckProfiler readme with selective building option

* docs: add list of operations for ckProfiler
This commit is contained in:
Aviral Goel
2025-11-11 14:27:33 -05:00
committed by GitHub
parent ab68c9d384
commit c54ecd905b

View File

@@ -1,5 +1,23 @@
[Back to the main page](../README.md)
# Composable Kernel profiler
## Building Specific Profilers
To reduce build time, filter which operations to compile using CMake options:
```bash
# Build all grouped_gemm variants (grouped_gemm, grouped_gemm_fastgelu, grouped_gemm_tile_loop, etc.)
cmake -DCK_PROFILER_OP_FILTER="grouped_gemm" <other options> ..
# Build ONLY base grouped_gemm (excludes variants - use exact regex match with ^ and $)
cmake -DCK_PROFILER_OP_FILTER="^grouped_gemm$" <other options> ..
```
Both `CK_PROFILER_OP_FILTER` and `CK_PROFILER_INSTANCE_FILTER` accept regex patterns. Default builds all operations.
To find the complete list of operations, run the following command:
```bash
find profiler/src -name "profile_*.cpp" | sed 's|profiler/src/profile_||' | sed 's|.cpp||' | sort
```
## Profiler GEMM UNIVERSAL kernels
```bash
# arg1: tensor operation (gemm_universal: Universal GEMM)