mirror of
https://github.com/ROCm/composable_kernel.git
synced 2026-05-01 12:11:19 +00:00
* fix compile error * fix typo of padding * Add smoothquant op * Add smoothquant instance library * refine type * add test script * Re-generate smoothquant.hpp * Always use 'current year' in copyright * use Generic2dBlockShape instead * Add vector = 8 instance back * Find exe path automatically * Simplify the api condition * Remove debugging code * update year * Add blank line between function declaration * explicitly cast return value to dim3 * refine return value * Fix default warmup and repeat value * Add comment * refactor sommthquant cmake * Add README * Fix typo --------- Co-authored-by: Po Yen, Chen <PoYen.Chen@amd.com>
22 lines
558 B
Markdown
22 lines
558 B
Markdown
# smoothquant
|
|
|
|
This folder contains example for smoothquant using ck_tile tile-programming implementation.
|
|
|
|
## build
|
|
```
|
|
# in the root of ck_tile
|
|
mkdir build && cd build
|
|
sh ../script/cmake-ck-dev.sh ../ <arch> # you can replace this <arch> to gfx90a, gfx942...
|
|
make tile_smoothquant -j
|
|
```
|
|
This will result in an executable `build/bin/tile_smoothquant`
|
|
|
|
## cmdline
|
|
```
|
|
args:
|
|
-m m dimension (default:3328)
|
|
-n m dimension (default:4096)
|
|
-v cpu validation or not (default:1)
|
|
-prec precision (default:fp16)
|
|
```
|