Files
composable_kernel/example/ck_tile/14_moe_smoothquant
jakpiase 434d19f696 Add ck tile examples to package (#1880)
* add ck tile examples to package

* Update jenkinsfile

* fix for jenkinsfile

* fix for building ck tile code on non gfx9

* compile ck tile examples only for gfx94

* include ck tile examples in all target

* fix for basic gemm UseStructuredSparsity

* Update CMakeLists.txt

* Update gemm_pipeline_problem.hpp

* add targets to rocm install

---------

Co-authored-by: Illia Silin <98187287+illsilin@users.noreply.github.com>
2025-04-28 09:53:19 -07:00
..
2025-01-22 17:34:27 +08:00
2024-11-25 13:12:35 +08:00
2025-01-22 17:34:27 +08:00
2025-01-22 17:34:27 +08:00
2025-01-22 17:34:27 +08:00
2024-11-25 13:12:35 +08:00

moe-smoothquant

This folder contains example for moe-smoothquant using ck_tile tile-programming implementation.

Unlike standard smoothquant op, the input scale is from different expert [expert, hidden], we need reuse the topk-id from previous topk-softmax and select the corresponding expert from current topk, and expand the output/per-token-scale by topk

build

# in the root of ck_tile
mkdir build && cd build
sh ../script/cmake-ck-dev.sh  ../ <arch>  # you can replace this <arch> to gfx90a, gfx942...
make tile_example_moe_smoothquant -j

This will result in an executable build/bin/tile_example_moe_smoothquant