Split k f16 (#97)

* init for splitk f16

* a working prototype

* debug

* perf debug

* update example

* instances for mk kn

* add instances for all layers

* clean

* clean

* add tuning

* format

* add mn_padding into irregular tile

* clean

Co-authored-by: Chao Liu <chao.liu2@amd.com>
This commit is contained in:
zjing14
2022-02-25 01:19:37 -06:00
committed by GitHub
parent bdedf64b98
commit e221d11e51
11 changed files with 1713 additions and 30 deletions

View File

@@ -39,12 +39,12 @@ std::size_t GetFlops(ck::index_t N,
std::accumulate(std::begin(output_spatial_lengths),
std::end(output_spatial_lengths),
static_cast<std::size_t>(1),
std::multiplies<std::size_t>()) *
std::multiplies<std::size_t>()) *
C *
std::accumulate(std::begin(filter_spatial_lengths),
std::end(filter_spatial_lengths),
static_cast<std::size_t>(1),
std::multiplies<std::size_t>());
std::multiplies<std::size_t>());
}
/**