Update AMD buffer coherency (#3403)

* Update AMD buffer coherency [AICK-421]

* fixes

* fix

* fixes

* fixes

* Add backward compatibility

* fix

* fixes

* fix

* fix

* fix

* Update grouped_convolution_backward_weight_kernel.hpp
Bartłomiej Kocot
2025-12-18 10:16:22 +01:00
committed by GitHub
parent 15e81397a4
commit 700b2ec9c0
11 changed files with 268 additions and 98 deletions


@@ -31,7 +31,9 @@ struct BaseFlatmmPipelineAGmemBGmemCRegV1
        ck_tile::ignore = K;
        if(M <= 416)
        {
#if defined(__gfx942__) || defined(__gfx950__)
            return ck_tile::amd_buffer_coherence_enum::WAVE_NT1;
#endif
        }
        return ck_tile::amd_buffer_coherence_enum::coherence_default;
    }
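
The guard in the hunk above can be tried in isolation. The following is a minimal standalone sketch, not the library code: `Coherence` and `select_coherence` are hypothetical stand-ins for `ck_tile::amd_buffer_coherence_enum` and the pipeline's member function, and only the `M <= 416` threshold and the `__gfx942__` / `__gfx950__` checks are taken from the diff.

```cpp
// Hypothetical stand-in for ck_tile::amd_buffer_coherence_enum (assumption:
// only the two values that appear in the hunk are modeled here).
enum class Coherence
{
    Default, // stands in for coherence_default
    WaveNt1  // stands in for WAVE_NT1
};

// Hypothetical free function mirroring the shape of the guarded selection:
// the non-default mode is returned only for small M, and only when the
// translation unit is compiled for one of the listed GPU targets.
constexpr Coherence select_coherence(int M)
{
    if(M <= 416)
    {
#if defined(__gfx942__) || defined(__gfx950__)
        return Coherence::WaveNt1;
#endif
    }
    // Every other case, including any target without the macros above,
    // falls through to the default coherence mode.
    return Coherence::Default;
}
```

When neither `__gfx942__` nor `__gfx950__` is defined, as in an ordinary host compile, the `if` body is empty and the function returns `Coherence::Default` for every `M`.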