mirror of
https://github.com/ROCm/composable_kernel.git
synced 2026-05-16 10:59:55 +00:00
ckProfiler for layernorm (#330)
* Refine parameter
* Add base class for layernorm
* Add layernorm instance
* Add layernorm to ckProfiler
* Remove redundant
* Add verification
* Fix compile error due to merge
[ROCm/composable_kernel commit: fdfd7eb597]
@@ -46,7 +46,7 @@ using DeviceInstance = ck::tensor_operation::device::DeviceLayernorm<XDataType,
     8, // SrcScalarPerVector
     8, // GammaScalarPerVector
     8, // BetaScalarPerVector
-    1>; // OutScalarPerVector
+    8>; // OutScalarPerVector

 int main()
 {
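
The commit message lists "Add verification" but this hunk does not show how the check is done. As a rough illustration only, here is a minimal host-side sketch of a layernorm reference and a tolerance comparison. The names `reference_layernorm` and `all_close`, the row-major 2D layout, the default `epsilon`, and the tolerances are all assumptions made for this example; they are not taken from composable_kernel or ckProfiler.

```cpp
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical host reference (not the composable_kernel implementation).
// Normalizes each row of a row-major [rows x cols] matrix, then applies
// the per-column scale (gamma) and shift (beta).
std::vector<float> reference_layernorm(const std::vector<float>& x,
                                       const std::vector<float>& gamma,
                                       const std::vector<float>& beta,
                                       std::size_t rows,
                                       std::size_t cols,
                                       float epsilon = 1e-5f) // assumed default
{
    std::vector<float> y(rows * cols);
    for(std::size_t r = 0; r < rows; ++r)
    {
        const float* row = x.data() + r * cols;

        // Mean over the normalized (innermost) dimension.
        double mean = 0.0;
        for(std::size_t c = 0; c < cols; ++c)
            mean += row[c];
        mean /= static_cast<double>(cols);

        // Biased variance (divide by cols), as layernorm is usually defined.
        double var = 0.0;
        for(std::size_t c = 0; c < cols; ++c)
        {
            const double d = row[c] - mean;
            var += d * d;
        }
        var /= static_cast<double>(cols);

        const double inv_std = 1.0 / std::sqrt(var + epsilon);
        for(std::size_t c = 0; c < cols; ++c)
            y[r * cols + c] =
                static_cast<float>((row[c] - mean) * inv_std * gamma[c] + beta[c]);
    }
    return y;
}

// Hypothetical element-wise comparison with absolute and relative tolerance.
bool all_close(const std::vector<float>& out,
               const std::vector<float>& ref,
               float atol = 1e-3f,
               float rtol = 1e-3f)
{
    if(out.size() != ref.size())
        return false;
    for(std::size_t i = 0; i < out.size(); ++i)
        if(std::fabs(out[i] - ref[i]) > atol + rtol * std::fabs(ref[i]))
            return false;
    return true;
}
```

In a flow like this, the device output would be copied back to the host and passed to `all_close` together with the result of `reference_layernorm`. Half-precision data types would likely need looser tolerances than the ones assumed here.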