mirror of
https://github.com/ROCm/composable_kernel.git
synced 2026-05-11 17:00:18 +00:00
* add more instances for bfp16 * reduce the gemm input values to prevent round-off errors --------- Co-authored-by: Jing Zhang <jizha@amd.com> Co-authored-by: illsilin <Illia.Silin@amd.com>