Enrico Degregori
4ebc48a3cd
WMMA gemm_add_relu_add_layernorm ( #2989 )
...
* Summary:
- Refactor epilogue (with CShuffle) to support fused operations:
- EpilogueCShuffleBase holds common parts
- EpilogueCShuffle: runs CShuffle and write out
- EpilogueWelfordCShuffle: holds Welford specific arguments, runs CShuffle, write out, Welford first part and Welford write out
- Extend thread transfer v7r3:
- Support for intermediate data type different from src and dst type
- New functionality to write to dst buffer and keep data (to be able to use them for additional operations)
* Adress review comments
2025-10-31 11:19:26 -07:00
..
2025-09-09 11:22:36 +08:00
2024-09-12 11:47:52 +02:00
2025-09-16 17:47:28 +02:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-04 14:10:24 -07:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-07-28 13:01:07 -07:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-07-28 13:01:07 -07:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-10-31 11:19:26 -07:00
2025-10-31 11:19:26 -07:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-07-28 11:34:07 -07:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-07-28 11:34:07 -07:00
2025-10-17 15:36:39 +03:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-04-03 15:30:21 -07:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-04-03 15:30:21 -07:00
2025-09-22 18:49:06 -07:00
2025-09-09 11:22:36 +08:00
2025-07-28 13:01:07 -07:00
2025-10-31 11:19:26 -07:00
2025-09-24 11:28:20 -07:00
2025-09-09 11:22:36 +08:00
2025-09-22 18:49:06 -07:00
2025-09-09 11:22:36 +08:00
2025-10-09 08:33:16 +02:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-10-02 11:15:24 -07:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-22 18:49:06 -07:00
2025-09-22 18:49:06 -07:00
2025-09-22 18:49:06 -07:00
2025-10-03 07:08:49 -07:00
2025-09-17 14:50:15 -07:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-24 11:28:20 -07:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-06-11 23:41:03 +02:00
2025-10-17 15:36:39 +03:00
2025-07-28 13:01:07 -07:00
2025-09-19 16:27:50 +02:00
2025-10-10 15:28:17 +08:00
2025-10-10 15:28:17 +08:00
2025-05-26 16:51:09 +02:00
2025-10-10 15:28:17 +08:00
2025-10-10 15:28:17 +08:00
2025-10-31 07:52:42 -07:00
2025-07-28 13:01:07 -07:00
2025-10-29 09:54:42 +01:00
2025-10-29 16:04:13 +01:00
2025-09-24 11:28:20 -07:00
2025-10-31 07:52:42 -07:00
2025-10-29 16:04:13 +01:00
2025-03-26 21:13:38 +01:00
2025-10-17 15:36:39 +03:00
2025-07-28 13:01:07 -07:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-07-28 13:01:07 -07:00
2025-01-31 09:48:39 -08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-09-09 11:22:36 +08:00
2025-07-28 13:01:07 -07:00
2024-09-11 15:21:00 +02:00
2024-08-13 16:15:47 +02:00
2024-08-13 16:15:47 +02:00
2024-08-13 16:15:47 +02:00
2024-08-13 16:15:47 +02:00
2025-08-01 14:30:07 -07:00
2025-09-09 11:22:36 +08:00
2025-07-31 12:08:45 +02:00
2025-07-31 12:08:45 +02:00