Commit Graph

3 Commits

Author SHA1 Message Date
Graner, Johannes
e2f75fa10e Extend GPU reference to enable batchnorm epilogue 2026-01-09 04:33:01 -05:00
Graner, Johannes
9e95a2a62a multiple A/B tensors and D tensor for fwd GPU ref 2026-01-08 08:03:12 -05:00
Johannes Graner
bb8445dca8 [CK] Integrate GPU reference into ckProfiler for convolutions (#3379)
Refactor and integrate CK GPU references into ckProfiler.
- All convolution layouts and groupings supported for all three directions
- Unit tests verifying GPU and CPU reference is the same
- Support added to profiler (do_verification = 2 enables GPU reference)
- One profiler-based test per direction changed to GPU reference to demonstrate usag

Closes AICK-427
2025-12-18 07:59:45 +01:00