Max Podkorytov
d7e7fbdcff
Add generate_identity_sequences helper for common pattern
...
This adds an optimized helper for the common generate_tuple pattern:
generate_tuple([](auto i) { return Sequence<i.value>{}; }, N)
The new generate_identity_sequences<N>() function creates
Tuple<Sequence<0>, Sequence<1>, ..., Sequence<N-1>> without
requiring lambda instantiation at each call site.
Updated 21 call sites across threadwise_tensor_slice_transfer,
wrapper utilities, and layout files to use the new helper.
Build time improvement: ~1.1% wall-clock (18.3s -> 18.1s)
2026-01-16 21:45:31 -06:00
Aviral Goel
de6466481f
chore(copyright): update copyright header for include directory ( #3293 )
2025-11-26 11:00:05 -07:00
Illia Silin
504b101da3
upgrade from clang-format-12 to clang-format-18 ( #2568 )
...
* upgrade to clang-format-18
* update to clang-format-18 in pre-commit-config
2025-07-28 11:34:07 -07:00
Bartłomiej Kocot
42fc8eddd2
Fix warnings during wrapper docs generation ( #1192 )
...
* Fix warnings during wrapper docs generation
* Fixes
2024-03-08 17:13:03 -08:00
Bartłomiej Kocot
1e73adbc28
Add optimized blockwise gemm using ck wrapper ( #1157 )
...
* Add optimized blockwise gemm using ck wrapper
* Add basic gemm example
* Update docs
* Add tutorial for gemm using ck wrapper
* Add perf note
* edits
* Fix cmake
* Fixes
---------
Co-authored-by: Lisa Delaney <lisa.delaney@amd.com >
2024-02-13 17:04:36 +01:00
Bartłomiej Kocot
171ca260b5
Extend gemm traits number for ck wrapper ( #1153 )
2024-02-02 11:25:54 -08:00
Bartłomiej Kocot
f3b6c23ac5
Add blockwise gemm to ck wrapper ( #1139 )
...
* Add blockwise gemm to ck wrapper
* Add blockwise gemm traits
* Disable test_gemm for non xdl devices
* Fixes
* Add c layout descritpions
2024-01-31 21:24:40 +01:00
Bartłomiej Kocot
7e4eb4b800
Add optimized copy to ck wrapper ( #1126 )
...
* Add optimized copy to ck wrapper
* Example optimizations
* Fixes
* Move img2col test to client example
* Refactor example
* Fix docs
* Fixes
* Fix
* Fixes
* Fixes
* Fixes
* Fixes
* Fixes
---------
Co-authored-by: zjing14 <zhangjing14@gmail.com >
2024-01-19 11:29:00 +01:00
Bartłomiej Kocot
4234b3a691
Add tensor partition and generic copy for ck wrapper ( #1108 )
...
* Add tensor partition and generic copy for ck wrapper
* Update changelog
* Stylistic fixes
* Change shape/strides logic to descriptor transforms
* Fixes
* Fix client example
* Fix comments
2024-01-03 01:10:57 +01:00
Bartłomiej Kocot
07092d68f0
Add tensor structure to wrapper ( #1098 )
...
* Add tensor structure to wrapper
* update changelog
* Fix names
* Comment fixes
2023-12-15 12:45:08 +01:00
Bartłomiej Kocot
836b7e557d
Introduce wrapper library ( #1071 )
...
* Introduce wrapper library
* Update cmake files
* Revert "Update cmake files"
This reverts commit c27f88b565 .
* Fix comments
2023-12-06 11:58:59 +01:00