Po Yen Chen
|
c6eac9746f
|
Fix type errors in composes<>
|
2024-04-09 13:18:17 +00:00 |
|
Po Yen Chen
|
a66409cfd9
|
Unify saturates<> implementation
|
2024-04-09 12:49:07 +00:00 |
|
Po Yen Chen
|
ecc64bce12
|
Generalize the composes<> template
|
2024-04-09 10:14:56 +00:00 |
|
Po Yen Chen
|
6ed739f913
|
Fix wrong value produced by saturating
|
2024-04-09 09:27:58 +00:00 |
|
Po Yen Chen
|
5d0ebdbfe4
|
Re-use already-existing scales<> functor template
|
2024-04-09 08:06:38 +00:00 |
|
Po Yen Chen
|
ad45cf8613
|
Support heterogeneous argument for binary function types
|
2024-04-09 07:41:30 +00:00 |
|
Po Yen Chen
|
87f3cd1ddd
|
Use CK_TILE_FLOAT_TO_FP8_STANDARD as default fp8 rounding mode
|
2024-04-08 12:39:58 +00:00 |
|
rocking
|
d6cb104d0f
|
Add some elementwise op, prepare to quantization
|
2024-04-04 03:18:39 +00:00 |
|
carlushuang
|
42866940dc
|
remove mistake
|
2024-03-31 00:01:30 +00:00 |
|
carlushuang
|
855a264b72
|
remove ck_tile example from default cmake target like all/install/check
|
2024-03-30 23:58:48 +00:00 |
|
carlushuang
|
200d2b22d4
|
fix scratch in fp8 kernel
|
2024-03-25 19:45:38 +00:00 |
|
Po-Yen, Chen
|
1cacb713c5
|
Default use CK_TILE_FLOAT_TO_FP8_STOCHASTIC rounding mode
|
2024-03-23 22:51:18 -04:00 |
|
carlushuang
|
886d040a81
|
fix compile error, fp8 not ready now
|
2024-03-18 07:58:00 +00:00 |
|
carlushuang
|
f55c7629bc
|
not using custom data type by default, now we can have ISA-level same code as opt_padding
|
2024-03-17 23:23:32 +00:00 |
|
carlushuang
|
ee397d0ab2
|
temp fix buffer_store spill
|
2024-03-15 22:56:41 +00:00 |
|
carlushuang
|
04762d212b
|
make sure thread_buffer can be tuple/array
|
2024-03-13 22:03:42 +00:00 |
|
carlushuang
|
616932068d
|
let more integral_constant->constant, and formating
|
2024-03-13 18:33:10 +00:00 |
|
Po-Yen, Chen
|
b1dbf64c91
|
Some minor changes
|
2024-03-13 03:55:07 -04:00 |
|
Po-Yen, Chen
|
8d1631adc9
|
Re-use function
|
2024-03-13 03:38:12 -04:00 |
|
Po-Yen, Chen
|
60221b89f8
|
Add constraint to array<> ctor
|
2024-03-13 03:32:05 -04:00 |
|
Po-Yen, Chen
|
5c433432fd
|
Fix format
|
2024-03-13 03:21:30 -04:00 |
|
Po-Yen, Chen
|
958218e9d0
|
Rename enum
Rename 'cood_transform_enum' to 'coord_transform_enum'
|
2024-03-13 03:15:04 -04:00 |
|
carlushuang
|
d962a0044b
|
fix compile issue in transpose
|
2024-03-13 15:02:45 +00:00 |
|
carlushuang
|
a59e655eb2
|
remove wrong code in store_raw()
|
2024-03-13 14:30:55 +00:00 |
|
Po-Yen, Chen
|
8103048b99
|
Merge branch 'ck_tile/refactor' of github.com:ROCm/composable_kernel-internal into ck_tile/refactor
|
2024-03-13 01:53:43 -04:00 |
|
Po-Yen, Chen
|
2b4e54305b
|
Merge function templates
|
2024-03-13 01:52:49 -04:00 |
|
carlushuang
|
9f34bcb431
|
re-structure tuple/array to avoid spill
|
2024-03-11 15:32:21 +00:00 |
|
carlushuang
|
26a25eb4cd
|
unify as tuple_array
|
2024-03-06 18:36:45 +00:00 |
|
carlushuang
|
0e7df1999f
|
wip fix
|
2024-03-06 14:31:36 +00:00 |
|
carlushuang
|
f549bb5d39
|
minor fix
|
2024-03-04 21:11:53 +00:00 |
|
carlushuang
|
a67473fff8
|
now can build
|
2024-03-04 20:45:51 +00:00 |
|
carlushuang
|
112d521b09
|
fix xx
|
2024-03-03 23:48:31 +00:00 |
|
carlushuang
|
fbd25cea35
|
fix build wip
|
2024-02-29 22:27:31 +00:00 |
|
carlushuang
|
f69356b1d7
|
add code
|
2024-02-28 22:57:19 +00:00 |
|