Commit Graph

22 Commits

Author SHA1 Message Date
rocking
83b8a99018 Merge branch 'ck_tile/refactor' into ck_tile/elementwise 2024-04-09 19:45:43 +08:00
carlushuang
89a75a97fa fix some bug in group-mode masking and codegen. update README 2024-04-09 19:01:25 +00:00
Po Yen Chen
a9adfbe54a Small refinements in C++ source files 2024-04-09 06:45:03 +00:00
Po Yen Chen
20fcd69687 Remove not-in-use elementwise function kargs 2024-04-09 06:03:35 +00:00
rocking
5860f3134a Merge branch 'ck_tile/refactor' into ck_tile/elementwise 2024-04-09 02:37:42 +08:00
Po Yen Chen
92d45d1681 Fix wrong fp8 QK/KV block gemm setting 2024-04-08 12:39:17 +00:00
rocking
4e005f2457 Avoid warning 2024-04-08 10:11:51 +00:00
rocking
29a0670744 Remove remove_cvref_t 2024-04-08 10:03:48 +00:00
rocking
5c3fdeb0b8 Remove f8 pipeline, we should share the same pipeline even in f8 2024-04-08 09:56:23 +00:00
rocking
f7d81364f3 To prevent compiler issue, remove the elementwise function we have not used. 2024-04-08 09:44:21 +00:00
carlushuang
42ebffe822 1) support receipe in generate.py; 2) use simplified mask type; 3) change left/right to pass into karg 2024-04-07 23:30:34 +00:00
rocking
d9323ea261 Fix bug of elementwise op, our elementwise op is not inout 2024-04-04 03:17:36 +00:00
rocking
bfcf550305 Adjust P elementwise function 2024-04-03 11:07:21 +00:00
rocking
286c74468d Add element function to fmha api 2024-03-29 18:05:36 -04:00
rocking
50c36f352a Add SAccElementFunction, PComputeElementFunction, OAccElementFunction in pipeline 2024-03-29 07:09:06 -04:00
carlushuang
f55c7629bc not using custom data type by default, now we can have ISA-level same code as opt_padding 2024-03-17 23:23:32 +00:00
carlushuang
a59e655eb2 remove wrong code in store_raw() 2024-03-13 14:30:55 +00:00
carlushuang
7df3947819 fix macro for exp2; fix warpgemm a/b in transposedC 2024-03-06 15:59:21 +00:00
carlushuang
a67473fff8 now can build 2024-03-04 20:45:51 +00:00
carlushuang
112d521b09 fix xx 2024-03-03 23:48:31 +00:00
carlushuang
fbd25cea35 fix build wip 2024-02-29 22:27:31 +00:00
carlushuang
f69356b1d7 add code 2024-02-28 22:57:19 +00:00