rocking
|
83b8a99018
|
Merge branch 'ck_tile/refactor' into ck_tile/elementwise
|
2024-04-09 19:45:43 +08:00 |
|
carlushuang
|
89a75a97fa
|
fix some bug in group-mode masking and codegen. update README
|
2024-04-09 19:01:25 +00:00 |
|
Po Yen Chen
|
a9adfbe54a
|
Small refinements in C++ source files
|
2024-04-09 06:45:03 +00:00 |
|
Po Yen Chen
|
20fcd69687
|
Remove not-in-use elementwise function kargs
|
2024-04-09 06:03:35 +00:00 |
|
rocking
|
5860f3134a
|
Merge branch 'ck_tile/refactor' into ck_tile/elementwise
|
2024-04-09 02:37:42 +08:00 |
|
Po Yen Chen
|
92d45d1681
|
Fix wrong fp8 QK/KV block gemm setting
|
2024-04-08 12:39:17 +00:00 |
|
rocking
|
4e005f2457
|
Avoid warning
|
2024-04-08 10:11:51 +00:00 |
|
rocking
|
29a0670744
|
Remove remove_cvref_t
|
2024-04-08 10:03:48 +00:00 |
|
rocking
|
5c3fdeb0b8
|
Remove f8 pipeline, we should share the same pipeline even in f8
|
2024-04-08 09:56:23 +00:00 |
|
rocking
|
f7d81364f3
|
To prevent compiler issue, remove the elementwise function we have not used.
|
2024-04-08 09:44:21 +00:00 |
|
carlushuang
|
42ebffe822
|
1).support receipe in generate.py 2).use simplified mask type 3).change left/right to pass into karg
|
2024-04-07 23:30:34 +00:00 |
|
rocking
|
d9323ea261
|
Fix bug of elementwise op, our elementwise op is not inout
|
2024-04-04 03:17:36 +00:00 |
|
rocking
|
bfcf550305
|
Adjust P elementwise function
|
2024-04-03 11:07:21 +00:00 |
|
rocking
|
286c74468d
|
Add element function to fmha api
|
2024-03-29 18:05:36 -04:00 |
|
rocking
|
50c36f352a
|
Add SAccElementFunction, PComputeElementFunction, OAccElementFunction in pipeline
|
2024-03-29 07:09:06 -04:00 |
|
carlushuang
|
f55c7629bc
|
not using custom data type by default, now we can have ISA-level same code as opt_padding
|
2024-03-17 23:23:32 +00:00 |
|
carlushuang
|
a59e655eb2
|
remove wrong code in store_raw()
|
2024-03-13 14:30:55 +00:00 |
|
carlushuang
|
7df3947819
|
fix macro for exp2; fix warpgemm a/b in transposedC
|
2024-03-06 15:59:21 +00:00 |
|
carlushuang
|
a67473fff8
|
now can build
|
2024-03-04 20:45:51 +00:00 |
|
carlushuang
|
112d521b09
|
fix xx
|
2024-03-03 23:48:31 +00:00 |
|
carlushuang
|
fbd25cea35
|
fix build wip
|
2024-02-29 22:27:31 +00:00 |
|
carlushuang
|
f69356b1d7
|
add code
|
2024-02-28 22:57:19 +00:00 |
|