| Po Yen Chen | 87f3cd1ddd | Use CK_TILE_FLOAT_TO_FP8_STANDARD as default fp8 rounding mode | 2024-04-08 12:39:58 +00:00 |
| Po-Yen, Chen | 1cacb713c5 | Default use CK_TILE_FLOAT_TO_FP8_STOCHASTIC rounding mode | 2024-03-23 22:51:18 -04:00 |
| carlushuang | f55c7629bc | not using custom data type by default, now we can have ISA-level same code as opt_padding | 2024-03-17 23:23:32 +00:00 |
| carlushuang | 04762d212b | make sure thread_buffer can be tuple/array | 2024-03-13 22:03:42 +00:00 |
| Po-Yen, Chen | b1dbf64c91 | Some minor changes | 2024-03-13 03:55:07 -04:00 |
| carlushuang | a59e655eb2 | remove wrong code in store_raw() | 2024-03-13 14:30:55 +00:00 |
| carlushuang | 9f34bcb431 | re-structure tuple/array to avoid spill | 2024-03-11 15:32:21 +00:00 |
| carlushuang | 26a25eb4cd | unify as tuple_array | 2024-03-06 18:36:45 +00:00 |
| carlushuang | 0e7df1999f | wip fix | 2024-03-06 14:31:36 +00:00 |
| carlushuang | a67473fff8 | now can build | 2024-03-04 20:45:51 +00:00 |
| carlushuang | 112d521b09 | fix xx | 2024-03-03 23:48:31 +00:00 |
| carlushuang | fbd25cea35 | fix build wip | 2024-02-29 22:27:31 +00:00 |
| carlushuang | f69356b1d7 | add code | 2024-02-28 22:57:19 +00:00 |