zjing14
b53e9d08ed
Batched GEMM for fp16 (#79)
* prepare host for batched_gemm
* init commit of batched kernels
* fixed
* refine transform with freeze
* m/n padding
* fixed a bug; clean
* add small tiles
* clean
* clean code
* clean code
* add nt, tn, tt layout
* add missing file
* use StaticBufferTupleOfVector instead
* add reference_batched_gemm
* fixed a macro
2022-02-11 09:36:52 -06:00
..
2021-12-26 07:43:42 -07:00
2021-12-26 07:43:42 -07:00
2022-02-11 09:36:52 -06:00
2022-02-06 22:32:47 -06:00
2021-12-26 07:43:42 -07:00
2021-12-26 07:43:42 -07:00
2021-12-26 07:43:42 -07:00
2021-12-26 07:43:42 -07:00
2021-12-26 07:43:42 -07:00
2021-12-26 07:43:42 -07:00
2022-02-06 22:32:47 -06:00
2022-02-06 22:32:47 -06:00
2022-02-11 00:48:41 -06:00
2022-02-10 23:52:19 -06:00
2022-02-10 23:52:19 -06:00
2022-02-10 23:52:19 -06:00
2022-02-02 22:47:27 -06:00
2022-02-10 23:52:19 -06:00
2022-02-11 00:48:41 -06:00
2021-12-26 07:43:42 -07:00
2021-11-14 11:28:32 -06:00