Aviral Goel
|
004784ef98
|
chore(copyright) update library wide CMakeLists.txt copyright header template (#3313)
* chore(copyright) update library wide CMakeLists.txt files copyright header template
* Fix build
---------
Co-authored-by: Sami Remes <samremes@amd.com>
|
2025-11-28 13:49:54 -08:00 |
|
AviralGoelAMD
|
4e49e0228b
|
chore(copyright): update copyright header for test directory
|
2025-11-19 17:43:28 -07:00 |
|
Illia Silin
|
b94fd0b227
|
update copyright headers (#726)
|
2023-05-31 18:46:57 -05:00 |
|
Chao Liu
|
d3051d7517
|
add license in file (#303)
|
2022-06-24 23:32:43 -05:00 |
|
Chao Liu
|
d1db6a0c3e
|
Absolute include path (#281)
* ad gelu and fast_gelu
* added GeLU and fast GeLU
* clean up
* add gemm+fastgelu example
* add gemm+gelu instances
* update profiler
* clean up
* clean up
* adding gemm+bias+activation
* clean
* adding bias
* clean
* adding gemm multiple d
* debugging
* add gemm bias add fastgelu
* rename, clean
* refactoring; add readme
* refactor
* refactor
* refactor
* refactor
* refactor
* refactor
* fix
* fix
* update example
* update example
* rename
* update example
* add ckProfiler
* clean
* clean
* clean
* clean
* add client app example
* update readme
* delete obselete files
* remove old client app
* delete old file
* cleaning
* clean
* remove half
* fix header path
* fix header path
* fix header path
* fix header path
* fix header path
* fix header path for all examples
* fix header path
* fix header path
* fix header path
* fix header path
* fix header path
* fix header path
* fix header path
* fix header path
* fix header path
* revert client app example
* clean build
* fix build
* temporary disable client test on Jenkins
* clean
* clean
* clean
|
2022-06-24 20:51:04 -05:00 |
|
Anthony Chang
|
e579c9e5c6
|
Tensile-style block to C tile map (#239)
* fix build
* Revert "fix build"
This reverts commit d73102384b.
* post PR #235 merge fix
* amend
* adds tensile-stype c-tile map
* make it dynamic version
* add k-split flavor tile map
* apply tensile-style tile map to all xdl gridwise gemms
* remove dead code
Co-authored-by: Chao Liu <chao.liu2@amd.com>
|
2022-05-24 21:55:22 -05:00 |
|
Anthony Chang
|
a054f7d604
|
Refactor block to C tile map (#235)
* refactor block-to-ctile-map
* gridwise gemm block2ctile generic validity check
* format
* amend split-k gemm block2ctile map refactor
* add test
* format
* amend
* revert to calculating batch index in kernel instead of passing as block_id_z
* move file
* add valid ctile index check to gridwise v2r4
|
2022-05-20 12:40:51 -05:00 |
|