Sam Wu
|
e375ee8db4
|
standardize docs (#655)
[ROCm/composable_kernel commit: f80776d937]
|
2023-03-23 20:58:59 -07:00 |
|
Po Yen Chen
|
80cbcb0c6e
|
Add quotes for string option values (#472)
[ROCm/composable_kernel commit: 337642a48c]
|
2022-10-27 15:33:14 -06:00 |
|
Chao Liu
|
32718de0fe
|
Update doc (#464)
* update cmake script
* update readme
* Update README.md
* add citation
* add images
* Update README.md
* update
* Update README.md
* Update CONTRIBUTORS.md
* Update README.md
* Update CITATION.cff
* Update README.md
* Update CITATION.cff
* update doc
* Update CONTRIBUTORS.md
* Update LICENSE
[ROCm/composable_kernel commit: 6de749e29c]
|
2022-10-03 14:34:40 -05:00 |
|
Chao Liu
|
34f18d8e24
|
update document: Readme, contributors, citation, (#463)
* update cmake script
* update readme
* Update README.md
* add citation
* add images
* Update README.md
* update
* Update README.md
* Update CONTRIBUTORS.md
* Update README.md
* Update CITATION.cff
* Update README.md
* Update CITATION.cff
[ROCm/composable_kernel commit: 473ba5bc4a]
|
2022-10-03 00:48:24 -05:00 |
|
Chao Liu
|
5a1aa3f702
|
Clean up conv example, Instances, profiler and test (#324)
* convnd_fwd fp16 example
* update example
* update example
* update instance
* updating refernce conv
* update reference conv
* update conv fwd profiler
* update conv 1d and 3d instance
* update include path
* clean
* update profiler for conv bwd data and weight
* update conv bwd weight
* clean
* update conv example
* update profiler for conv bwd weight
* update ckprofiler for conv bwd data
* fix reference conv bwd data bug; update conv bwd data test
* update examples
* fix initialization issue
* update test for conv fwd
* clean
* clean
* remove test case too sensitive to error threshhold
* fix test
* clean
* fix build
* adding conv multiple d
* adding conv multiple D
* add matrix padder
* add gemm padding to convnd
* adding group conv
* update gemm multi-d
* refactor
* refactor
* refactor
* clean
* clean
* refactor
* refactor
* reorg
* add ds
* add bias
* clean
* add G
* adding group
* adding group
* adding group
* update Tensor
* clean
* update example
* update DeviceGemmMultipleD_Xdl_CShuffle
* update conv bwd-data and bwd-weight
* upate contraction example
* update gemm and batch gemm with e permute
* fix example build
* instance for grouped conv1d
* update example
* adding group conv instance
* update gemm bilinear instance
* update gemm+add+add+fastgelu instance
* update profiler
* update profiler
* update test
* update test and client example
* clean
* add grouped conv into profiler
* update profiler
* clean
* add test grouped conv, update all conv test to gtest
* update test
[ROCm/composable_kernel commit: 500fa99512]
|
2022-07-29 18:19:25 -05:00 |
|
Chao Liu
|
74b6e85eaf
|
Improve external interface for GEMM and GEMM+add+add+fastgelu (#311)
* interface for GEMM and GEMM+add+add+fastgelu
* rename namespace
* instance factory
* fix build
* fix build; add GEMM client example
* clean
[ROCm/composable_kernel commit: 0dcb3496cf]
|
2022-06-30 22:11:00 -05:00 |
|
Liam Wrubleski
|
ef57890712
|
Switch to standard ROCm packaging (#301)
* Switch to standard ROCm packaging
* Revert .gitignore changes
* install new rocm-cmake version
* update readme
Co-authored-by: illsilin <Illia.Silin@amd.com>
Co-authored-by: Chao Liu <chao.liu2@amd.com>
[ROCm/composable_kernel commit: b653c5eb2e]
|
2022-06-25 09:35:16 -05:00 |
|
Chao Liu
|
e33120786f
|
update readme and script (#290)
[ROCm/composable_kernel commit: ccbd8d907b]
|
2022-06-20 23:34:32 -05:00 |
|
JD
|
569dd9f47b
|
Add host API (#220)
* Add host API
* manually rebase on develop
* clean
* manually rebase on develop
* exclude tests from all target
* address review comments
* update client app name
* fix missing lib name
* clang-format update
* refactor
* refactor
* refactor
* refactor
* refactor
* fix test issue
* refactor
* refactor
* refactor
* upate cmake and readme
Co-authored-by: Chao Liu <chao.liu2@amd.com>
[ROCm/composable_kernel commit: cec69bc3bc]
|
2022-05-12 09:21:01 -05:00 |
|
Wen-Heng (Jack) Chung
|
1dc34ba98b
|
Update README.md (#228)
[ROCm/composable_kernel commit: 968bd93285]
|
2022-05-09 15:00:04 -05:00 |
|
Chao Liu
|
3f732cceab
|
Compile for gfx908 and gfx90a (#130)
* adding compilation for multiple targets
* fix build
* clean
* update Jekinsfile
* update readme
* update Jenkins
* use ck::half_t instead of ushort for bf16
* rename enum classes
* clean
* rename
* clean
[ROCm/composable_kernel commit: cd167e492a]
|
2022-03-31 12:33:34 -05:00 |
|
Chao Liu
|
b9f9ed96ac
|
ckProfiler and device-level XDL GEMM operator (#48)
* add DeviceGemmXdl
* update script
* fix naming issue
* fix comment
* output HostTensorDescriptor
* rename
* padded GEMM for fwd v4r4r4 nhwc
* refactor
* refactor
* refactor
* adding ckProfiler
* adding ckProfiler
* refactor
* fix tuning parameter bug
* add more gemm instances
* add more fp16 GEMM instances
* fix profiler driver
* fix bug in tuning parameter
* add fp32 gemm instances
* small fix
* refactor
* rename
* refactor gemm profiler; adding DeviceConv and conv profiler
* refactor
* fix
* add conv profiler
* refactor
* adding more GEMM and Conv instance
* Create README.md
Add build instruction for ckProfiler
* Create README.md
Add Readme for gemm_xdl example
* Update README.md
Remove build instruction from top most folder
* Update README.md
* clean up
[ROCm/composable_kernel commit: e823d518cb]
|
2021-11-14 11:28:32 -06:00 |
|
Chao Liu
|
c5a4edb9e8
|
rename
[ROCm/composable_kernel commit: c03045ce2d]
|
2021-08-10 23:45:36 +00:00 |
|
Chao Liu
|
1b8b55fc61
|
Update README.md
[ROCm/composable_kernel commit: 85a1429301]
|
2021-07-28 09:41:38 -05:00 |
|
Chao Liu
|
18bec192f5
|
Update README.md
[ROCm/composable_kernel commit: 56f93c6f33]
|
2021-07-28 09:40:44 -05:00 |
|
Chao Liu
|
2ac5994379
|
Create README.md (#45)
* Create README.md
[ROCm/composable_kernel commit: 4682d070a6]
|
2021-07-08 13:32:29 -05:00 |
|