Po-Yen, Chen
|
3f190b0779
|
Add new DeviceMem API to copy memory
|
2022-08-21 01:16:47 -04:00 |
|
Po-Yen, Chen
|
1facdbd08f
|
Add wrapper function to hide value conversion while copying memory
|
2022-08-20 12:51:53 -04:00 |
|
Po-Yen, Chen
|
eb8d636619
|
Re-format CMakeLists.txt
|
2022-08-20 00:00:12 -04:00 |
|
Po-Yen, Chen
|
eff7a1043e
|
Add missing 'else' keyword
|
2022-08-19 16:39:27 -04:00 |
|
Po-Yen, Chen
|
b268df80dc
|
Exit successfully if GEMM instance does not support given problem
|
2022-08-19 16:37:30 -04:00 |
|
Po-Yen, Chen
|
5e5e99e8c5
|
Fix never-ran examples
|
2022-08-19 16:33:50 -04:00 |
|
Po-Yen, Chen
|
990eed11b7
|
Handle the case while user specify all the strides
|
2022-08-19 16:32:37 -04:00 |
|
Po-Yen, Chen
|
7558d14442
|
Fix wrong program return value of GEMM examples
|
2022-08-19 16:29:48 -04:00 |
|
Po-Yen, Chen
|
1ce791ea05
|
Use more strict condition to add code in examples
|
2022-08-19 15:39:50 -04:00 |
|
Po-Yen, Chen
|
75a30f8b18
|
Mark Tensor<> special member functions as 'default'
|
2022-08-19 15:30:43 -04:00 |
|
Po-Yen, Chen
|
1626a6e376
|
Remove unnecessary copy ctor for Tensor<>
|
2022-08-19 15:27:21 -04:00 |
|
Po-Yen, Chen
|
cd395646fa
|
Fix compilation error in check_err()
|
2022-08-19 15:22:26 -04:00 |
|
Po-Yen, Chen
|
47770c857b
|
Allow unsigned integer arguments for check_err()
|
2022-08-19 15:19:34 -04:00 |
|
Po-Yen, Chen
|
3b0f97f6eb
|
Revert "Add type traits 'is_signed_integral<>'"
This reverts commit f2c148efae.
|
2022-08-19 15:14:12 -04:00 |
|
Po-Yen, Chen
|
103ae7d126
|
Use reinterpret_cast<>() for cross-type pointer conversion
|
2022-08-19 15:01:32 -04:00 |
|
Po-Yen, Chen
|
a177ad758f
|
Unify structured comment in examples
|
2022-08-19 14:57:21 -04:00 |
|
Po-Yen, Chen
|
e37f4ab9cc
|
Re-format common.hpp
|
2022-08-19 14:50:44 -04:00 |
|
Po-Yen, Chen
|
f7288bc2b1
|
Reuse same implementation code for most of GEMM examples
|
2022-08-19 14:47:09 -04:00 |
|
Po-Yen, Chen
|
ed51c0638b
|
Re-format template argument in example code
|
2022-08-19 14:31:46 -04:00 |
|
Po-Yen, Chen
|
5931c7ebe6
|
Move common codes together
|
2022-08-19 13:49:22 -04:00 |
|
Po-Yen, Chen
|
68a57e71e6
|
Move #include directives into new header
|
2022-08-19 13:24:00 -04:00 |
|
Po-Yen, Chen
|
42d75f356c
|
Sort include directives
|
2022-08-19 12:59:46 -04:00 |
|
Po-Yen, Chen
|
dd5b139401
|
Extract int4 example common codes
|
2022-08-19 12:57:36 -04:00 |
|
Po-Yen, Chen
|
3e2f37a148
|
Re-format GEMM instance template arguments
|
2022-08-19 12:02:57 -04:00 |
|
Po-Yen, Chen
|
c1fbabea04
|
Avoid too much generalizing check_err()
|
2022-08-19 11:59:21 -04:00 |
|
Po-Yen, Chen
|
4d4a659cd6
|
Use ""_uz to simplify example code
|
2022-08-19 11:54:51 -04:00 |
|
Po-Yen, Chen
|
3e2371c554
|
Align design with other PR
|
2022-08-19 11:44:08 -04:00 |
|
Po-Yen, Chen
|
503f07c1e0
|
Add constraint to check_err() input reference type
|
2022-08-19 11:34:19 -04:00 |
|
Po-Yen, Chen
|
2fb766e852
|
Simplify tensor usages in examples
|
2022-08-19 11:33:25 -04:00 |
|
Po-Yen, Chen
|
0d5025befe
|
Add #error directive to prevent compile sources with wrong setting
|
2022-08-19 10:51:30 -04:00 |
|
Po-Yen, Chen
|
625f95ade4
|
Remove debug messages
|
2022-08-19 10:05:44 -04:00 |
|
Po-Yen, Chen
|
84843aa36f
|
Avoid compilation error while disabling ck::int4_t support
|
2022-08-19 09:54:03 -04:00 |
|
Po-Yen, Chen
|
51d0c6794c
|
Remove constraint of Tensor<>::CopyAsType()
|
2022-08-19 05:31:04 -04:00 |
|
Po-Yen, Chen
|
c34f8411c4
|
Check converted Tensor<int4_t> with golden Tensor<int8_t>
|
2022-08-19 04:40:13 -04:00 |
|
Po-Yen, Chen
|
a83c006098
|
Allow comparing different-sized integral types in check_err()
|
2022-08-19 04:39:20 -04:00 |
|
Po-Yen, Chen
|
726c115393
|
Add type constraints for integer version check_err<>()
|
2022-08-19 03:48:20 -04:00 |
|
Po-Yen, Chen
|
f2c148efae
|
Add type traits 'is_signed_integral<>'
|
2022-08-19 03:47:22 -04:00 |
|
Po-Yen, Chen
|
463d15f9b5
|
Add constraint to Tensor<> templated methods
|
2022-08-19 03:27:41 -04:00 |
|
Po-Yen, Chen
|
f3f61f836b
|
Complete the int4 examples
|
2022-08-19 02:19:50 -04:00 |
|
Po-Yen, Chen
|
2dc3357a20
|
Fix typo in alias names
|
2022-08-19 01:41:20 -04:00 |
|
Po-Yen, Chen
|
79480f0aee
|
Re-use element-wise operation type alias
|
2022-08-19 01:39:46 -04:00 |
|
Po-Yen, Chen
|
dd849a8736
|
Re-use CopyAsType<>() to implement copy ctor
|
2022-08-19 01:02:36 -04:00 |
|
Po-Yen, Chen
|
e03cece9c4
|
Use different type for host tensors
|
2022-08-19 00:32:57 -04:00 |
|
Po-Yen, Chen
|
89a827cab9
|
Re-format source files
|
2022-08-19 00:32:24 -04:00 |
|
Po-Yen, Chen
|
cbbe2485b2
|
Allow conversion between Tensor<> specializations
|
2022-08-19 00:30:53 -04:00 |
|
Po-Yen, Chen
|
30ed3e218c
|
Add int4_t support for check_err()
|
2022-08-19 00:30:28 -04:00 |
|
Po-Yen, Chen
|
194faf7837
|
Distinguish user-side type from kernel-side type
|
2022-08-18 23:43:19 -04:00 |
|
Po-Yen, Chen
|
70c87970ec
|
Re-use pre-defined alias in int4 exmples
|
2022-08-18 23:29:38 -04:00 |
|
Po-Yen, Chen
|
4b153bd974
|
Add GEMM examples for int4
Currently the source files are just copied from int8 examples
|
2022-08-18 23:03:36 -04:00 |
|
Illia Silin
|
9efd033bee
|
restart the stages on MI200 in case of failures (#366)
* restart the stages on MI200
* fix the docker image storage issue
|
2022-08-18 14:54:47 -05:00 |
|