Po Yen Chen
641ae96215
Check fp8 rounding error in check_err()
2024-04-08 12:39:27 +00:00
Po Yen Chen
92d45d1681
Fix wrong fp8 QK/KV block gemm setting
2024-04-08 12:39:17 +00:00
rocking
4e005f2457
Avoid warning
2024-04-08 10:11:51 +00:00
rocking
29a0670744
Remove remove_cvref_t
2024-04-08 10:03:48 +00:00
rocking
5c3fdeb0b8
Remove f8 pipeline, we should share the same pipeline even in f8
2024-04-08 09:56:23 +00:00
rocking
f7d81364f3
To prevent compiler issue, remove the elementwise function we have not used.
2024-04-08 09:44:21 +00:00
rocking
68153dea0b
Let generate.py generate different elementwise functions
2024-04-04 03:59:38 +00:00
rocking
d6cb104d0f
Add some elementwise op, prepare to quantization
2024-04-04 03:18:39 +00:00
rocking
d9323ea261
Fix bug of elementwise op, our elementwise op is not inout
2024-04-04 03:17:36 +00:00
rocking
bfcf550305
Adjust P elementwise function
2024-04-03 11:07:21 +00:00
rocking
cf57626c07
Merge branch 'ck_tile/refactor' into ck_tile/elementwise
2024-04-01 16:07:27 +08:00
carlushuang
42866940dc
remove mistake
2024-03-31 00:01:30 +00:00
carlushuang
855a264b72
remove ck_tile example from default cmake target like all/install/check
2024-03-30 23:58:48 +00:00
rocking
286c74468d
Add element function to fmha api
2024-03-29 18:05:36 -04:00
carlushuang
076da565dd
set the minimum Python version to 3.8
2024-03-29 17:07:23 +00:00
rocking
50c36f352a
Add SAccElementFunction, PComputeElementFunction, OAccElementFunction in pipeline
2024-03-29 07:09:06 -04:00
carlushuang
f236a13d1b
fix several issues
2024-03-28 22:00:11 +00:00
carlushuang
06c54880d1
Merge remote-tracking branch 'origin/develop' into ck_tile/refactor
2024-03-28 21:59:40 +00:00
dependabot[bot]
5f2c89e8b4
Bump rocm-docs-core from 0.37.1 to 0.38.0 in /docs/sphinx (#1218)
...
Bumps [rocm-docs-core](https://github.com/RadeonOpenCompute/rocm-docs-core) from 0.37.1 to 0.38.0.
- [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v0.37.1...v0.38.0)
---
updated-dependencies:
- dependency-name: rocm-docs-core
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-03-27 10:23:54 -07:00
carlushuang
b0b8a5ad46
update README of ck_tile example
2024-03-26 18:57:29 +00:00
carlushuang
13311f2e5a
fix clang-format
2024-03-26 18:53:10 +00:00
carlushuang
b9ed9c8e4d
Merge remote-tracking branch 'origin/ck_tile_merge_public_develop' into ck_tile/refactor
2024-03-26 16:34:46 +00:00
carlushuang
ca941d66ef
remove unused
2024-03-26 16:33:48 +00:00
carlushuang
97902de98c
sync 22
2024-03-26 16:30:50 +00:00
carlushuang
f955af6ff7
sync upstream again
2024-03-26 16:25:32 +00:00
carlushuang
1c92c5d83d
sync with upstream
2024-03-26 16:05:54 +00:00
carlushuang
04ee01191a
fix merge from upstream
2024-03-26 14:09:54 +00:00
carlushuang
c94b545747
update some readme
2024-03-26 13:35:53 +00:00
carlushuang
200d2b22d4
fix scratch in fp8 kernel
2024-03-25 19:45:38 +00:00
Po-Yen, Chen
1cacb713c5
Use CK_TILE_FLOAT_TO_FP8_STOCHASTIC rounding mode by default
2024-03-23 22:51:18 -04:00
Illia Silin
cc1f733d0e
allow the CI to pass even if it can't connect to the db (#1214)
2024-03-22 15:39:11 -07:00
dependabot[bot]
2ae16e901f
Bump rocm-docs-core from 0.37.0 to 0.37.1 in /docs/sphinx (#1211)
...
Bumps [rocm-docs-core](https://github.com/RadeonOpenCompute/rocm-docs-core) from 0.37.0 to 0.37.1.
- [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v0.37.0...v0.37.1)
---
updated-dependencies:
- dependency-name: rocm-docs-core
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-03-22 07:58:36 -07:00
Bartłomiej Kocot
9c052804a7
Add elementwise with dynamic vector dim (#1198)
...
* Add elementwise with dynamic vector dim
* Reduce number of instances
* Fixes
* Fixes
2024-03-22 10:40:43 +01:00
Rostyslav Geyyer
fd0d093e78
Add instances for conv_scale with bf8 in / fp8 out (#1200)
...
* Add bf8 conv fwd instances
* Add example
* Add profiler mode
* Add client example
* Fix copyright headers
* Format
2024-03-21 13:57:34 -05:00
dependabot[bot]
9e50426915
Bump rocm-docs-core from 0.36.0 to 0.37.0 in /docs/sphinx (#1208)
...
Bumps [rocm-docs-core](https://github.com/RadeonOpenCompute/rocm-docs-core) from 0.36.0 to 0.37.0.
- [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v0.36.0...v0.37.0)
---
updated-dependencies:
- dependency-name: rocm-docs-core
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-03-20 09:28:03 -06:00
carlushuang
bb1f6e48eb
fix fp8 duplicated move/shift/and/or problem
2024-03-19 23:29:57 +00:00
Illia Silin
f52109531b
Fix a couple of docker issues. (#1206)
...
* do not install sccache by default, only install rocm-llvm-dev for rocm6.1
* add sccache flag to docker build options
2024-03-19 08:38:52 -07:00
Illia Silin
9e011bcd6e
update the changelog for ROCm6.1 release (#1205)
...
* update the changelog for ROCm6.1 release
* modify the order of items in changelog, capitalize GEMMs
2024-03-18 10:16:45 -07:00
Illia Silin
bdcd037428
Re-enable the performance tracking in CI. (#1203)
...
* test CK with rocm6.1 RC2
* add docker credentials for pull
* update the performance db name
* use environment variable for db name
* add rocm-llvm-dev package to ck docker
* turn off verification for daily performance runs
* do not stash ckProfiler on MI300 node
* add processing of mixed gemms to qa, fix parsing of splitk gemm logs
* fix the splitk gemm log file name
* turn the timing on for splitk gemm performance
2024-03-18 09:48:29 -07:00
carlushuang
886d040a81
fix compile error, fp8 not ready now
2024-03-18 07:58:00 +00:00
carlushuang
f55c7629bc
not using custom data type by default, now we can have ISA-level same code as opt_padding
2024-03-17 23:23:32 +00:00
carlushuang
ee397d0ab2
temp fix buffer_store spill
2024-03-15 22:56:41 +00:00
Rostyslav Geyyer
e626d5202a
Add instances for conv_scale with fp8 in/out (#1193)
...
* Add fp8 conv instances and client example
* Format
* Add example
* Update cmakelists
* Add profiler mode
* Format
* Fix copyright headers
2024-03-15 09:50:03 -07:00
Bartłomiej Kocot
285251768e
Add conv fwd/bwd data scale instances, extend bilinear instances (#1178)
...
* Add conv fwd/bwd data scale instances
* Fix cmake client example file
---------
Co-authored-by: Adam Osewski <19374865+aosewski@users.noreply.github.com>
2024-03-13 23:09:08 +01:00
carlushuang
04762d212b
make sure thread_buffer can be tuple/array
2024-03-13 22:03:42 +00:00
carlushuang
616932068d
let more integral_constant->constant, and formatting
2024-03-13 18:33:10 +00:00
Po-Yen, Chen
b1dbf64c91
Some minor changes
2024-03-13 03:55:07 -04:00
Po-Yen, Chen
8d1631adc9
Re-use function
2024-03-13 03:38:12 -04:00
Po-Yen, Chen
60221b89f8
Add constraint to array<> ctor
2024-03-13 03:32:05 -04:00
Po-Yen, Chen
5c433432fd
Fix format
2024-03-13 03:21:30 -04:00