Commit Graph

  • b0aab85baa [CK][Examples] Fix for example_grouped_gemm_multiple_d_dl_fp16 - corrected stride for B matrix. (#3104) Michał Kulikowski 2025-10-28 17:47:25 +01:00
  • 73d5d832ad Merge branch 'develop' into philipm/documentation-cleanup-7 philipm/documentation-cleanup-7 Illia Silin 2025-10-28 09:29:39 -07:00
  • 3b6a825599 Merge commit '331273b4747c9beebed5653e38f32ebede9f539b' into develop assistant-librarian[bot] 2025-10-28 15:15:19 +00:00
  • 97c2fb582a Fix multiple test failures with staging compiler. (#3103) Illia Silin 2025-10-28 08:07:19 -07:00
  • 02a10e6946 Fix multiple test failures with staging compiler. (#3103) Illia Silin 2025-10-28 08:07:19 -07:00
  • 331273b474 Fix multiple test failures with staging compiler. (#3103) Illia Silin 2025-10-28 08:07:19 -07:00
  • 8eb813de42 [CK_TILE] Fixed multi-abd GEMM test, NaN problem (#2979) Mateusz Ozga 2025-10-28 15:53:36 +01:00
  • 8d51d0ef4d [CK_TILE] Fixed multi-abd GEMM test, NaN problem (#2979) Mateusz Ozga 2025-10-28 15:53:36 +01:00
  • da4247a6df [CK_TILE] Fixed multi-abd GEMM test, NaN problem (#2979) Mateusz Ozga 2025-10-28 15:53:36 +01:00
  • 594c4e9fa7 set nt load tune_norm felix 2025-10-28 14:37:24 +00:00
  • 402bf6664d [CK_TILE] Add Bquant to Grouped Gemm (#3063) Aviral Goel 2025-10-28 10:20:24 -04:00
  • dfbc489a6b [CK_TILE] Add Bquant to Grouped Gemm (#3063) Aviral Goel 2025-10-28 10:20:24 -04:00
  • 4368fd9f57 [CK_TILE] Add Bquant to Grouped Gemm (#3063) Aviral Goel 2025-10-28 10:20:24 -04:00
  • d5c8315aff fixed window creation number<>{} Tianxing Wu 2025-10-28 11:32:48 +00:00
  • 190696e63c fixed tests for grouped_convolution_forward_clamp.cpp kabraham/factory-tests Kevin Abraham 2025-10-28 07:28:40 +00:00
  • 5ed7d04d90 Merge commit '1c17bae816edc44c32ee9d1a19d79d768fd1be13' into develop assistant-librarian[bot] 2025-10-28 06:16:21 +00:00
  • f0b6fdcadb Add name member to CK elementwise operations. (#3102) Ville Pietilä 2025-10-28 07:19:29 +02:00
  • 96c8bba2e4 Add name member to CK elementwise operations. (#3102) Ville Pietilä 2025-10-28 07:19:29 +02:00
  • 1c17bae816 Add name member to CK elementwise operations. (#3102) Ville Pietilä 2025-10-28 07:19:29 +02:00
  • 03c97c9524 [CK_BUILDER] Test and fix instance traits utils. (#3096) John Shumway 2025-10-27 22:14:08 -07:00
  • c237ad2950 [CK_BUILDER] Test and fix instance traits utils. (#3096) John Shumway 2025-10-27 22:14:08 -07:00
  • 54746e9329 [CK_BUILDER] Test and fix instance traits utils. (#3096) John Shumway 2025-10-27 22:14:08 -07:00
  • 56845d02b8 Merge commit 'e02b1e7cafedd3847672329ac310e0379268ffd6' into develop assistant-librarian[bot] 2025-10-28 04:13:39 +00:00
  • 689ae06dbc Fix AITER tests. (#3106) Illia Silin 2025-10-27 20:59:21 -07:00
  • b97849e066 Fix AITER tests. (#3106) Illia Silin 2025-10-27 20:59:21 -07:00
  • e02b1e7caf Fix AITER tests. (#3106) Illia Silin 2025-10-27 20:59:21 -07:00
  • 5978a1b36f Merge commit '715395bc8636d3bca350a6cbe0fba804bd6bae48' into develop assistant-librarian[bot] 2025-10-28 02:41:29 +00:00
  • 71fef01df6 [CK_TILE] Stream-K Gemm Example for fp8 and bf8 (#3041) arai713 2025-10-27 19:29:03 -07:00
  • df355e12a8 [CK_TILE] Stream-K Gemm Example for fp8 and bf8 (#3041) arai713 2025-10-27 19:29:03 -07:00
  • 715395bc86 [CK_TILE] Stream-K Gemm Example for fp8 and bf8 (#3041) arai713 2025-10-27 19:29:03 -07:00
  • c1c7bc9368 Ck tile engine gemm (#2982) Thrupti Raj Lakshmana Gowda 2025-10-27 21:11:13 -05:00
  • f32ef6ed17 Ck tile engine gemm (#2982) Thrupti Raj Lakshmana Gowda 2025-10-27 21:11:13 -05:00
  • 7fc0a38e90 Ck tile engine gemm (#2982) Thrupti Raj Lakshmana Gowda 2025-10-27 21:11:13 -05:00
  • 0c6a40ab1c try making tail handling logic non-constexpr tenpercent/compv3_build_time_reduce_experiment Max Podkorytov 2025-10-27 16:37:26 -05:00
  • dfe762a225 Merge commit 'b11f53a484e45a796ddce247294287b4f524c64f' into develop assistant-librarian[bot] 2025-10-27 21:11:25 +00:00
  • 35cb7500e4 Fix quant scale matrix layout for block scale gemm (#3079) Khushbu Agarwal 2025-10-27 13:56:07 -07:00
  • e10a11323a Fix quant scale matrix layout for block scale gemm (#3079) Khushbu Agarwal 2025-10-27 13:56:07 -07:00
  • b11f53a484 Fix quant scale matrix layout for block scale gemm (#3079) Khushbu Agarwal 2025-10-27 13:56:07 -07:00
  • 225a028cce added test_grouped_convolution_forward_clamp Kevin Abraham 2025-10-27 20:38:02 +00:00
  • 5e14625be7 resolved merge issues with test_ck_factory_grouped_convolution_forward_convscale Kevin Abraham 2025-10-27 20:36:26 +00:00
  • 44a0e1afdb Merge commit 'a46b725992bdefad16d1c30dcfe4bb8441462907' into develop assistant-librarian[bot] 2025-10-27 19:11:23 +00:00
  • 5fa81082e4 Added Support for tile_grouped_gemm_preshuffle example (#2993) mkumar16-amd 2025-10-28 00:01:19 +05:30
  • 5dc38c98bf Added Support for tile_grouped_gemm_preshuffle example (#2993) mkumar16-amd 2025-10-28 00:01:19 +05:30
  • a46b725992 Added Support for tile_grouped_gemm_preshuffle example (#2993) mkumar16-amd 2025-10-28 00:01:19 +05:30
  • d3e72e87c4 Merge commit '6c2ca1211ae29802281049843d284ba1bd6511f8' into develop assistant-librarian[bot] 2025-10-27 18:15:18 +00:00
  • e1e96b89fa [CK_BUILDER] First fwd convolution builder implementation (#3070) Ville Pietilä 2025-10-27 20:09:24 +02:00
  • d859b04023 [CK_BUILDER] First fwd convolution builder implementation (#3070) Ville Pietilä 2025-10-27 20:09:24 +02:00
  • 6c2ca1211a [CK_BUILDER] First fwd convolution builder implementation (#3070) Ville Pietilä 2025-10-27 20:09:24 +02:00
  • 0690ed26ba [CK_TILE] Add conv fwd + bias + clamp example (#3012) Johannes Graner 2025-10-27 18:43:09 +01:00
  • 3b8e9864c6 [CK_TILE] Add conv fwd + bias + clamp example (#3012) Johannes Graner 2025-10-27 18:43:09 +01:00
  • 5c1974065e [CK_TILE] Add conv fwd + bias + clamp example (#3012) Johannes Graner 2025-10-27 18:43:09 +01:00
  • 9cdbee7709 Merge commit '054fdb765cd74c0f7bbb6561ea58713df82ed85f' into develop assistant-librarian[bot] 2025-10-27 17:11:55 +00:00
  • 3b45c7fd2d added grouped_conv_bilinear to tests Kevin Abraham 2025-10-27 16:36:16 +00:00
  • ef1c170abd Revert "added grouped_conv_bilinear to tests" Kevin Abraham 2025-10-27 15:20:37 +00:00
  • 37323495cb Implemented tests for dynamic op Kevin Abraham 2025-10-27 15:20:21 +00:00
  • d08b6ab52b added grouped_conv_bilinear to tests Kevin Abraham 2025-10-27 13:14:52 +00:00
  • 02b250f298 implemented tests for instances from grouped_convolution_forward_convscale.hpp:210 Kevin Abraham 2025-10-24 20:11:40 +00:00
  • bd40b58266 implemented tests for instances from grouped_convolution_forward_convscale.hpp:100 Kevin Abraham 2025-10-24 19:54:05 +00:00
  • 7a49fb2874 ck-builder: add missing type tf32 to type_name Robin Voetter 2025-10-23 16:03:51 +02:00
  • bf022a6d15 ck-builder: add InstanceSet and InstanceMatcher Robin Voetter 2025-10-22 18:43:59 +02:00
  • d06d23ab11 [CK_TILE] Stream-K operator() Reboot (#3064) arai713 2025-10-27 09:14:17 -07:00
  • cbf24c87c6 [CK_TILE] Stream-K operator() Reboot (#3064) arai713 2025-10-27 09:14:17 -07:00
  • 054fdb765c [CK_TILE] Stream-K operator() Reboot (#3064) arai713 2025-10-27 09:14:17 -07:00
  • fda9de5430 Merge commit '0b684230158892bcd1eb24f8ba1524c2d0c02170' into develop assistant-librarian[bot] 2025-10-27 16:13:55 +00:00
  • 3dd0779bf7 Add .cline* files to .gitignore (#3101) John Shumway 2025-10-27 08:29:15 -07:00
  • facd83876e Add .cline* files to .gitignore (#3101) John Shumway 2025-10-27 08:29:15 -07:00
  • 0b68423015 Add .cline* files to .gitignore (#3101) John Shumway 2025-10-27 08:29:15 -07:00
  • 1f13003f1e fix format Sami Remes 2025-10-27 15:17:58 +00:00
  • 470d6e4df4 Merge remote-tracking branch 'origin/develop' into samremes/bmatrix_2d_blockscale Sami Remes 2025-10-27 15:17:14 +00:00
  • 48838830f9 Clean up batched contraction: remove old UniversalGemmKernel path Mohsen Saffari 2025-10-27 15:14:47 +00:00
  • d1c71e6283 Merge commit '06973b1cf4987b5f2e7fc1fe504b56df58edaf1f' into develop assistant-librarian[bot] 2025-10-27 15:13:31 +00:00
  • 02653cd69d Fix multi-abd tests bug (#3099) Enrico Degregori 2025-10-27 16:09:02 +01:00
  • b0c0571809 Fix multi-abd tests bug (#3099) Enrico Degregori 2025-10-27 16:09:02 +01:00
  • 06973b1cf4 Fix multi-abd tests bug (#3099) Enrico Degregori 2025-10-27 16:09:02 +01:00
  • d16618825d ck-builder: ck factory grouped conv fwd bias bnorm clamp Robin Voetter 2025-10-27 15:32:11 +01:00
  • 2d86cd0081 fix formatting Sami Remes 2025-10-27 14:28:06 +00:00
  • 742af334f4 Jenkins Alerts Notifications (#3086) andrew clark 2025-10-27 08:24:36 -06:00
  • 66310cc5bf Jenkins Alerts Notifications (#3086) andrew clark 2025-10-27 08:24:36 -06:00
  • a1ce64374f Jenkins Alerts Notifications (#3086) andrew clark 2025-10-27 08:24:36 -06:00
  • 2cb1d61ec6 ck-builder: ck factory grouped conv fwd scaleadd scaleadd relu Robin Voetter 2025-10-27 15:21:33 +01:00
  • 2a309d7534 ck-builder: ck factory grouped conv fwd bias clamp Robin Voetter 2025-10-27 15:09:46 +01:00
  • 78ba0358bd Ck tile engine preshuffle (#2919) Thrupti Raj Lakshmana Gowda 2025-10-27 09:15:34 -05:00
  • 20ef4380d7 Ck tile engine preshuffle (#2919) Thrupti Raj Lakshmana Gowda 2025-10-27 09:15:34 -05:00
  • 8b185e872e Ck tile engine preshuffle (#2919) Thrupti Raj Lakshmana Gowda 2025-10-27 09:15:34 -05:00
  • 98deefac3e Enable NWarps replication for bquant tile dstr Sami Remes 2025-10-27 14:09:07 +00:00
  • f709bedcd6 fix interpreter path on remove_exec_bit script Robin Voetter 2025-10-27 14:46:03 +01:00
  • 37738e4cb8 Add more specialized tile distributions Sami Remes 2025-10-27 13:43:02 +00:00
  • 10869a06b7 ck-builder: ck factory grouped conv fwd convinvscale Robin Voetter 2025-10-27 13:30:51 +01:00
  • b24e1bf32b Refactor instance_traits_util and add unit tests tests John Shumway 2025-10-25 16:58:05 -04:00
  • ea7f5faa3e ck-builder: ck factory grouped conv fwd scale Robin Voetter 2025-10-24 18:13:20 +02:00
  • 5679bcfe49 ck-builder: ck factory grouped conv fwd scaleadd ab Robin Voetter 2025-10-24 18:00:43 +02:00
  • d15334ed0d ck-builder: ck factory grouped conv fwd Robin Voetter 2025-10-24 14:21:54 +02:00
  • 16db75fadf ck-builder: ck factory convscale relu/add Robin Voetter 2025-10-24 12:12:48 +02:00
  • c07b436666 ck-builder: add InstanceSet and InstanceMatcher Robin Voetter 2025-10-22 18:43:59 +02:00
  • 5d1e915a8f Update cmake/SetupDocs.cmake pmaybank 2025-10-27 11:11:13 +00:00
  • a0847290d8 make a start on RDNA / Navi specific doc Philip Maybank 2025-10-20 12:25:54 +01:00
  • c121d5a4c4 Merge branch 'develop' into philipm/documentation-cleanup-5 pmaybank 2025-10-27 10:51:53 +00:00
  • a464269bb6 Fix in the comments Qianfeng Zhang 2025-10-27 10:36:15 +00:00
  • 4eeb5cc917 Update to gemm_0's CBlockDistribution encoding so that it is compatible with gemm_1's ABlockDistribution encoding Qianfeng Zhang 2025-10-27 10:34:45 +00:00
  • 4fdde500eb change tilem moe_block_m_32 felix 2025-10-26 08:24:24 +00:00