Commit Graph

  • 9212e3b464 AOCL-5.3 GA Release [master, AOCL-5.3, 5.3] Rayan 2026-05-20 09:46:21 +05:30
  • 4ee6f75292 GCC 16 fixes [AOCL-20260502, dev] Smyth, Edward 2026-05-18 15:54:29 +01:00
  • c845e41b38 Pack B matrix for zgemm conjugate input in SUP path (#362) Dave, Harsh 2026-04-21 11:12:52 +05:30
  • 07fd52823a Fix: Resolve label redefinition errors in zgemm sup kernel (#359) [AOCL-20260402] Dave, Harsh 2026-04-16 16:51:55 +05:30
  • cdd181a7d7 Optimize complex scalv kernels with inline assembly and FMA instructions S, Hari Govind 2026-04-13 16:06:02 +05:30
  • 5c1555b95e Updated the version string from 5.2.2 to 5.3.0 Chandrashekara K R 2026-04-02 16:45:04 +05:30
  • 88745841e0 Adding the cgemmt and sgemm memory fix patches Rayan, Rohan 2026-03-30 11:43:04 +05:30
  • c8d0259ed4 Fixing memory issue in the cgemm pack kernels on zen4 Rayan, Rohan 2026-03-30 11:08:34 +05:30
  • 95da096e4f BugFix: BF16 AVX2 fallback GEMV m=1 path for reordered B inputs (#352) V, Varsha 2026-03-27 11:31:59 +05:30
  • 0b68dfc5ed The zgemm tiny and sup paths currently only support Dave, Harsh 2026-03-25 14:45:54 +05:30
  • d995496ac1 BugFix: BF16 AVX2 fallback GEMV m=1 path for reordered B inputs V, Varsha 2026-03-25 12:34:51 +05:30
  • ad6a372abf Updated LICENSE and NOTICES files for 5.2.2 release [5.2.2] Kiran Varaganti 2026-03-20 06:23:20 +00:00
  • d512e3a736 Converting mul+add to FMA for ddot, daxpy and daxpyf zen kernels Rayan, Rohan 2026-03-20 11:07:12 +05:30
  • 6459b66b48 GTestSuite: Fix for ZGEMM tiny tests on Zen3 and earlier Smyth, Edward 2026-03-18 18:11:56 +00:00
  • f632492b91 GTestSuite: Refine tests for GEMM Smyth, Edward 2026-03-17 10:58:27 +00:00
  • 95e34d46db configure: follow reproducible-builds spec for SOURCE_DATE_EPOCH Rayan, Rohan 2026-03-17 09:49:15 +05:30
  • d93c8a7b58 GTestSuite: cleanup of disabled tests (#323) Vlachopoulou, Eleni 2026-03-16 12:04:39 +00:00
  • af4c3ef1a5 Fixing memory issues in sgemm SUP kernels on AVX2 and AVX512 Rayan, Rohan 2026-03-16 16:05:34 +05:30
  • 4a9af35bf4 Bitexactness CRC verification and per-test JSON output (#320) Vettickal Sen, Anuraj 2026-03-13 15:12:17 +05:30
  • a7da8ee174 AOCL 5.2.2 GA Release Kiran Varaganti 2026-03-12 16:08:56 +00:00
  • 48a6db6c69 add support for conjugate transpose in avx512 zgemm sup kernel (#300) Dave, Harsh 2026-03-12 19:00:53 +05:30
  • fbd45e8eab optimize edge and non-unit stride path with intrinsics in s/c axpyv kernels (#334) Dave, Harsh 2026-03-12 17:20:44 +05:30
  • c8a5b21b46 configure: follow reproducible-builds spec for SOURCE_DATE_EPOCH Request, Osi 2026-03-10 05:58:46 -07:00
  • 23b48bb999 Enable support for OpenMP 2.5 and earlier Smyth, Edward 2026-03-06 09:34:17 +00:00
  • bb6545a46b Added new thread control API with global and thread-local variants Varaganti, Kiran 2026-03-06 12:16:17 +05:30
  • cf2de1e7e6 Fix for undefined arch and model id symbols Smyth, Edward 2026-03-05 14:01:00 +00:00
  • 05e837d176 BLIS: Implement zen6 sub-configuration Smyth, Edward 2026-03-05 13:33:56 +00:00
  • e62d246789 Misc AMD CPUID improvements (#222) Smyth, Edward 2026-03-05 11:59:56 +00:00
  • 713b09b407 Remove unnecessary barrier in sup path decorator to fix ~10% DGEMM regression Varaganti, Kiran 2026-03-05 11:44:57 +05:30
  • 4e84bbfb68 Ensure Accumulation Consistency Across DOTXF and DOTXV kernels (#325) Sharma, Shubham 2026-03-03 16:38:55 +05:30
  • 3bbf12665c Ensure consistency across AVX2 and AVX512 AMAX kernels Sharma, Shubham 2026-03-03 13:01:52 +05:30
  • 6f718982d6 SGEMM RV kernel optimization on Zen4 Rayan, Rohan 2026-02-27 15:50:25 +05:30
  • 237393ec71 Coverity fixes in LPGEMM group post-ops translator Balasubramanian, Vignesh 2026-02-25 17:21:12 +05:30
  • 199f2347ba Fix AOCC version detection in CMake and config script (#321) Vlachopoulou, Eleni 2026-02-20 13:58:13 +00:00
  • c315766c8d Fixing undefined behavior in bli_arch_log Rayan, Rohan 2026-02-18 13:36:18 +05:30
  • 011c75dddb Remove unnecessary OpenMP include (AOCL) [AOCL-Feb2026-b2] Smyth, Edward 2026-02-06 10:41:38 +00:00
  • 8310b2d5d3 Optimize bli_arch_query_id and related functions Smyth, Edward 2026-02-04 13:16:46 +00:00
  • ebf8721a5c Optimizing sgemm rd kernels on zen3 (#293) Rayan, Rohan 2026-02-04 09:08:11 +05:30
  • 50ae5a05ef Updated version string from 5.1.1 to 5.2.2 Chandrashekara K R 2026-02-02 16:47:14 +05:30
  • ec6f4e96cd Replace intrinsics with inline assembly for bli_saxpyv_zen4_int and bli_saxpyf_zen_int_5 S, Hari Govind 2026-01-29 11:48:47 +05:30
  • 3b8aca0874 GTestSuite: Misc fixes (3) Smyth, Edward 2026-01-23 17:23:52 +00:00
  • dd66dfff50 cblas_ctrmm invalid diag fix Smyth, Edward 2026-01-23 16:51:35 +00:00
  • b510d06cc8 Tuned input threshold for tiny dgemm interface (#309) Dave, Harsh 2026-01-22 20:11:06 +05:30
  • 73911d5990 Updates to the build systems (CMake and Make) for LPGEMM compilation (#303) [AOCL-Jan2026-b2] Balasubramanian, Vignesh 2026-01-16 19:39:55 +05:30
  • 9f9bfbed7f GTestSuite: Banded APIs (gbmv, hbmv, sbmv, tbmv, tbsv) Smyth, Edward 2026-01-16 12:37:47 +00:00
  • c32247678c GTestSuite: Packed APIs (hpmv, spmv, tpmv, tpsv) Smyth, Edward 2026-01-16 12:08:36 +00:00
  • 72e0c001f2 GTestSuite: Packed APIs (hpr, hpr2, spr, spr2) Smyth, Edward 2026-01-16 10:25:27 +00:00
  • bd99d6cd92 GTestSuite: Misc fixes (2) Smyth, Edward 2026-01-16 00:33:16 +00:00
  • 824e289899 Tuned decision logic for DGEMV multithreading for skinny sizes. (#301) Sharma, Shubham 2026-01-14 12:08:46 +05:30
  • 9cbb1c45d8 Improving sgemm rd kernel on zen4/zen5 (#292) Rayan, Rohan 2025-12-17 18:48:50 +05:30
  • 504ac9d8a2 CMake: Adding targets and aliases so that blis works with fetch content (#179) [AOCL-Weekly-121225] Vlachopoulou, Eleni 2025-12-10 13:02:09 +00:00
  • 1d80d5fee4 Fixing doc about building bench (#290) Vlachopoulou, Eleni 2025-12-10 12:07:50 +00:00
  • a22e0022c2 SGEMM tiny path tuning for zen4 and zen5 (#267) Rayan, Rohan 2025-12-10 15:58:54 +05:30
  • b06b55e864 Update LICENSE and NOTICES files for AOCL-5.2 release (#285) KR, Chandrashekara 2025-12-10 11:25:02 +05:30
  • bbb7edcb22 thread: free global communicator after parallel region completes in p… Varaganti, Kiran 2025-12-09 19:15:52 +05:30
  • 9734fc18cc AOCL-5.2 GA Release [5.2] Kiran Varaganti 2025-12-05 11:25:50 +00:00
  • eff1b561c5 GTestSuite: Misc fixes [AOCL-Weekly-051225] Smyth, Edward 2025-12-05 17:23:47 +00:00
  • b04818bf48 GTestSuite: Conjugate dot and ger IIT_ERS tests Smyth, Edward 2025-12-05 16:01:49 +00:00
  • 8a84b2fb2c Global Communicator is now freed outside the parallel region Varaganti, Kiran 2025-12-05 15:52:08 +05:30
  • ad472369c1 Update LICENSE and NOTICES files for AOCL-5.2 release Chandrashekara K R 2025-12-05 11:47:20 +05:30
  • 17702aedef Add compiler information to "make showconfig" and "bench_getlibraryInfo" Varaganti, Kiran 2025-12-01 21:14:25 +05:30
  • 54ac36c8bc Bugfix: BF16 to F32 conversion in AVX2 F32 codepath Balasubramanian, Vignesh 2025-12-01 15:06:13 +05:30
  • 1b15f940aa Update ai_coverity.json file with project as AOCL_RELEASE and email recipients with blank (#277) Kumar, Harish 2025-11-28 21:19:15 +05:30
  • edf64e2b89 GTestSuite: Adding data pool (#272) Vlachopoulou, Eleni 2025-11-27 14:01:29 +00:00
  • f992942f6b Disabling GEMV(M1) rerouting in BF16 APIs(AVX512) Balasubramanian, Vignesh 2025-11-27 14:43:31 +05:30
  • 5c42229e05 GTestSuite: Moving data generator definitions in a cpp file (#270) Vlachopoulou, Eleni 2025-11-18 10:52:49 +00:00
  • 0923d8ff56 GTestSuite: break up tests Smyth, Edward 2025-11-17 09:12:03 +00:00
  • a8daea04ea GTestSuite: computediff improvements (#264) Vlachopoulou, Eleni 2025-11-17 08:30:20 +00:00
  • 50f3520c33 GTestSuite: Fix in swap (#266) Vlachopoulou, Eleni 2025-11-14 10:11:39 +00:00
  • 979b547876 Make all bench applications consistent (#189) Rayan, Rohan 2025-11-13 07:42:53 +05:30
  • 26588d5814 Added Fast path for single threaded AVX512 DGEMV kernel #260 (#262) Sharma, Shubham 2025-11-10 15:26:55 +05:30
  • b60542c45c Added Fast path for single threaded AVX512 DGEMV kernel #260 Sharma, Shubham 2025-11-10 10:32:36 +05:30
  • 4ecfbde082 Fix extreme values handling in GEMV S, Hari Govind 2025-11-08 12:30:03 +05:30
  • 42195faa52 DTL Windows getpid support Smyth, Edward 2025-11-07 19:16:46 +00:00
  • 15cca23574 Delete .github/self_enablement_config.yaml Prabhu, Anantha 2025-11-07 18:16:55 +05:30
  • 7fdc1229db Delete .github/recipients.yaml Prabhu, Anantha 2025-11-07 18:16:46 +05:30
  • 15eca83c74 Delete .github/psdb-jenkins-trigger.yml Prabhu, Anantha 2025-11-07 18:16:35 +05:30
  • 66694a36e7 Delete .github/workflows/ai-code-review-trigger.yml Prabhu, Anantha 2025-11-07 18:16:24 +05:30
  • 90f1e8d183 Update ai_coverity.json Prabhu, Anantha 2025-11-07 18:16:09 +05:30
  • 77e4d229be Update ai-pr-platform-app.yml Prabhu, Anantha 2025-11-07 18:06:15 +05:30
  • 25d172b835 AI Coverity Fix - Historical Analysis related changes Prabhu, Anantha 2025-11-03 18:58:44 +05:30
  • b602b61601 Update ai-pr-platform-app.yml Tyagi, Shubham 2025-11-03 15:33:52 +05:30
  • 98164e06a6 Update ai-pr-platform-app.yml Tyagi, Shubham 2025-11-03 15:08:44 +05:30
  • aebc02e2f3 Update ai-pr-platform-app.yml Tyagi, Shubham 2025-11-03 15:01:30 +05:30
  • 8c0196fef9 Add ai-pr-platform-app.yml Tyagi, Shubham 2025-11-03 14:45:31 +05:30
  • 85bd30ab3b DTL Windows getpid support Smyth, Edward 2025-11-07 17:03:25 +00:00
  • 5a73108953 Data Race and Barrier Fixes Varaganti, Kiran 2025-11-07 21:55:24 +05:30
  • 7b4e665273 Fix extreme values handling in GEMV S, Hari Govind 2025-11-07 19:55:26 +05:30
  • 9230c978a1 Fixed Data Race in Native code-path (#251) Varaganti, Kiran 2025-11-07 10:49:19 +05:30
  • 7ac261b173 Replaced omp barrier with bli_thread_barrier and added similar fix fo… (#248) Varaganti, Kiran 2025-10-31 10:01:40 +05:30
  • b729473839 Fix DTL dynamic thread logging in BLAS operations (#230) Varaganti, Kiran 2025-10-24 18:04:00 +05:30
  • 7341b12c46 Added dynamic threads and actual threads in the DTL log of SAXPY (#224) (#244) Smyth, Edward 2025-10-24 14:48:47 +01:00
  • 80619ae874 Updated version string from 5.1.1 to 5.2.0 Chandrashekara K R 2025-10-24 16:41:50 +05:30
  • 49961aa569 Fix DTL dynamic thread logging in BLAS operations (#230) [AOCL-Weekly-241025] Varaganti, Kiran 2025-10-24 18:04:00 +05:30
  • 7ae22bc636 Add OpenMP barrier before releasing threadinfo & global communicator to avoid race (#225) (#242) Dave, Harsh 2025-10-24 17:30:43 +05:30
  • 4178e3c5ff Bug Fix in BF16 AVX2 conversion path (#236) (#241) V, Varsha 2025-10-24 16:39:41 +05:30
  • 90d252d59a Add OpenMP barrier before releasing threadinfo & global communicator to avoid race (#225) Dave, Harsh 2025-10-24 16:22:45 +05:30
  • ab25b825aa Fix: Resolve Operator Precedence Warning in Zen5 DCOMPLEX Threshold Logic S, Hari Govind 2025-10-24 14:23:23 +05:30
  • 64d8d06aad Fix: Resolve Operator Precedence Warning in Zen5 DCOMPLEX Threshold Logic S, Hari Govind 2025-10-24 14:23:03 +05:30
  • e85be22da0 Adding tiny path for SGEMM (#237) Rayan, Rohan 2025-10-24 13:14:33 +05:30