Muhammed Emin Ozturk
|
9e95d54cd2
|
BF16 GEMM Stream-K (#1541)
* initial
* Cmake file
* successful compilation but validation failed
* Cmake
* update
* gpu validation
* gemm universal
* gemm universal sk update
* sk bf16 universal instance
* gemm_universal_streamk.hpp
* only build for gfx94
* Cmakelist
* profiler update, bf16 sk only works at gfx942
* clang
* clang
* clang all
* no need flags
* cmake script
* delete comment
* gemm universal sk fix
* clang
* profiler fix
* clang
* update
* update
* delete comment
* code formatting
* cmake
* fix instance
* clang
* argument supported
* argument supported and clang
* update
* fix
* removing unnecessary comments
* clang formatting
* Update library/src/tensor_operation_instance/gpu/CMakeLists.txt
Co-authored-by: afagaj <john.afaganis@gmail.com>
* CopyRight Comment 2025
* clang reformatting
* copy right 2025
---------
Co-authored-by: Emin Ozturk <ozturk.27@osu.edu>
Co-authored-by: root <root@ctr-ubbsmc16.amd.com>
Co-authored-by: Muhammed Emin Ozturk <meozturk@t004-008.hpcfund>
Co-authored-by: root <root@splinter-126-wr-d3.amd.com>
Co-authored-by: Muhammed Emin Ozturk <meozturk@t006-001.hpcfund>
Co-authored-by: Muhammed Emin Ozturk <meozturk@login1.hpcfund>
Co-authored-by: Muhammed Emin Ozturk <meozturk@t004-004.hpcfund>
Co-authored-by: Emin Ozturk <emin.ozturk@utah.edu>
Co-authored-by: Muhammed Emin Ozturk <meozturk@t008-001.hpcfund>
Co-authored-by: afagaj <john.afaganis@gmail.com>
|
2025-01-02 10:30:04 -08:00 |
|