composable_kernel

mirror of https://github.com/ROCm/composable_kernel.git synced 2026-04-19 14:29:05 +00:00

Author	SHA1	Message	Date
Illia Silin	d072790fe2	Fix CI error. (#530 ) * ignore .git folder when doing clang-format * fix syntax * add backslashes before quotes * add path filter for several extensions	2022-12-06 15:09:51 -06:00
Illia Silin	39abb4704a	Fix build issue and schedule daily tests with latest staging compiler version. (#470 ) * run branch once a day, with release and staging compilers * add GetDockerImage in Clang stage * apply the new triggers to the develop branch	2022-10-11 12:06:36 -05:00
Illia Silin	7fc3ed761a	Allow setting ROCM version, activate cchache, etc. (#462 ) * enable ccache and decouple it from MIOpen ccache use * fix the ccache check script * use another method to get server name * fix syntax * add quotes around the server name variable * use check_host as function * change syntax * fix syntax * test if server name is parsed correctly * try different syntax * check the env var value * test new check node function * add ROCMVERSION parameter and fix script syntax * fix script syntax * add missing instances of rocm version * install ccache in the docker image * do not check GPU in clang format stage, clean up old code * update defaults and clean up	2022-10-01 18:48:19 -05:00
Illia Silin	b882554758	Fix build issues, set new compiler default, etc. (#451 ) * add an option to select specific compiler commit * change the logic of forcing building a docker * add check for compiler commit in dockerfile * compiler check syntax fix * change compiler selection logic * fix the new compiler build issue * set new compiler as default, update dev-requirements * fix jenkins syntax * fix docker syntax * get rid of hipcc.pl editing in jenkinsfile * fix the hipcc.pl in both places * try to fix the 10738 compiler linking bug * fix syntax * use dockerhub to store images * use newer amd-stg-open commit as default	2022-09-27 15:26:56 -05:00
Illia Silin	aa0b05156f	Replace the obsolete offload-arch flags with GPU_TARGETS and fix a bug. (#437 ) * replace obsolete offload-arch flags with GPU_TARGETS * fix a build error for client app * replace commma with semicolon in GPU_TARGETS	2022-09-22 09:32:25 -05:00
Illia Silin	85b0920dc8	Build the CK targets only once. (#433 ) * build CK only once, use deb package in all subsequent stages * update jenkins file * change prefix for build_CK stage * update writing deb metadata to control file * update ubuntu source for docker, script syntax for deb package metadata * try different way to create deb metadata * clean up DEBIAN before creating one * fix the CI folder names, fix splitK qa * use correct docker in all stages, separate tests for splitK verification and performance * clean old comments, change dir before packaging * use different package syntax * change packaging syntax * package with cmake * remove unnecessary build prefix * get rid of unnecessary paths * change paths during unpacking * change script syntax while unpacking * get rid of unneccesary steps * get rid of comments in the scripts * use double quotes for scripts * add ccache during build, try dpkg -x * pull and install each package separately * use full package names * try to use stashing for packages * change stash/unstash syntax * move unstash out of shell, run tests on any gpu node * unpack each package separately * try re-using existing workspace * merge the build and test stages, only stash ckProfiler * merge the build and test stages, only stash zipped ckProfiler * fix syntax * add GPU check before build and test, rename docker to usual name	2022-09-21 14:30:13 -05:00
Illia Silin	9f7c193064	use rocm5.2 compiler as default, use same flags for amd-stg-open as for release (#426 )	2022-09-20 11:08:09 -05:00
Illia Silin	b22ebd4485	Upgrade the OS and ROCM versions. (#411 ) * upgrade the OS and ROCM versions in CK docker * add cxx flags to link code with rocm5.2 and ck-9110 compiler * rename the docker image * run ONNX gemms using init=1	2022-09-13 10:39:14 -05:00
Illia Silin	ce74cea407	Add stderr to QA logfiles, process splitK and ONNX gemm kernels (#402 ) * add processing for the onng_gemm and splitK_gemm * add profile_onnx_gemm.sh * add stderr to logfiles, add splitK and onnx gemm parsing * enable splitK gemm wresults posting to db	2022-09-07 13:59:44 -05:00
Illia Silin	1e5b59df22	Add an option to build CK with clang directly (#387 ) * replace hipcc compiler with clang++ * build client app with hipcc * build client app with clang * add an option to build with hipcc ro clang * fix the environment for client app * fix setting up compiler in cmake_build * change the way the compiler is set	2022-08-26 12:51:39 -05:00
Illia Silin	9efd033bee	restart the stages on MI200 in case of failures (#366 ) * restart the stages on MI200 * fix the docker image storage issue	2022-08-18 14:54:47 -05:00
Illia Silin	de60d290b6	Build docker only once in CI, fix conv_bwd logfile names. (#353 ) * build docker in separate stage * build docker with only one prefix * add parallel statement * add docker repo url * fix the name of perf_conv_bwd_data log file	2022-08-12 12:30:37 -05:00
Illia Silin	aba7fefce7	Fix QA, allow switching compiler versions, fix google test compilation error. (#348 ) * allow selecting compiler version * fix typo * add Wno-deprecated flag for google tests * change git repo, fix qa log files names * change the git clone syntax * use Omkar's git credentials * try to use jenkins as git user * try using illsilin username for gerrit repo with ssh key * try new gerrit authorization * change ssh key syntax * try another way of passing ssh key to docker * add mount ssh in dockerfile * create .ssh folder * move ssh-keyscan to later * get rid of npm call * build first docker image on master * check the contents of the .ssh folder * try replacing omkars creds with gerrit creds * use open repo, clean up changes * get rid of ssh default argument	2022-08-08 13:49:14 -05:00
Illia Silin	984b3722bf	Run CI on MI100 nodes only, run daily QA on MI200 nodes. (#339 ) * turn on full qa only on gfx90a, use int initialization * change script syntax * update script parsing clinfo, throw exception if 0 devices * fix syntax * try using toBoolean for the QA conditions * run regular CI on MI100 only, use MI200 only for daily QA * evaluate when conditions before agent * launch QA on develop branch and update profile_reduce script * update test script * update script * remove false dependency from dockerfile * try removing rbuild completely Co-authored-by: Chao Liu <chao.liu2@amd.com> Co-authored-by: Chao Liu <lc.roy86@gmail.com>	2022-08-02 09:17:11 -05:00
Illia Silin	85978e0201	comment out cron trigger (#334 )	2022-07-22 13:52:10 -05:00
Illia Silin	d8415a96b3	Add full QA with verification option, few other changes. (#331 ) * add verify flag and update scripts * replace old check_error function with the new check_err * fix syntax * remove blank spaces * remove empty line * add check_err for tensors * fix syntax * replace tensors with vectors in check_err calls * fix syntax * remove blank spaces * fix syntax * add new line at end of file * disable conv2d_bwd_weight test, add gpu check * set check_gpu using export * check GPU using runShell * add definition of runShell * fix script syntax * reduce the number of threads, add full qa option * run processing scripts in bash * fix the branch and host names in performance scripts, add chronos * replace parameterizedCron with cron * archive the perf log files * try to fix git call * pass branch and host names as arguments into scripts * fix script arguments * fix script arguments * process results on master * fix pipeline * add definition of gpu_arch * run processing scripts in docker * fix the brackets * add agent master for the processing stage * get rid of show_node_info call on master * try using mici label instead of master, disable MI100 tests for now * fix syntax * simplify container for results processing * remove node(master) from the process_results stage * put all stages in original order * change the agent label from master to mici for gfx908	2022-07-21 15:25:46 -05:00
Illia Silin	39acaea36d	Add switch between compilers, make 9110 compiler default, add full QA scripts. (#322 ) * adding scripts for full perf test suite * uncomment the sql queries * fix typo and chmod a+x for scripts * dos2unix for all new scripts * disable verification in full performance test * fix reduction scripts, add gfrouped_gemm hotfix * fix the grouped_gemm hotfix and only run reduction for fp16 * change compiler flag syntax * fix syntax * add predefinition of dockerArgs * avoid redefinitions of dockerArgs * add blank space at the end of dockerArgs * try to build with release compiler * adding spaces inside if condition * limit the number of threads for building 9110 compiler * change the way HIP_CLANG_PATH is set * remove the export command * change the conditional ENV syntax * set HIP_CLANG_PATH at docker run time * update scripts for full qa * enable the sql write query * fix typo * remove a comment from a script	2022-07-13 09:27:43 -05:00
Chao Liu	aebd211c36	External Interface (#304 ) * add client example * clean * clean * reorg * clean up profiler * reorg * clea * fix profiler * function for getinstances * update client example * update client example * update client example * update * update example * update Jenkins file * update cmake * update Jenkins	2022-06-26 19:39:02 -05:00
Chao Liu	d1db6a0c3e	Absolute include path (#281 ) * ad gelu and fast_gelu * added GeLU and fast GeLU * clean up * add gemm+fastgelu example * add gemm+gelu instances * update profiler * clean up * clean up * adding gemm+bias+activation * clean * adding bias * clean * adding gemm multiple d * debugging * add gemm bias add fastgelu * rename, clean * refactoring; add readme * refactor * refactor * refactor * refactor * refactor * refactor * fix * fix * update example * update example * rename * update example * add ckProfiler * clean * clean * clean * clean * add client app example * update readme * delete obselete files * remove old client app * delete old file * cleaning * clean * remove half * fix header path * fix header path * fix header path * fix header path * fix header path * fix header path for all examples * fix header path * fix header path * fix header path * fix header path * fix header path * fix header path * fix header path * fix header path * fix header path * revert client app example * clean build * fix build * temporary disable client test on Jenkins * clean * clean * clean	2022-06-24 20:51:04 -05:00
Illia Silin	e4584d91ac	Don't look up the /sys/module/amdgpu/version file. (#287 ) * use pre-built docker instead of building a new one * try docker.image.pull * change syntax in docker.image() * add 30 min timeout * increase timeout to 3 hours * move performance tests to first stage for testing * set image variable to the new container name * update image name * check available images * check available images in both places * try different image name * use image ID to refer to image * run performance on gfx90a * fix the gpu_arch labeling, add parameter * move env vars out of stages * add stand-alone performance script, MI200 tests, CU numbers * dos2unix for run_perf_tests.sh * try the new git credentials * use env var for git credentials * don't look up /sys/module/amdgpu/version Co-authored-by: Chao Liu <chao.liu2@amd.com>	2022-06-17 15:11:21 -05:00
Illia Silin	fb9b6b1e33	Use new github credentials (#278 ) * use pre-built docker instead of building a new one * try docker.image.pull * change syntax in docker.image() * add 30 min timeout * increase timeout to 3 hours * move performance tests to first stage for testing * set image variable to the new container name * update image name * check available images * check available images in both places * try different image name * use image ID to refer to image * run performance on gfx90a * fix the gpu_arch labeling, add parameter * move env vars out of stages * add stand-alone performance script, MI200 tests, CU numbers * dos2unix for run_perf_tests.sh * try the new git credentials * use env var for git credentials	2022-06-15 21:26:48 -05:00
Illia Silin	1ced00a577	Add performance tests on MI200 in CI, reporting number of CUs, add stand-alone perf test. (#277 ) * use pre-built docker instead of building a new one * try docker.image.pull * change syntax in docker.image() * add 30 min timeout * increase timeout to 3 hours * move performance tests to first stage for testing * set image variable to the new container name * update image name * check available images * check available images in both places * try different image name * use image ID to refer to image * run performance on gfx90a * fix the gpu_arch labeling, add parameter * move env vars out of stages * add stand-alone performance script, MI200 tests, CU numbers	2022-06-10 14:43:43 -05:00
Illia Silin	1677cf705e	Adding Resnet50 test to Performance tests (#268 ) * add resnet50 test to performance tests * add blanks before gpu_arch in log files * add resnet50 test with N=4 and process its results * add ROCM and HIP versions to test tables * uncomment the sql queries * fix script syntax in jenkinsfile	2022-06-02 18:16:59 -05:00
Illia Silin	1085794df3	Add performance tests as a stage of CI. (#247 ) * modify ckProfiler_gemm output * fix syntax * change ckProfiler output and return 0 * fix syntax * output datatype * fix syntax * output datatype in another way * fix syntax * fix syntax * test return values of ckProfiler * add layout info and tests, make sure ckprofiler returns 0 * fix syntax * change layout output * fix syntax * fix syntax again * update script to process perf results * rearrange jenkins stages * fix typo * add python packages to Docker file * adding setuptools-rust package * modify parsing for new test parameters * test db credentials on jenkins * fix syntax * update python script to handle incomplete lines * ungrade python to 3.8 and write the gemm_params table * add sqlalchemy package to docker * move perf data processing to master node * move the master node inside a steps region * add new stage for result processing * move results processing to separate stage * reduce number of tests to speedup debugging * pass config to processPerfResults stage * run script on master in a docker container * replace show_node_info * try loading docker on master node again * use ansible node instead of master * get rid of pymysql package * try ssh connection using paramiko * put back pymysql * put the perf data processing back on the gpu node * put back artifact definition * archive the perf_log before parsing * clean up jenkinsfile, fix parsing * fix typo * enable all perf tests * put all stages in original order, finalize script * fix gpu_arch version * update parsing script * remove obsolete file causing merge conflict	2022-05-24 11:14:50 -05:00
JD	cec69bc3bc	Add host API (#220 ) * Add host API * manually rebase on develop * clean * manually rebase on develop * exclude tests from all target * address review comments * update client app name * fix missing lib name * clang-format update * refactor * refactor * refactor * refactor * refactor * fix test issue * refactor * refactor * refactor * upate cmake and readme Co-authored-by: Chao Liu <chao.liu2@amd.com>	2022-05-12 09:21:01 -05:00
Illia Silin	a3c910ac6c	Add Benchmark test into CI (#226 ) * add performance test to jenkins pipeline * fix typo * fix the syntax in conv_fwd_util.cpp * fix the error message syntax spacing * fix the error message syntax spacing again * run profile_gemm and archive results * fix typo * try to figure out the paths * try to figure out the paths one more time * skip the copying step * build ckProfiler release only once * change directory using dir * fix dir syntax * change the gemm parameters * do not pipe script output to file * try running ckProfiler directly * fix typo * use set +e * run profile_gemm.sh \|\| true * run multiple gemms and parse results * fix typo in jenkinsfile * fix syntax * add new gemm sizes, update scripts * put all jenkins steps in original order Co-authored-by: Chao Liu <chao.liu2@amd.com> Co-authored-by: Chao Liu <lc.roy86@gmail.com>	2022-05-08 02:44:18 -05:00
JD	97d8c5045e	Add gfx90a CI stage for tests (#208 ) * Add gfx90a CI stage * upgrade to ROCm 5.1 and fix formatting	2022-04-29 10:36:19 -05:00
JD	7353ec0c25	Fix `clang-format` (#189 ) * Fix clang-format filepath * update docker and fix format	2022-04-21 17:02:15 -05:00
Chao Liu	cd167e492a	Compile for gfx908 and gfx90a (#130 ) * adding compilation for multiple targets * fix build * clean * update Jekinsfile * update readme * update Jenkins * use ck::half_t instead of ushort for bf16 * rename enum classes * clean * rename * clean	2022-03-31 12:33:34 -05:00
Chao Liu	245f741457	improve parallelism for testing (#112 )	2022-03-07 10:33:12 -06:00
Chao Liu	5b178874a1	Fix Tests build (#109 ) * fix tests * remove useless file * fix test build * reduce parallelism when compiling * fix test	2022-03-05 00:44:11 -06:00
JD	992f71e371	Update test CMakeLists to add new tests automatically and add Jenkins stage for tests (#88 ) * add docker file and make default target buildable * add Jenkinsfile * remove empty env block * fix package stage * remove render group from docker run * clean up Jenkins file * add cppcheck as dev dependency * update cmake file * Add profiler build stage * add hip_version config file for reduction operator * correct jenkins var name * Build release instead of debug * Update test CMakeLists.txt reorg test dir add test stage * reduce compile threads to prevent compiler crash * add optional debug stage, update second test * remove old test target * fix tests to return proper results and self review * Fix package name and make test run without args * change Dockerfile to ues rocm4.3.1 * remove parallelism from build * Lower paralellism Co-authored-by: Chao Liu <chao.liu2@amd.com>	2022-03-03 16:59:42 -06:00
JD	2778e99758	Initial Setup for CI (#86 ) * add docker file and make default target buildable * add Jenkinsfile * remove empty env block * fix package stage * remove render group from docker run * clean up Jenkins file * add cppcheck as dev dependency * update cmake file * Add profiler build stage * add hip_version config file for reduction operator * correct jenkins var name * Build release instead of debug * clean up Co-authored-by: Chao Liu <chao.liu2@amd.com>	2022-02-18 21:44:11 -06:00

1 2 3 4 5

233 Commits