No raw index calculation (#31)

* Replace most raw index calculation to coordinate transformation
* Overhaul blockwise and threadwise GEMM
* Overhaul driver for gridwies GEMM kernel

Co-authored-by: Jing Zhang <jizhan@amd.com>

[ROCm/composable_kernel commit: 01055d95d9]
This commit is contained in:
Chao Liu
2021-05-11 00:09:25 -05:00
committed by GitHub
parent 8a0d2f6753
commit ce4662dbda
23 changed files with 2494 additions and 2933 deletions

14
script/docker-rocm3.7.sh Normal file
View File

@@ -0,0 +1,14 @@
WORKSPACE=$1
echo "workspace: " $WORKSPACE
docker run \
-it \
--rm \
--privileged \
--group-add sudo \
-w /root/workspace \
-v $WORKSPACE:/root/workspace \
asroy/tensorflow:rocm3.7-tf2.3-dev-omp \
/bin/bash
#--network host \