* switch to universal gemm for batched and grouped gemms * added reviewer comments * fixed grouped gemm tests