cutlass

mirror of https://github.com/NVIDIA/cutlass.git synced 2026-05-11 17:00:05 +00:00

Artem Belevich ce2b3f695d Fixed debug macros for clang.

Unlike nvcc, clang always sees both host and device-side code during
compilation. The CUDA_LOG macro is used in both host and device code, so when
it expanded to device-only code, using it from host-side functions resulted in
errors.

To make CUDA_LOG work with clang, it was split into two parts: a pair of
target-attribute-based overloaded functions that perform the host- or
device-specific parts of logging, and a printf that works on both sides.
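
The split described above might look roughly like the following. This is a
minimal sketch, not the code from this commit: sketch_log_block_id and
SKETCH_CUDA_LOG are hypothetical names, and the device overload is only
compiled when clang is building CUDA (assumed here to be detectable through
__clang__ together with __CUDA__). In any other build only the host half is
present, so the snippet also compiles as ordinary C++.

```cpp
#include <stdio.h>

// Outside a CUDA compilation, define the target attributes away so the host
// half of this sketch still builds with an ordinary C++ compiler.
#if !defined(__CUDA__) && !defined(__CUDACC__)
#define __host__
#define __device__
#endif

// Hypothetical helper (not a CUTLASS name): host half of the overload pair.
// There is no block index on the host, so it reports -1.
__host__ inline int sketch_log_block_id() { return -1; }

#if defined(__clang__) && defined(__CUDA__)
// Device half, clang CUDA only: same name and signature, different target
// attribute. clang picks this overload when the caller is device code.
__device__ inline int sketch_log_block_id() {
  unsigned int block = blockIdx.x;
  return static_cast<int>(block);
}
#endif

// Hypothetical macro name. printf is callable from both sides; only the
// prefix argument differs between host and device. The macro needs at least
// one argument after the format string.
#define SKETCH_CUDA_LOG(format, ...) \
  printf("[block %d] " format, sketch_log_block_id(), __VA_ARGS__)
```

Called from a host function, SKETCH_CUDA_LOG("value=%d\n", 42) prints
"[block -1] value=42". The macro body contains no device-only expression, so
clang can parse it in host functions, and the device-specific work stays
behind the overload. Under nvcc this sketch only provides the host helper, so
using the macro in device code there would need a separate path.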

2017-12-11 14:52:30 -08:00

gemm

Update license info

2017-12-06 10:00:59 -05:00

util

Fixed debug macros for clang.

2017-12-11 14:52:30 -08:00