* Build native arch by default, update rapids-cmake.
* Add check that the CXX and CUDA_HOST compilers match.
As in CCCL, these need to match so that our warning flag detection works properly.
* GCC recognizes only `unused-local-typedefs`.
Clang recognizes both `unused-local-typedef` and `unused-local-typedefs`. Use the plural spelling so that the flag is set for both compilers.
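
For reference, the last two items could look roughly like the CMake sketch below. This is not the code from this change. The project name, the hard-error behavior, the plain string comparison, and the choice to suppress the warning with `-Wno-` rather than enable it are all assumptions made for illustration.

```cmake
cmake_minimum_required(VERSION 3.18)
# Hypothetical project name, used only for this sketch.
project(host_compiler_check LANGUAGES CXX CUDA)

# Assumption: a mismatch is treated as a fatal error. This is a naive string
# comparison; it reports a mismatch if one variable holds a bare name ("g++")
# and the other a full path, so a real check may need to normalize paths first.
if(DEFINED CMAKE_CUDA_HOST_COMPILER
   AND NOT "${CMAKE_CUDA_HOST_COMPILER}" STREQUAL "${CMAKE_CXX_COMPILER}")
  message(FATAL_ERROR
    "CMAKE_CXX_COMPILER (${CMAKE_CXX_COMPILER}) and "
    "CMAKE_CUDA_HOST_COMPILER (${CMAKE_CUDA_HOST_COMPILER}) must match.")
endif()

# The plural spelling is accepted by both GCC and Clang.
# Assumption: the warning is suppressed; only C++ sources are affected here.
if(CMAKE_CXX_COMPILER_ID MATCHES "GNU|Clang")
  add_compile_options($<$<COMPILE_LANGUAGE:CXX>:-Wno-unused-local-typedefs>)
endif()
```

Flags meant for the host compiler during CUDA compilation would need to be forwarded separately (for example via `-Xcompiler` with nvcc), which this sketch leaves out.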