From 6fb5e667c16f5ea9aca9eddc6a3e6be80b5e065c Mon Sep 17 00:00:00 2001
From: Ali Hassani <68103095+alihassanijr@users.noreply.github.com>
Date: Wed, 30 Jul 2025 22:14:13 -0400
Subject: [PATCH] [Doc fix] incorrect compute cap. for Blackwell RTX (#2511)

Blackwell RTX is compute capability 12.0 (SM120) but incorrectly listed
as SM100 in the README.
---
 README.md | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/README.md b/README.md
index c9807a96f..6b0edce81 100644
--- a/README.md
+++ b/README.md
@@ -170,7 +170,7 @@ CUTLASS runs successfully on the following NVIDIA GPUs, and it is expected to be
 |NVIDIA H100 Tensor Core GPU |9.0|11.8|
 |NVIDIA H200 Tensor Core GPU |9.0|11.8|
 |NVIDIA B200 Tensor Core GPU |10.0|12.8|
-|NVIDIA GeForce RTX 50x0 series |10.0|12.8|
+|NVIDIA GeForce RTX 50x0 series |12.0|12.8|
 
 ## Target Architecture
 
@@ -202,7 +202,7 @@ cmake .. -DCUTLASS_NVCC_ARCHS="100a"
 
 Note: The NVIDIA Blackwell SM100 architecture used in the datacenter
 products has a different compute capability than the one underpinning
-NVIDIA Blackwell GeForce RTX 50 series GPUs. As a result, kernels
+NVIDIA Blackwell GeForce RTX 50 series GPUs (SM120). As a result, kernels
 compiled for Blackwell SM100 architecture with arch conditional features
 (using `sm100a`) are not compatible with RTX 50 series GPUs.
 