mirror of
https://github.com/NVIDIA/nvbench.git
synced 2026-05-13 17:55:39 +00:00
This example demonstrates how to use cuda.bench and cuda.bench.results to implement simple auto-tuning, applied to selecting the tile-shape hyperparameter of a naive stencil kernel implemented in numba-cuda.
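The auto-tuning idea described above can be sketched without a GPU. The block below is a minimal CPU stand-in, not the example's actual code: it does not use cuda.bench, cuda.bench.results, or numba-cuda, and the names `stencil_tiled`, `measure`, and `autotune` are hypothetical helpers introduced only for illustration. It times a naive tiled 5-point stencil for each candidate tile shape and keeps the fastest one.

```python
import time
from itertools import product


def stencil_tiled(grid, tile):
    """Naive 5-point stencil over a list-of-lists grid, traversed tile by tile.

    `tile` is a (height, width) pair; it only changes the loop order,
    not the result. Boundary cells are copied unchanged.
    """
    n, m = len(grid), len(grid[0])
    th, tw = tile
    out = [row[:] for row in grid]
    for i0 in range(1, n - 1, th):
        for j0 in range(1, m - 1, tw):
            for i in range(i0, min(i0 + th, n - 1)):
                for j in range(j0, min(j0 + tw, m - 1)):
                    out[i][j] = 0.25 * (
                        grid[i - 1][j] + grid[i + 1][j]
                        + grid[i][j - 1] + grid[i][j + 1]
                    )
    return out


def measure(fn, repeats=3):
    """Return the best wall-clock time (seconds) over `repeats` calls of fn."""
    best = float("inf")
    for _ in range(repeats):
        start = time.perf_counter()
        fn()
        best = min(best, time.perf_counter() - start)
    return best


def autotune(candidates, measure_candidate):
    """Measure every candidate and return (fastest candidate, all timings)."""
    timings = {c: measure_candidate(c) for c in candidates}
    best = min(timings, key=timings.get)
    return best, timings


# Small synthetic input and a 3x3 sweep of tile shapes (assumed values).
grid = [[float(i * j % 7) for j in range(64)] for i in range(64)]
candidates = list(product((4, 8, 16), (4, 8, 16)))

best_tile, timings = autotune(
    candidates, lambda tile: measure(lambda: stencil_tiled(grid, tile))
)
print("best tile shape:", best_tile, "time (s):", timings[best_tile])
```

In a GPU setting the `measure` step would be replaced by a proper benchmarking harness, since wall-clock timing of a few repeats is noisy; the selection step stays the same.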
11 lines
105 B
Plaintext
numpy
numba
cuda-bindings
cuda-core
numba-cuda
cuda-cccl
cupy
nvidia-cute-dsl[cu13]
tabulate
torch[cu13]