nvbench

mirror of https://github.com/NVIDIA/nvbench.git synced 2026-05-12 09:15:47 +00:00

Author	SHA1	Message	Date
Allison Piper	50e764a308	Merge remote-tracking branch 'origin/main' into fea/axes_iteration_space	2025-05-01 15:14:58 +00:00
Sergey Pavlov	433376fd83	Restrict stopping criterion parameter usage in command line (#174 ) * restrict stopping criterion parameter usage in command line * Update docs for stopping criterion. * Add convenience benchmark_base API for criterion params. * Add more test cases for stopping criterion parsing. --------- Co-authored-by: Sergey Pavlov <psvvsp89@gmail.com> Co-authored-by: Allison Piper <alliepiper16@gmail.com>	2025-04-30 15:53:45 -04:00
Allison Piper	e4057575c7	Disable throttling when `sync` exec tag is used.	2025-04-24 22:48:35 +00:00
Allison Piper	18926ced87	Replace references to `peak_sm_clock` with `default_sm_clock`. The actual measured clock speed can exceed this value, so default is less confusing than peak.	2025-04-14 11:33:04 -04:00
Georgy Evtushenko	254ac2517f	Remove discard on throttle option	2025-04-12 21:13:13 -07:00
Georgy Evtushenko	b926daf09f	Better throttle recovery delay	2025-04-12 21:04:12 -07:00
Georgy Evtushenko	f29f7ac2fb	Detect throttle Signed-off-by: Georgy Evtushenko <evtushenko.georgy@gmail.com>	2025-04-11 14:35:40 -07:00
Allison Piper	a6df59a9b5	Add support for CPU-only benchmarking. Fixes #95. CPU-only mode is enabled by setting the `is_cpu_only` property while defining a benchmark, e.g. `NVBENCH_BENCH(foo).set_is_cpu_only(true)`. An optional `nvbench::exec_tag::no_gpu` hint can also be passed to `state.exec` to avoid instantiating GPU benchmarking backends. Note that a CUDA compiler and CUDA runtime are always required, even if all benchmarks in a translation unit are CPU-only. Similarly, a new `nvbench::exec_tag::gpu` hint can be used to avoid compiling CPU-only backends for GPU benchmarks.	2025-04-08 11:17:23 -04:00
Georgy Evtushenko	b789240c76	Entropy-based stopping criterion	2024-01-05 14:59:48 -08:00
Bryce Adelstein Lelbach aka wash	39b2770b62	Fix typo in documentation: `set_type_axis_names` should be `set_type_axes_names`	2023-10-05 13:16:16 -04:00
Robert Maynard	dc7e2b789d	Drop ability to zip axii after construction	2022-08-29 10:24:45 -04:00
Robert Maynard	99395df136	Update to cross reference docs	2022-08-23 14:46:03 -04:00
Robert Maynard	5b000e8988	More cleanup	2022-04-12 11:15:03 -04:00
Robert Maynard	f791475941	Update docs/benchmarks.md Co-authored-by: Allison Vacanti <alliepiper16@gmail.com>	2022-04-12 09:47:05 -04:00
Robert Maynard	f4570d43cf	Update docs/benchmarks.md Co-authored-by: Allison Vacanti <alliepiper16@gmail.com>	2022-04-12 09:46:55 -04:00
Paul Große-Bley	7f51ead595	Add --disable-blocking-kernel and --profile options.	2022-04-08 20:03:44 +02:00
Robert Maynard	a25f578891	Rename tie_axes to zip_axes	2022-02-28 14:23:35 -05:00
Robert Maynard	344878e9dc	Allow users to control iteration via the concept of iteration spaces. Changes in the work include: - [x] Internally use linear_space for iterating - [x] Simplify type and value iteration in `state_iterator::build_axis_configs` - [x] Store the iteration space in `axes_metadata` - [x] Expose `tie` and `user` spaces to user - [x] Add tests for `linear`, `tie`, and `user` - [x] Add examples for `tie` and `user`	2022-02-25 15:09:51 -05:00
Allison Vacanti	48d94259b4	Fix typo in new docs.	2022-02-11 14:01:49 -05:00
Allison Vacanti	039d455727	Move documentation on streams to new subsection. Also update to use `nvbench::make_cuda_stream_view`.	2022-02-11 13:29:06 -05:00
Yunsong Wang	e7c29c1c1b	Update docs	2022-02-06 19:34:57 -05:00
Yunsong Wang	a2a12c689c	Update docs/benchmarks.md Co-authored-by: Jake Hemstad <jhemstad@nvidia.com>	2022-02-06 19:31:20 -05:00
Yunsong Wang	76cbbcc8f9	Update benchmarks.md	2022-02-04 17:20:40 -05:00
Allison Vacanti	b948e79cab	Add NVML support for persistence mode, locking clocks. Locking clocks is currently only implemented for Volta+ devices. Example usage: my_bench -d [0,1,3] --persistence-mode 1 --lock-gpu-clocks base See the cli_help.md docs for more info.	2021-12-17 13:59:43 -05:00
Allison Vacanti	1875d9962d	Document new `--version` option.	2021-10-26 17:45:20 -04:00
Allison Vacanti	6d79c80152	Add --run-once option. Fixes #10. Adds a mode that forces a benchmark to only run once, simplifying profiling usecases. This can be enabled by any of the following methods: * Passing `--run-once` on the command line * `NVBENCH_CREATE(...).set_run_once(true)` when declaring a benchmark * `state.set_run_once(true)` from within the benchmark implementation.	2021-10-07 16:28:15 -04:00
Allison Vacanti	ff507596bf	Fix typo in docs.	2021-04-12 14:48:45 -04:00
Allison Vacanti	4e83e048ba	Store percentages as ratios. Human-readable outputs (md) and CLI inputs still use percentages. In-memory and machine-readable outputs (csv, json) use ratios. This is the convention that spreadsheet apps expect. Fixes #2.	2021-03-18 13:42:43 -04:00
Allison Vacanti	60c94d9ed6	Add `enum_type_axis` and `examples/enums.cu`. - `enum_type_axis` simplifies using integral_constants with type axes. - `examples/enums.cu` demonstrates various ways of implementing parameter sweeps with enum types.	2021-03-16 13:57:52 -04:00
Yunsong Wang	a097e6d90d	Minor corrections in doc	2021-03-11 16:47:03 -05:00
Allison Vacanti	3fc75f5ea6	Add more examples. - exec_tag_timer - exec_tag_sync - skip - throughput	2021-03-09 16:03:14 -05:00
Allison Vacanti	33aa9e1a07	Update README to link to the new example.	2021-03-08 18:26:26 -05:00
Allison Vacanti	922a6d09d0	Add `--json` option to CLI docs.	2021-03-05 16:37:23 -05:00
Allison Vacanti	33fa0c773f	Typo.	2021-03-04 23:24:37 -05:00
Allison Vacanti	65bc2c1e3f	Documentation overhaul. Revamp README, split into multiple files. Add docs on CLI. Add `--help` and `--help-axis`.	2021-03-04 18:40:23 -05:00

35 Commits