Oleksandr Pavlyk
d5d188eed4
Changed title, fixed references, added intro borrowed from README
2026-03-13 10:58:54 -05:00
Oleksandr Pavlyk
d62efb4e70
Tweak to index.rst intro sentence and title
2026-03-13 08:06:21 -05:00
Oleksandr Pavlyk
b09105eaa8
Enumerate free functions in nvbench:: namespace
2026-03-12 16:04:39 -05:00
Oleksandr Pavlyk
20b0b72e1e
Make sure to reset __module__ of reexported symbols to be cuda.bench
2026-03-12 13:58:00 -05:00
Oleksandr Pavlyk
8537c66b9a
Fixed quotes of types in docstrings
2026-03-12 13:57:20 -05:00
Oleksandr Pavlyk
4d887a44e4
Fix cpp_benchmarks, add py_benchmarks
...
1. Fixed xrefs in docs/sphinx-combined/cpp_benchmarks.md, which is built on top of
   docs/benchmarks.md.
   Added level-1 heading, and pushed existing headings one level down.
2. Added py_benchmarks.md to document benchmarking of Python scripts.
3. Rearranged entries in index.rst so that overview documents come before
   API enumeration.
2026-03-12 13:53:58 -05:00
Oleksandr Pavlyk
f227266ad0
Account for test_cpp_exceptions -> _test_cpp_exception, same for *_py_*
2026-03-11 15:54:07 -05:00
Oleksandr Pavlyk
e9208552c9
Fixed relative text alignment in docstrings to fix autodoc warnings
...
Renamed test_cpp_exception and test_py_exception functions to start with
underscore, signaling that these functions are internal and should not be
documented
2026-03-11 15:52:50 -05:00
Oleksandr Pavlyk
b41981d9a6
Remove folders and scripts for building C++ docs/Python docs separately
2026-03-11 15:52:36 -05:00
Oleksandr Pavlyk
c519c02c9e
Add sphinx-combined folder that builds combined C++ & Python docs
2026-03-11 15:51:36 -05:00
Oleksandr Pavlyk
25ab62e39f
Add scaffolding to build C++/Python docs
2026-03-10 16:12:55 -05:00
Bernhard Manfred Gruber
728212f9f1
Merge pull request #315 from bernhardmgruber/plot_diff_script
...
Extend `nvbench_compare.py` with `--plot`, axis/benchmark filtering, and dark mode
2026-02-28 01:38:27 +01:00
Bernhard Manfred Gruber
4164909c52
Feedback
2026-02-28 01:19:18 +01:00
Oleksandr Pavlyk
5387d2005b
Merge pull request #322 from oleksandr-pavlyk/feature/save-frequencies
...
Save frequencies when bulk-saving of times is enabled
SM clock rates are now always collected, even if throttling threshold is set to zero
2026-02-27 13:30:11 -06:00
Oleksandr Pavlyk
c9705de4a4
Reserve enough space for clock rates for min samples, if specified
2026-02-27 12:49:35 -06:00
Bernhard Manfred Gruber
0abc8ec82b
Extend nvbench_compare.py with --plot, axis/benchmark filtering, and dark mode
...
Co-authored-by: Oleksandr Pavlyk <21087696+oleksandr-pavlyk@users.noreply.github.com>
2026-02-27 11:06:20 +01:00
Oleksandr Pavlyk
ba7150e447
Merge pull request #314 from bernhardmgruber/plot_script
...
Add a script to plot benchmark results
2026-02-26 12:59:16 -06:00
Bernhard Manfred Gruber
800f640c20
Apply reviewer feedback
2026-02-26 19:23:51 +01:00
Oleksandr Pavlyk
998ab125ce
Don't override m_check_throttling if throttling threshold is non-positive
...
measure_cold class now directly inherits m_check_throttling from state.
This ensures that when `--jsonbin` is specified frequency data corresponding
to timing data are available to write out.
2026-02-20 16:34:53 -06:00
Oleksandr Pavlyk
731e0c2c30
Swapped data members m_sm_clock_rates and m_sm_clock_rate_accumulator
...
This places all std::vector members together. Added default initialization
to all std::vector members, and all other members with default constructors.
Exceptions are references and the nvbench::launch m_launch member.
2026-02-19 15:33:57 -06:00
Oleksandr Pavlyk
4da9f431c0
Templatize write_out_values for different storage formats
...
This could be used to save data as float32_t, or float64_t.
This flexibility is useful for experimentation.
2026-02-19 15:32:00 -06:00
Oleksandr Pavlyk
988420b5b1
Use write_out_values utility to save frequencies
...
The utility was already used to save times
2026-02-13 10:19:06 -06:00
Georgy Evtushenko
40b2f4ece2
Better place to stop freq timer?
2026-02-13 09:53:59 -06:00
Georgy Evtushenko
a487a38895
Dump frequencies
2026-02-13 08:49:41 -06:00
Bernhard Manfred Gruber
d3a0bec4a8
Feedback from review
2026-02-05 14:13:16 +01:00
Bernhard Manfred Gruber
28ed32bb47
Implement dark mode using style sheets
2026-02-05 14:00:33 +01:00
Bernhard Manfred Gruber
ec9759037d
I have no idea what I am doing
2026-02-05 11:15:27 +01:00
Bernhard Manfred Gruber
ccde9fc4d4
More
2026-02-05 10:56:36 +01:00
Bernhard Manfred Gruber
0be190b407
Add a script to plot benchmark results
2026-02-05 10:36:52 +01:00
Nader Al Awar
dc59f98ecd
Remove cupti from cuda-bench dependencies (#311)
python-0.2.0
2026-02-03 14:16:26 -06:00
Bernhard Manfred Gruber
90ad8bcbc7
Merge pull request #296 from bernhardmgruber/compare_sub_results
...
Allow partial comparison in `nvbench_compare.py`
2026-02-03 20:02:34 +01:00
Bernhard Manfred Gruber
c6ef87575c
Allow partial comparison in nvbench_compare.py
...
Fixes: #295
2026-02-03 16:32:11 +01:00
Nader Al Awar
d75fc74162
Merge branch 'main' into remove-cupti-python
2026-02-03 08:58:41 -06:00
Oleksandr Pavlyk
867d5d4276
Merge pull request #294 from oleksandr-pavlyk/add-docstrings
2026-02-03 08:51:55 -06:00
Oleksandr Pavlyk
8a128ed7d9
Merge pull request #309 from oleksandr-pavlyk/support-skipping-batched-runs
2026-02-02 17:57:45 -06:00
Nader Al Awar
4fa4296810
Remove cuda.pathfinder function
2026-02-02 16:43:45 -06:00
Nader Al Awar
f2d5730104
Disable CUPTI in cmake file
2026-02-02 16:03:15 -06:00
Nader Al Awar
6df5fc8c67
Remove cupti from cuda-bench dependencies
2026-02-02 15:37:13 -06:00
Oleksandr Pavlyk
a33a454a2d
Make skip_hot_measurement method const
2026-02-02 14:42:07 -06:00
Oleksandr Pavlyk
f049f10977
Fix typo
2026-02-02 14:41:42 -06:00
Oleksandr Pavlyk
cfb4a9b8b0
Fix for comment grammar
2026-02-02 12:58:15 -06:00
Oleksandr Pavlyk
27d6492355
Factor out check for whether to skip hot measurement to a nvbench::state private method
2026-02-02 12:43:39 -06:00
Oleksandr Pavlyk
cff6df9bb2
Renamed option to --no-batch to stay aligned with tag name
2026-02-02 12:28:39 -06:00
Oleksandr Pavlyk
8ff0557ad8
Replace use of py::handle to store global_registry
...
Use py::gil_safe_call_once_and_store facility pybind11 provides.
2026-02-02 11:55:48 -06:00
Oleksandr Pavlyk
39c29026fd
Move docstrings from PYI file to implementation
...
Added tests that docstrings exist and are not empty.
This closes #291
2026-02-02 11:55:48 -06:00
Oleksandr Pavlyk
f1b9d44304
Support --no-batched CLI option
...
The option sets m_skip_batched boolean member in benchmark_base class.
Methods `bool get_skip_batched()` and `void set_skip_batched(bool)` added.
m_skip_batched is also added to state class. Similarly named methods
are added.
CLI help file documents `--no-batched` option.
2026-02-02 11:32:57 -06:00
Nader Al Awar
34a089f805
Add 89-real to list of architectures built for cuda-bench (#308)
2026-01-30 13:35:17 -06:00
Nader Al Awar
7b5887a4a6
Add 89-real to list of architectures built
2026-01-30 13:02:42 -06:00
Nader Al Awar
a5ad480dfe
Add installation instructions to cuda-bench readme (#307)
...
Add installation instructions to `cuda-bench` readme
2026-01-30 10:02:56 -06:00
Nader Al Awar
edf0b80599
Add installation instructions
2026-01-30 09:32:44 -06:00