Commit Graph

  • d5d188eed4 Changed title, fixed references, added intro borrowed from README pull-request/323 Oleksandr Pavlyk 2026-03-13 10:58:54 -05:00
  • d62efb4e70 Tweak to index.rst intro sentence and title Oleksandr Pavlyk 2026-03-13 08:06:21 -05:00
  • b09105eaa8 Enumerate free functions in nvbench:: namespace Oleksandr Pavlyk 2026-03-12 16:04:39 -05:00
  • 20b0b72e1e Make sure to reset __module__ of reexported symbols to be cuda.bench Oleksandr Pavlyk 2026-03-12 13:58:00 -05:00
  • 8537c66b9a Fixed quotes of types in docstrings Oleksandr Pavlyk 2026-03-12 13:57:20 -05:00
  • 4d887a44e4 Fix cpp_benchmarks, add py_benchmarks Oleksandr Pavlyk 2026-03-12 13:53:58 -05:00
  • f227266ad0 Account for test_cpp_exceptions -> _test_cpp_exception, same for *_py_* Oleksandr Pavlyk 2026-03-11 15:54:07 -05:00
  • e9208552c9 Fixed relative text alignment in docstrings to fix autodoc warnings Oleksandr Pavlyk 2026-03-11 15:52:50 -05:00
  • b41981d9a6 Remove folders and scripts for building C++ docs/Python docs separately Oleksandr Pavlyk 2026-03-11 15:52:36 -05:00
  • c519c02c9e Add sphinx-combined folder that builds combined C++ & Python docs Oleksandr Pavlyk 2026-03-11 15:51:36 -05:00
  • 25ab62e39f Add scaffolding to build C++/Python docs Oleksandr Pavlyk 2026-03-10 16:12:55 -05:00
  • 728212f9f1 Merge pull request #315 from bernhardmgruber/plot_diff_script main Bernhard Manfred Gruber 2026-02-28 01:38:27 +01:00
  • 4164909c52 Feedback Bernhard Manfred Gruber 2026-02-28 01:19:18 +01:00
  • 5387d2005b Merge pull request #322 from oleksandr-pavlyk/feature/save-frequencies Oleksandr Pavlyk 2026-02-27 13:30:11 -06:00
  • c9705de4a4 Reserve enough space clock-rates for min samples, if specified Oleksandr Pavlyk 2026-02-27 12:49:35 -06:00
  • 0abc8ec82b Extend nvbench_compare.py with --plot, axis/benchmark filtering, and dark mode Bernhard Manfred Gruber 2026-02-05 12:01:38 +01:00
  • ba7150e447 Merge pull request #314 from bernhardmgruber/plot_script Oleksandr Pavlyk 2026-02-26 12:59:16 -06:00
  • 800f640c20 Apply reviewer feedback Bernhard Manfred Gruber 2026-02-26 19:23:51 +01:00
  • 998ab125ce Don't override m_check_throttling if throttling threshold is non-positive Oleksandr Pavlyk 2026-02-20 16:34:53 -06:00
  • 731e0c2c30 Swapped data members m_sm_clock_rates and m_sm_clock_rate_accumulator Oleksandr Pavlyk 2026-02-19 15:33:57 -06:00
  • 4da9f431c0 Templatize write_out_values for different storage formats Oleksandr Pavlyk 2026-02-19 15:32:00 -06:00
  • 988420b5b1 Use write_out_values utility to save frequencies Oleksandr Pavlyk 2026-02-13 10:19:06 -06:00
  • 40b2f4ece2 Better place to stop freq timer? Georgy Evtushenko 2025-05-03 19:54:39 -07:00
  • a487a38895 Dump frequencies Georgy Evtushenko 2025-04-22 14:37:16 -07:00
  • d3a0bec4a8 Feedback from review Bernhard Manfred Gruber 2026-02-05 14:13:16 +01:00
  • 28ed32bb47 Implement dark mode using style sheets Bernhard Manfred Gruber 2026-02-05 14:00:33 +01:00
  • ec9759037d I have no idea what I am doing Bernhard Manfred Gruber 2026-02-05 11:15:27 +01:00
  • ccde9fc4d4 More Bernhard Manfred Gruber 2026-02-05 10:56:36 +01:00
  • 0be190b407 Add a script to plot benchmark results Bernhard Manfred Gruber 2026-02-05 10:36:52 +01:00
  • f77d001206 Fix pull-request/263 Bernhard Manfred Gruber 2025-08-18 16:52:54 +02:00
  • 8541265963 Also compare batch measurements in nvbench_compare.py Bernhard Manfred Gruber 2025-08-14 19:16:59 +02:00
  • dc59f98ecd Remove cupti from cuda-bench dependencies (#311) python-0.2.0 python-0.2.0-release Nader Al Awar 2026-02-03 14:16:26 -06:00
  • 90ad8bcbc7 Merge pull request #296 from bernhardmgruber/compare_sub_results Bernhard Manfred Gruber 2026-02-03 20:02:34 +01:00
  • c6ef87575c Allow partial comparison in nvbench_compare.py Bernhard Manfred Gruber 2025-12-10 11:06:55 +01:00
  • d75fc74162 Merge branch 'main' into remove-cupti-python Nader Al Awar 2026-02-03 08:58:41 -06:00
  • 867d5d4276 Merge pull request #294 from oleksandr-pavlyk/add-docstrings Oleksandr Pavlyk 2026-02-03 08:51:55 -06:00
  • 8a128ed7d9 Merge pull request #309 from oleksandr-pavlyk/support-skipping-batched-runs Oleksandr Pavlyk 2026-02-02 17:57:45 -06:00
  • 4fa4296810 Remove cuda.pathfinder function Nader Al Awar 2026-02-02 16:43:45 -06:00
  • f2d5730104 Disable CUPTI in cmake file Nader Al Awar 2026-02-02 16:03:15 -06:00
  • 6df5fc8c67 Remove cupti from cuda-bench dependencies Nader Al Awar 2026-02-02 15:37:13 -06:00
  • a33a454a2d Make skip_hot_measurement method const Oleksandr Pavlyk 2026-02-02 14:42:07 -06:00
  • f049f10977 Fix typo Oleksandr Pavlyk 2026-02-02 14:41:42 -06:00
  • cfb4a9b8b0 Fix for comment grammar Oleksandr Pavlyk 2026-02-02 12:58:15 -06:00
  • 27d6492355 Factor out check for whether to skip hot measurement to a nvbench::state private method Oleksandr Pavlyk 2026-02-02 12:43:39 -06:00
  • cff6df9bb2 Renamed option to --no-batch to stay aligned with tag name Oleksandr Pavlyk 2026-02-02 12:28:39 -06:00
  • 8ff0557ad8 Replace use of py::handle to store global_registry Oleksandr Pavlyk 2025-12-09 14:02:42 -06:00
  • 39c29026fd Move docstrings from PYI file to implementation Oleksandr Pavlyk 2025-12-09 09:39:10 -06:00
  • f1b9d44304 Support --no-batched CLI option Oleksandr Pavlyk 2026-02-02 11:32:57 -06:00
  • 34a089f805 Add 89-real to list of architectures built for cuda-bench (#308) Nader Al Awar 2026-01-30 13:35:17 -06:00
  • 7b5887a4a6 Add 89-real to list of architectures built Nader Al Awar 2026-01-30 13:02:42 -06:00
  • a5ad480dfe Add installation instructions to cuda-bench readme (#307) Nader Al Awar 2026-01-30 10:02:56 -06:00
  • edf0b80599 Add installation instructions Nader Al Awar 2026-01-30 09:32:44 -06:00
  • a29748316d Fix pypi url to publish wheel (#306) python-0.1.0 python-0.1.0-release Nader Al Awar 2026-01-29 16:03:48 -06:00
  • bd775c8c14 Use inputs.component for consistency with cuda-cccl Nader Al Awar 2026-01-29 15:10:46 -06:00
  • a8e8e176e9 Fix pypi url to publish wheel Nader Al Awar 2026-01-29 14:57:48 -06:00
  • f66f76731c Replace all occurrences of pynvbench with cuda-bench (#305) Nader Al Awar 2026-01-29 14:13:44 -06:00
  • fa1eed69c0 Rename test file to refer to cuda_bench Nader Al Awar 2026-01-29 13:53:29 -06:00
  • c14a016e40 Replace a few more occurrences Nader Al Awar 2026-01-29 13:32:09 -06:00
  • 711c1e2eb1 Replace all occurrences of pynvbench with cuda-bench Nader Al Awar 2026-01-29 13:25:17 -06:00
  • 5e7adc5c3f Build multi architecture cuda wheels (#302) Nader Al Awar 2026-01-28 19:13:24 -06:00
  • a681e2185d Add multi-cuda wheel build (#289) Ashwin Srinath 2026-01-28 10:37:55 -05:00
  • f3fa93f388 Merge pull request #290 from oleksandr-pavlyk/debug/outstanding-changes Oleksandr Pavlyk 2026-01-23 15:39:23 -06:00
  • 1170910a9f [pre-commit.ci] pre-commit autoupdate pull-request/299 pre-commit-ci-update-config pre-commit-ci[bot] 2026-01-05 21:12:56 +00:00
  • 2d4690e07d Merge pull request #298 from bernhardmgruber/ignore_device Bernhard Manfred Gruber 2025-12-10 18:24:26 +01:00
  • 85548809d6 Allow to by-pass device section check and compare different devices Bernhard Manfred Gruber 2025-12-10 13:14:50 +01:00
  • f6a9b245d3 Only trigger skipping of outstanding benchmarks on KeyboardInterrupt exception, on others benchmark is to continue execution Oleksandr Pavlyk 2025-12-08 14:46:59 -06:00
  • 7e9a9a8983 Replace main_arg_run_benchmarks with run_interriptible Oleksandr Pavlyk 2025-12-08 14:28:26 -06:00
  • 8e6154511e Introduce runner->run_or_skip(bool &) and benchmark->run_or_skip(bool &) Oleksandr Pavlyk 2025-12-08 14:24:32 -06:00
  • a7763bdd7a Remove debug outputs Oleksandr Pavlyk 2025-12-08 12:25:31 -06:00
  • b2a80c92b8 Revert "Scripts to triage 284" Oleksandr Pavlyk 2025-12-08 11:53:08 -06:00
  • ce9a76167f Use nvbench::stop_runner_loop to signal stop of runner loop Oleksandr Pavlyk 2025-12-05 14:38:48 -06:00
  • e57f1ecf4c Introduce nvbench::stop_runner_loop exception. If application throws it, runner loop is stopped and other pending benchmark instances are skipped Oleksandr Pavlyk 2025-12-05 19:32:55 -06:00
  • c286199adc Scripts to triage 284 Oleksandr Pavlyk 2025-12-05 14:40:16 -06:00
  • de471e1d42 Use pybind11==3.0.1, do not use pybind11_add_module Oleksandr Pavlyk 2025-12-05 14:39:11 -06:00
  • f651636501 entropy criterion optimizations (#286) Jerry Hou 2025-12-05 17:02:21 -08:00
  • a6995413ac Merge pull request #288 from shwina/wheel-build-and-publish-infra Ashwin Srinath 2025-12-04 04:37:07 -05:00
  • 1d33536ce1 Re-enable other CI jobs Ashwin Srinath 2025-12-03 16:42:30 -05:00
  • 603a2df445 Remove workaround Ashwin Srinath 2025-12-03 16:23:42 -05:00
  • 77b7afc3c9 Remove the Python version file Ashwin Srinath 2025-12-03 16:23:14 -05:00
  • 3af11c8ee7 Expand the CI matrix back Ashwin Srinath 2025-12-03 15:48:40 -05:00
  • cadfa7de61 We no longer need to install libnvidia-ml.so Ashwin Srinath 2025-12-03 15:37:20 -05:00
  • 7ad064ea4f Change to GPU runner for testing Ashwin Srinath 2025-12-03 15:18:39 -05:00
  • b7eaf44ca3 Install libnvidia-ml.so.1 in test environment Ashwin Srinath 2025-12-03 14:56:37 -05:00
  • c2c34c9378 Temporarily reduce CI matrix Ashwin Srinath 2025-12-03 14:37:23 -05:00
  • a293af1d52 Try capturing the Python path before changing directories Ashwin Srinath 2025-12-03 14:15:34 -05:00
  • a7f92b7436 Try an inner and outer script Ashwin Srinath 2025-12-03 13:21:53 -05:00
  • 9746aa14df Maybe fix to test script Ashwin Srinath 2025-12-03 12:47:43 -05:00
  • d1efef03bc Fix wheel naming Ashwin Srinath 2025-12-03 11:54:46 -05:00
  • 618001143b Fixes to test script Ashwin Srinath 2025-12-03 11:41:36 -05:00
  • 8443a2059c Ensure test jobs find wheels correctly Ashwin Srinath 2025-12-03 11:22:19 -05:00
  • f3df4104de Make wheels manylinux compliant Ashwin Srinath 2025-12-03 11:22:12 -05:00
  • e15d9ebf58 Lint fixes Ashwin Srinath 2025-12-03 11:07:03 -05:00
  • 98e0b5994a Introduce build-and-test-python-wheels workflow Ashwin Srinath 2025-12-03 11:06:11 -05:00
  • e9cf53a1a4 Add PR workflow for building and testing wheels Ashwin Srinath 2025-12-03 10:30:27 -05:00
  • 8b2afa6c16 Lint fixes Ashwin Srinath 2025-12-03 10:17:23 -05:00
  • 29389b5791 Initial wheel build and publishing infrastructure Ashwin Srinath 2025-12-03 10:14:15 -05:00
  • 34f1e2a7ee Merge pull request #285 from ashermancinelli/patch-1 Bernhard Manfred Gruber 2025-11-16 00:11:42 +01:00
  • e91559edf0 Update README.md Asher Mancinelli 2025-11-14 14:34:18 -08:00
  • 92d2e01cd1 Profile only the kernels involved in the benchmark (#277) comeyrd 2025-10-21 13:51:37 -04:00
  • 9b133a94bc Remove GLOBAL tags from fmt targets. (#281) Allison Piper 2025-10-21 11:16:44 -04:00