Testing all fwd convolution specializations. (#259)

mirror of https://github.com/ROCm/composable_kernel.git synced 2026-05-18 12:00:07 +00:00

* UniforFill with integer values.

* Log tested instance type string.

* Add UT for all convolution specializations.

* debugging conv

* Fix dangling reference bug.

* Small refinements.

* Fix call to error checking function.

* Small refinements to tests.

* Configure error tolerance
* Change problem size.
* Remove OddC case from types that do not support it.

* Add helper traits for AccumulatorDataType.

* Print first 5 errs in check_err for integral types.

* Rename FillUniform to FillUniformDistribution

* Refactor

* Do not use typed tests.
* Instead use plain fixture class with templatized member functions.
* Initialize tensors with integer values.

* Refine test instances.

* Properly set accumulator data type.
* Add another "big" instance.

* Refactor convolution tests.

* Revert "debugging conv"

This reverts commit b109516455.

* Add pragma once + format + small refinement.

* Fix some unwanted changes.

* Clang-format

* Fix profile_convnd to use renamed tensor initializer.

* Add instances for ConvFWDND kernel case 2D

* Helpers to get ConvNDFwd 2D instances.

* Refactoring.

* Remove "small block" instance as it was generating compiler errors.
* Remove default template parameters values.

* Refine and fix test.

* Fix problem with default template parameter types.
* Adjust error thresholds for floating point values test.
* Use integer values initialization for instances test.
* Add tests for ConvNDFwd 2D case.

* Remove AccumulatorDataType type trait.

* Update unit-tests.

* Remove operator<< overload.

* Unlock conv1d/3d nd fwd instances.

* Enable skipping calculating reference using flag.

* Fix number of channels for first ResNet50 layer.

* Clang-format.

Co-authored-by: Adam Osewski <aosewski@amd.com>
Co-authored-by: Chao Liu <chao.liu2@amd.com>

[ROCm/composable_kernel commit: a2edd7d802]

This commit is contained in:

Adam Osewski

2022-06-23 05:05:04 +02:00

committed by

GitHub

parent b6bb66a70c

commit abcca1dd45

20 changed files with 1219 additions and 268 deletions

									
										4

example/09_convnd_fwd/convnd_fwd_xdl_fp16.cpp
									
												View File
												
				@@ -291,8 +291,8 @@ int main(int argc, char* argv[])

				    float tflops     = static_cast<float>(flop) / 1.E9 / ave_time;

				    float gb_per_sec = num_btype / 1.E6 / ave_time;

				    std::cout << "Perf: " << ave_time << " ms, " << tflops << " TFlops, " << gb_per_sec << " GB/s, " << conv->GetTypeString() 

				              << std::endl;

				    std::cout << "Perf: " << ave_time << " ms, " << tflops << " TFlops, " << gb_per_sec << " GB/s, "

				              << conv->GetTypeString() << std::endl;

				    if(do_verification)

				    {

Testing all fwd convolution specializations. (#259)

4 example/09_convnd_fwd/convnd_fwd_xdl_fp16.cpp Unescape Escape View File

4

example/09_convnd_fwd/convnd_fwd_xdl_fp16.cpp

View File