mirror of https://gitlab.kitware.com/vtk/vtk-m synced 2024-10-05 01:49:02 +00:00

Kenneth Moreland 1f07b0ecf6 Consolidate WarpScalar and WarpVector filter

In reflection, the `WarpScalar` filter is surprisingly a superset of the `WarpVector` features. `WarpScalar` has the ability to displace in the directions of the mesh normals. In VTK, there is a distinction of normals to vectors, but in VTK-m it is a matter of selecting the correct one. As such, it makes little sense to have two separate implementations for the same operation. The filters have been combined and the interface names have been generalized for general warping (e.g., "normal" or "vector" becomes "direction").

In addition to consolidating the implementation, the `Warp` filter implementation has been updated to use the modern features of VTK-m's filter base classes. In particular, when the `Warp` filters were originally implemented, the filter base classes did not support more than one active scalar field, so filters like `Warp` had to manage multiple fields themselves. The `FilterField` base class now allows specifying multiple, indexed active fields, and the updated implementation uses this to manage the input vectors and scalars.

The `Warp` filters have also been updated to directly support constant vectors and scalars, which is common for `WarpScalar` and `WarpVector`, respectively. Previously, to implement a constant field, you had to add a field containing an `ArrayHandleConstant`. This is still supported, but an easier method of just selecting constant vectors or scalars makes this easier.

Internally, the implementation now uses tricks with extracting array components to support many different array types (including `ArrayHandleConstant`). This allows it to simultaneously interact with coordinates, directions, and scalars without creating too many template instances.

2023-09-26 07:20:09 -04:00
BenchmarkArrayTransfer.cxx	Make BenchmarkArrayTransfer actually benchmark transfers	2020-08-04 09:16:46 -06:00
BenchmarkAtomicArray.cxx	Prefer ArrayHandle::Fill over Algorithm::Fill	2022-01-04 08:50:57 -07:00
BenchmarkCopySpeeds.cxx	Implement tbb runtime device configuration and update vtkm to use it	2021-09-20 10:24:23 -06:00
BenchmarkDeviceAdapter.cxx	Add changes for supporting Kokkos/HIP	2021-10-01 15:27:00 -04:00
Benchmarker.h	Remove brigand from Benchmarker.h	2022-03-08 07:25:08 -07:00
BenchmarkFieldAlgorithms.cxx	Merge topic 'no-execution-whole-array'	2022-10-31 14:41:54 -04:00
BenchmarkFilters.cxx	Consolidate WarpScalar and WarpVector filter	2023-09-26 07:20:09 -04:00
BenchmarkInSitu.cxx	Split flying edges and marching cells into separate filters	2023-05-04 15:20:20 +02:00
BenchmarkODEIntegrators.cxx	Remove device compiler dependencies.	2022-08-01 08:00:46 -04:00
BenchmarkRayTracing.cxx	add include CanvasRayTracer.h	2023-05-30 13:01:02 -06:00
BenchmarkTopologyAlgorithms.cxx	Remove testing headers from benchmarking	2021-06-10 09:41:26 -06:00
CMakeLists.txt	Split up the filters benchmark tests	2022-12-05 13:20:22 -07:00
README_insitu.md	Switch how InSitu benchmark iterates	2022-09-12 09:24:47 -06:00
README.md	benchmarks: pass unparsed args to Google benchmark	2020-04-21 10:52:31 -04:00
vtkm.module	Fix some deprecated hacks in modules	2022-10-27 10:24:28 -06:00

README.md

BENCHMARKING VTK-m

TL;DR

When configuring VTK-m with CMake, pass the flag -DVTKm_ENABLE_BENCHMARKS=1. In the build directory you will then see the following binaries:

$ ls bin/Benchmark*
bin/BenchmarkArrayTransfer*  bin/BenchmarkCopySpeeds* bin/BenchmarkFieldAlgorithms*
bin/BenchmarkRayTracing* bin/BenchmarkAtomicArray*    bin/BenchmarkDeviceAdapter*
bin/BenchmarkFilters* bin/BenchmarkTopologyAlgorithms*

Taking as an example BenchmarkArrayTransfer, we can run it as:

$ bin/BenchmarkArrayTransfer -d Any

Choosing devices

Taking BenchmarkArrayTransfer as an example, we can find out which devices it can run on by running it without arguments:

$ bin/BenchmarkArrayTransfer
...
Valid devices: "Any" "Serial"
...

From the list of valid devices, you can choose the device on which to run the benchmark:

$ bin/BenchmarkArrayTransfer -d Serial
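
When benchmark runs are scripted, the device list can be extracted from that output. The following is a minimal sketch, assuming the `Valid devices:` line keeps the format shown above. `parse_valid_devices` is a hypothetical helper written for this illustration and is not part of VTK-m.

```python
import re


def parse_valid_devices(output):
    """Return the device names found on the 'Valid devices:' line, if any."""
    for line in output.splitlines():
        if line.strip().startswith("Valid devices:"):
            # Device names are printed as double-quoted tokens.
            return re.findall(r'"([^"]+)"', line)
    return []


# Sample text copied from the listing above, not captured from a real run.
sample = '...\nValid devices: "Any" "Serial"\n...'
print(parse_valid_devices(sample))
```

Each returned name can then be passed to the benchmark binary with `-d`.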

Run a subset of your benchmarks

VTK-m benchmarks use Google Benchmark, which lets you choose a subset of benchmarks with the flag --benchmark_filter=REGEX

For instance, to run all the benchmarks that write something, you would run:

$ bin/BenchmarkArrayTransfer -d Serial --benchmark_filter='Write'

Note that you can list all of the available benchmarks with the option --benchmark_list_tests.
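
To preview which benchmarks a filter would select, the matching can be imitated offline. This is a rough sketch, assuming the filter is applied as an unanchored regular-expression search on the benchmark name, and that Python's `re` module behaves like the benchmark library's regex engine for simple patterns such as these. The names are taken from the sample output later in this README; `select` is a hypothetical helper.

```python
import re

# Benchmark names as they appear in this README's sample output.
names = [
    "BenchContToExecRead<F32>/Bytes:1024/manual_time",
    "BenchContToExecWrite<F32>/Bytes:1024/manual_time",
    "BenchContToExecReadWrite<F32>/Bytes:1024/manual_time",
    "BenchRoundTripRead<F32>/Bytes:1024/manual_time",
    "BenchRoundTripReadWrite<F32>/Bytes:1024/manual_time",
    "BenchExecToContRead<F32>/Bytes:1024/manual_time",
    "BenchExecToContWrite<F32>/Bytes:1024/manual_time",
    "BenchExecToContReadWrite<F32>/Bytes:1024/manual_time",
]


def select(names, pattern):
    """Keep the names in which the pattern matches anywhere (a regex search)."""
    regex = re.compile(pattern)
    return [name for name in names if regex.search(name)]


for name in select(names, "Write"):
    print(name)
```

Under that assumption, `Write` also selects the `ReadWrite` variants, since the pattern only has to match somewhere in the name.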

Compare with baseline

VTK-m ships with a helper script named compare-benchmarks.py, based on Google Benchmark's compare.py, which lets you compare benchmarks using different devices, filters, and binaries. After building VTK-m, it should appear in the bin directory within your build directory.

When running compare-benchmarks.py:

Specify the baseline benchmark binary path and its arguments in --benchmark1=
Specify the contender benchmark binary path and its arguments in --benchmark2=
Pass any extra options for compare.py after --
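
When the comparison is driven from a script, the pieces above can be assembled into an argument list. This is only a sketch: `build_compare_command` is a hypothetical helper, the script location is an assumption, and the example values mirror the device comparison shown further down in this README.

```python
import shlex


def build_compare_command(benchmark1, benchmark2, mode,
                          script="./compare-benchmarks.py"):
    """Assemble the argument list for compare-benchmarks.py.

    benchmark1 / benchmark2: binary path plus its arguments, as one string.
    mode: the compare.py mode placed after the '--' separator.
    script: assumed location of the helper script.
    """
    return [
        script,
        f"--benchmark1={benchmark1}",
        f"--benchmark2={benchmark2}",
        "--",
        mode,
    ]


argv = build_compare_command(
    "./BenchmarkArrayTransfer -d Serial --benchmark_filter=1024",
    "./BenchmarkArrayTransfer -d Cuda --benchmark_filter=1024",
    "benchmarks",
)

# Print a shell-quoted form for inspection. To actually run it, one could use
# subprocess.run(argv, check=True) from the build's bin directory.
print(shlex.join(argv))
```

Passing the arguments as a list avoids a second round of shell quoting for the nested benchmark command lines.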

Compare between filters

When comparing filters, we can only use one benchmark binary with a single device, as shown in the following example:

$ ./compare-benchmarks.py --benchmark1='./BenchmarkArrayTransfer -d Any
--benchmark_filter=1024' --filter1='Read' --filter2=Write -- filters

# It will output something like this:

Benchmark                                                                          Time             CPU      Time Old      Time New       CPU Old       CPU New
---------------------------------------------------------------------------------------------------------------------------------------------------------------
BenchContToExec[Read vs. Write]<F32>/Bytes:1024/manual_time                     +0.2694         +0.2655         18521         23511         18766         23749
BenchExecToCont[Read vs. Write]<F32>/Bytes:1024/manual_time                     +0.0212         +0.0209         25910         26460         26152         26698
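
To read the table: the Time and CPU columns appear to be the relative change from the old to the new measurement, (new - old) / |old|, which is how Google Benchmark's compare.py documents them. The sketch below recomputes them from the rounded Old/New values printed above; `relative_change` is a hypothetical helper for illustration.

```python
def relative_change(old, new):
    """Relative difference of new with respect to old: (new - old) / |old|."""
    return (new - old) / abs(old)


# (Time Old, Time New, CPU Old, CPU New), copied from the sample output above.
rows = {
    "BenchContToExec[Read vs. Write]": (18521, 23511, 18766, 23749),
    "BenchExecToCont[Read vs. Write]": (25910, 26460, 26152, 26698),
}

for name, (time_old, time_new, cpu_old, cpu_new) in rows.items():
    time_delta = relative_change(time_old, time_new)
    cpu_delta = relative_change(cpu_old, cpu_new)
    print(f"{name}: time {time_delta:+.4f}, cpu {cpu_delta:+.4f}")
```

A positive value means the second measurement (here, Write) took longer than the first (Read); for example, +0.2694 corresponds to roughly 27% more time.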

Compare between devices

When comparing two benchmarks that run on two different devices, pass the mode benchmarks after -- and call ./compare-benchmarks.py as follows:

$ ./compare-benchmarks.py --benchmark1='./BenchmarkArrayTransfer -d Serial
--benchmark_filter=1024' --benchmark2='./BenchmarkArrayTransfer -d Cuda
--benchmark_filter=1024' -- benchmarks


# It will output something like this:

Benchmark                                                              Time             CPU      Time Old      Time New       CPU Old       CPU New
---------------------------------------------------------------------------------------------------------------------------------------------------
BenchContToExecRead<F32>/Bytes:1024/manual_time                     +0.0127         +0.0120         18388         18622         18632         18856
BenchContToExecWrite<F32>/Bytes:1024/manual_time                    +0.0010         +0.0006         23471         23496         23712         23726
BenchContToExecReadWrite<F32>/Bytes:1024/manual_time                -0.0034         -0.0041         26363         26274         26611         26502
BenchRoundTripRead<F32>/Bytes:1024/manual_time                      +0.0055         +0.0056         20635         20748         21172         21291
BenchRoundTripReadWrite<F32>/Bytes:1024/manual_time                 +0.0084         +0.0082         29288         29535         29662         29905
BenchExecToContRead<F32>/Bytes:1024/manual_time                     +0.0025         +0.0021         25883         25947         26122         26178
BenchExecToContWrite<F32>/Bytes:1024/manual_time                    -0.0027         -0.0038         26375         26305         26622         26522
BenchExecToContReadWrite<F32>/Bytes:1024/manual_time                +0.0041         +0.0039         25639         25745         25871         25972

Installing compare-benchmarks.py

compare-benchmarks.py relies on compare.py from Google Benchmark, which in turn relies on SciPy; you can find instructions regarding its installation here.