vtk-m2

Author	SHA1	Message	Date
David Thompson	880d8a989e	Add `vtkm/Geometry.h` and test it. This commit adds several geometric constructs to vtk-m in the `vtkm/Geometry.h` header. They may be used from both the execution and control environments. We also add methods to perform projection and Gram-Schmidt orthonormalization to `vtkm/VectorAnalysis.h`. See `docs/changelog/geometry.md` included in this commit for more information.	2018-06-20 11:58:14 -04:00
ayenpure	d8e8078099	Fixing the typos with ScanExclusiveByKey - Fixed the typo - Moved the test to vtkm/worklet/testing as vtkm/cont/testing does not execute with CUDA	2018-06-15 16:39:00 -07:00
Robert Maynard	8276e35cf4	Mark classes that should not be derived from as final.	2018-06-15 10:49:59 -04:00
Robert Maynard	82cdae0025	VTK-m waits for cuda streams to finish before host access Previously it was possible for VTK-m to access memory from the host before the computations in a stream finished.	2018-06-01 10:28:55 -04:00
Robert Maynard	9c3547bc7c	VTK-m cuda runtime now handles no cuda runtime properly Previously it would throw an uncaught exception and crash.	2018-05-31 10:07:37 -04:00
Allison Vacanti	1f6a662c0a	Merge DevAdaptAlgoThrust --> DevAdaptAlgoCuda.	2018-05-29 14:07:29 -04:00
Allison Vacanti	be0c6a17a9	Move DevAdaptAtomicArrayImplementation to its own file.	2018-05-29 14:07:29 -04:00
Allison Vacanti	3af9f66083	Merge ArrayManagerExecutionThrustDevice into AMECuda.	2018-05-29 14:07:29 -04:00
Robert Maynard	4a520b7bdd	Merge topic 'pascal_managed_memory_copy_non_blocking' e0b6e698 copying cpu memory to pascal managed memory now works consistently. Acked-by: Kitware Robot <kwrobot@kitware.com> Merge-request: !1211	2018-05-18 15:15:17 -04:00
Robert Maynard	e0b6e69878	copying cpu memory to pascal managed memory now works consistently. When copying small arrays from cpu memory to pascal memory we would see subsequent kernels fail as the memory transfer hadn't finished. This is a bug as each stream should act like a FIFO queue. So for now when encountering this use case we explicitly synchronize after the memcpy.	2018-05-16 17:56:50 -04:00
Robert Maynard	1c5feeb185	Make sure all device specific tests use the intended device. This means that we not only setup the runtime device tracker to force the intended device, it also means making sure the default device is the error device.	2018-05-16 08:21:16 -04:00
Robert Maynard	e28244f345	Re-implement DeviceAdapterRuntimeDetector to avoid ODR violations. The previous implementation of DeviceAdapterRuntimeDetector caused multiple differing definitions of the same class to exist and was causing the runtime device tracker to report CUDA as disabled when it actually was enabled. The ODR was caused by having a default implementation for DeviceAdapterRuntimeDetector and a specific specialization for CUDA. If a library had both CUDA and C++ sources it would pick up both implementations and would have undefined behavior. In general it would think the CUDA backend was disabled. To avoid this kind of situation in the future I have reworked VTK-m so that each device adapter must implement DeviceAdapterRuntimeDetector for that device.	2018-05-15 13:08:34 -04:00
Robert Maynard	571556d984	CUDA's RuntimeDeviceTracker and Timer are now built as part of vtkm_cont This is done to not only reduce the amount of code that users need to generate but to reduce the amount of errors when using the RuntimeDeviceTracker. If the runtime device tracker is initially used in a library by a c++ file it will never properly detect the cuda backend. By moving the code into vtkm_cont we can make sure this problem doesn't occur.	2018-05-10 10:57:06 -04:00
Robert Maynard	364b366ab3	Correct signed/unsigned cast warnings from DeviceAdapterAlgorithmThrust Found with CUDA 7.5	2018-05-08 15:29:11 -04:00
Robert Maynard	c9ba80ad93	Replace uint with vtkm::Id in DeviceAdapterAlgorithmThrust The usage of uint was causing problems with CUDA + MSVC2015 as type was not defined. Instead we use vtkm::Id as that was the expect type to be passed to the task	2018-05-02 09:55:56 -04:00
Robert Maynard	b56894dd09	Move VTK-m Cuda backend over to a grid-stride iteration pattern. This allows for easier host side logic when determining grid and block sizes, and allows for a smaller library side by moving some logic into compiled in functions.	2018-04-30 17:29:26 -04:00
Robert Maynard	b7e6371842	Correct issues found be enabling more CUDA warnings.	2018-04-23 14:27:53 -04:00
Matt Larsen	715141737f	Merge topic 'typos' efdf8543 Misc. Typos Acked-by: Kitware Robot <kwrobot@kitware.com> Acked-by: Matt Larsen <mlarsen@cs.uoregon.edu> Merge-request: !1113	2018-04-06 18:04:46 -04:00
Robert Maynard	84311a2453	Merge branch 'master' into cmake_refactor	2018-04-05 10:18:36 -04:00
Robert Maynard	c123796949	VTK-m ArrayHandle can now take ownership of a user allocated memory location Previously memory that was allocated outside of VTK-m was impossible to transfer to VTK-m as we didn't know how to free it. By extending the ArrayHandle constructors to support a Storage object that is being moved, we can clearly express that the ArrayHandle now owns memory it didn't allocate. Here is an example of how this is done: ```cpp T* buffer = new T[100]; auto user_free_function = [](void* ptr) { delete[] static_cast<T*>(ptr); }; vtkm::cont::internal::Storage<T, vtkm::cont::StorageTagBasic> storage(buffer, 100, user_free_function); vtkm::cont::ArrayHandle<T> arrayHandle(std::move(storage)); ```	2018-04-04 11:28:25 -04:00
Robert Maynard	707970f492	VTK-m StorageBasic is now able to give/take ownership of user allocated memory. This fixes the three following issues with StorageBasic. 1. Memory that was allocated by VTK-m and Stolen by the user needed the proper free function called which is generally StorageBasicAllocator::deallocate. But that was hard for the user to hold onto. So now we provide a function pointer to the correct free function. 2. Memory that was allocated outside of VTK-m was impossible to transfer to VTK-m as we didn't know how to free it. This is now resolved by allowing the user to specify a free function to be called on release. 3. When the CUDA backend allocates memory for an ArrayHandle that has no control representation, and the location we are running on supports concurrent managed access we want to specify that cuda managed memory as also the host memory. This requires that StorageBasic be able to call an arbitrary new delete function which is chosen at runtime.	2018-04-04 11:27:57 -04:00
Robert Maynard	8808b41fbd	Merge branch 'master' into vtk-m-cmake_refactor	2018-03-29 22:51:26 -04:00
Robert Maynard	944bc3c0d6	Introduce vtkm::cont::ColorTable replacing vtkm::rendering::ColorTable The new and improved vtkm::cont::ColorTable provides a more feature complete color table implementation that is modeled after vtkDiscretizableColorTransferFunction. This class therefore supports different color spaces ( rgb, lab, hsv, diverging ) and supports execution across all device adapters.	2018-03-28 16:11:23 -04:00
luz.paz	efdf854306	Misc. Typos Found via `codespell` and `grep`	2018-03-28 09:45:07 -04:00
Robert Maynard	2bfbf0a902	Transfer of virtuals to the CUDA device now properly uses streams This way when multiple threads are using VTK-m they all won't block while one transfer a class with virtuals to the device.	2018-03-20 17:04:41 -04:00
Robert Maynard	6202d8d22d	CudaAllocator guards all CUDA 8.0+ calls behind ifdef's.	2018-02-26 16:37:57 -05:00
Robert Maynard	e630ac5aa4	Merge branch 'master' into vtk-m-cmake_refactor	2018-02-23 14:52:00 -05:00
Robert Maynard	705528bf17	vtk-m ArrayHandle + basic storage has an optimized PrepareForDevice method By hard coding the PrepareForDevice to know about all the different VTK-m devices, we can have a single base class do the execution allocation, and not have that logic repeated in each child class.	2018-02-16 10:00:28 -05:00
Robert Maynard	22f9ae3d24	vtk-m ArrayHandle + basic holds control data by StorageBasicBase By making the array handle hold the control side data by the parent storage class we remove significant code generation.	2018-02-16 09:59:20 -05:00
Robert Maynard	d0a68d3266	Refactor vtk-m storage basic to generate less code By moving all common operations to a parent class we can significantly reduce the vtk-m library size.	2018-02-16 09:59:19 -05:00
Robert Maynard	22ea58335a	iVTK-m CUDA backend doesn't use thrust::cuda::pointer any more. This was removed as CUDA 9.0 on MSVC has issues where CUB/Thrust would fail to compile when given these types.	2018-02-02 08:33:17 -05:00
Robert Maynard	0c5a087e41	Merge topic 'dont_allow_rvalue_tasks' ef611239 Don't allow DeviceTaskTypes to construct tasks from rvalues. Acked-by: Kitware Robot <kwrobot@kitware.com> Merge-request: !1062	2018-01-30 08:37:29 -05:00
luz.paz	80b11afa24	Misc. typos Found via `codespell -q 3` via downstream VTK	2018-01-30 06:51:47 -05:00
Robert Maynard	ef611239f6	Don't allow DeviceTaskTypes to construct tasks from rvalues.	2018-01-18 13:55:37 -05:00
Robert Maynard	7d7c6ab1ab	Don't allow DeviceTaskTypes to construct tasks from rvalues.	2018-01-18 13:51:30 -05:00
Robert Maynard	9c668b61e0	Simplify how we built the list of source files for vtkm_cont	2018-01-17 17:13:50 -05:00
Robert Maynard	0660c67fef	Merge branch 'master' into vtk-m-cmake_refactor	2018-01-16 15:42:28 -05:00
Sujin Philip	950b12b1f2	Add ArrayHandleVirtualCoordinates	2018-01-09 17:23:41 -05:00
Matthew Letter	e17cfddfc8	added vtkm_cont_EXPORTS flag into the build cuda, serial, and tbb were missing the vtkm_cont_EXPORTS flag	2018-01-08 14:00:58 -05:00
Robert Maynard	004bfe7b12	Prefer using existence of targets when looking for TBB/CUDA support	2018-01-08 14:00:57 -05:00
Robert Maynard	afc19ab0fc	Setup symbol visibility controls for VTK-m	2018-01-08 14:00:57 -05:00
Robert Maynard	24e57556e6	Merge branch 'master' into vtk-m-cmake_refactor Includes updating to cleanup benchmark code and handle the new MPI option	2017-12-28 14:23:21 -05:00
Sujin Philip	b530a5ce3f	Fix issue with Managed Memory for 0 size arrays	2017-12-19 17:18:24 -05:00
Matthew Letter	fac43bd812	Merge branch 'master' into cmake_refactor	2017-11-28 13:36:02 -07:00
Sujin Philip	8c242cef91	Switch from faux to true virtuals	2017-11-06 15:25:29 -05:00
Robert Maynard	a6eecbe9ac	ExecutionArrayInterface now can hint at how allocated memory will be used. Certain backends desire the ability to mark allocations as being used for reading versus writing to improve performance.	2017-11-02 10:12:57 -04:00
Matthew Letter	24d0e7766e	Merge remote-tracking branch 'remotes/origin/master' into cmake_refactor	2017-10-31 16:57:41 -06:00
Sujin Philip	5842da4921	Remove ArrayHandle CopyInto Fixes #170	2017-10-27 17:28:59 -04:00
Robert Maynard	ed8f4111ef	Update all the code to work with CMake 3.3 Obviously this does mean that CUDA is not supported with 3.3.	2017-10-27 15:30:14 -04:00
Robert Maynard	56c7362258	A thought on what CMake 3.9 would mean to VTK-m.	2017-10-27 15:29:51 -04:00
Li-Ta Lo	3acd7c37a1	Merge topic 'pointlocator' ed3a64a5 Coding style improvment 7fa800b7 Update TestingPointLocatorUniformGrid.h f1974cab Update TestingPointLocatorUniformGrid.h 508882fa PointLocatorUniformGrid Acked-by: Kitware Robot <kwrobot@kitware.com> Merge-request: !973	2017-10-25 10:42:07 -04:00
Allison Vacanti	5a99dd761b	Only use cuda hints for CUDA 8.0+.	2017-10-24 11:55:07 -04:00
Li-Ta Lo	508882fa21	PointLocatorUniformGrid Provide an accelerated neareast neighbor search of points in the dataset using a one layer uniform grid.	2017-10-19 11:44:36 -06:00
Allison Vacanti	1018d981a0	Check for overlap in CopySubRange. Some parallel copy implementations will not handle this sanely.	2017-10-11 16:52:32 -04:00
Sujin Philip	41679cb5f9	Add a CellLocator Implements a two-level uniform grid cell locator	2017-10-10 14:01:41 -04:00
Robert Maynard	f8f1adc962	Forward decleare DeviceAdapterAlgorithm correctly as a struct	2017-10-06 09:50:12 -04:00
Allison Vacanti	75f88b4c46	Add versioning to VTKM installed include/share dirs.	2017-10-02 11:39:10 -04:00
Robert Maynard	311618a15f	Enable highest level of warnings(W4) under MSVC This will make VTK-m warning level match the one used by VTK. This commit also resolves the first round of warnings that W4 exposes.	2017-09-22 13:04:28 -04:00
Kenneth Moreland	c3a3184d51	Update copyright for Sandia Sandia National Laboratories recently changed management from the Sandia Corporation to the National Technology & Engineering Solutions of Sandia, LLC (NTESS). The copyright statements need to be updated accordingly.	2017-09-20 15:33:44 -06:00
Allison Vacanti	00320e5dc0	Use vtkm::UInt64 for byte sizes. vtkm::Id could be just 32bits, which limits the number of values we would be able to store for large arrays.	2017-09-19 09:21:14 -04:00
Allison Vacanti	28ab480a40	Fix warnings on renar.	2017-09-18 15:33:02 -04:00
Robert Maynard	b9e69217ae	Merge topic 'typedef_to_using_round_4' f6863594 Convert VTK-m over to use 'using' instead of 'typedef' Acked-by: Kitware Robot <kwrobot@kitware.com> Merge-request: !885	2017-08-17 16:38:49 -04:00
Sujin Philip	72a6cf4a21	Change cuda calls to use the per-thread stream.	2017-08-17 11:03:02 -04:00
Robert Maynard	f68635941e	Convert VTK-m over to use 'using' instead of 'typedef'	2017-08-17 10:47:25 -04:00
Robert Maynard	89f439999a	Reduce the amount of typedef statements in DeviceAdapters By using the auto keyword and decltype we can reduce the number of complex typedefs that exist when writing device adapter algorithms. The goal being that it is easier for developers to see the actual algorithms being implemented, by reducing the amount of template 'noise'.	2017-08-16 14:23:21 -04:00
Allison Vacanti	326757b571	Remove ArrayHandleCuda. !861 (b0dba9a1) adds this functionality to basic ArrayHandles.	2017-08-10 15:21:52 -04:00
Allison Vacanti	0a828189ad	Reuse user-provided cuda device pointers when possible.	2017-08-03 17:21:13 -04:00
David C. Lonie	bd042ec567	Add CudaAllocator to encapsulate runtime managed memory logic. Unified memory is now used when we detect that the hardware supports it.	2017-07-31 09:08:27 -04:00
Robert Maynard	76532264c3	Correct signed to unsigned warnings in ArrayManagerExecutionCuda.	2017-07-20 10:42:09 -04:00
David C. Lonie	379c3a0fad	Use current device when allocating managed memory.	2017-07-13 12:55:22 -04:00
Li-Ta Lo	65910f139c	put return value back to ScanInclusivePortal	2017-07-10 12:11:51 -06:00
Li-Ta Lo	16b61d8697	Make ScanInclusiveByKey and ScanInclusiveByKey void functions. These two algorithms does not return meaningful return values. Generic interface and implementation are both void. Remove erronous return type and statement for CUDA backend.	2017-07-07 15:11:42 -06:00
Robert Maynard	09a08fea8d	Correct errors in TaskTuner found from the TestBuilds.	2017-07-07 08:40:03 -04:00
Robert Maynard	c11f29c093	Move the parameter sweeping code to a separate header. The parameter sweeping code is only enabled when tuning for new GPU's so we should move it to a separate header to make DeviceAdapterAlgorithmThrust easier to read.	2017-07-05 14:58:19 -04:00
David C. Lonie	b2c3e41645	Refactor array transfer logic for basic storage. The old templated array transfer mechanism generated a lot of code that ended up doing a simple, type-agnostic memcpy for most devices. This patch specialized array handles for basic storage and uses a fast-path array transfer implementation. This reduces the size of the vtkm_cont library by 27% on gcc (from 6.2MB to 4.5MB).	2017-06-29 13:18:44 -04:00
Robert Maynard	cfda0593be	ArrayManagerExecutionThrustDevice stops generating casting warnings.	2017-06-16 08:50:32 -04:00
Robert Maynard	5dd346007b	Respect VTK-m convention of parameters all or nothing on a line clang-format BinPack settings have been disabled to make sure that the VTK-m style guideline is obeyed.	2017-05-26 13:53:28 -04:00
Robert Maynard	60a405ef65	Add TaskTiling1D/3D which use faux virtuals to reduce binary size. Redesigns the TBB and Serial backends and the vtkm::exec::Task concept so that we can re-use the same launching logic for all Worklets, instead of generating per worlet code. To keep the performance the same the TilingTask now is past a range of indices to work on, rather than a single index. Binary size reduction: WorkletTests_SERIAL old - 19MB WorkletTests_SERIAL new - 18MB WorkletTests_TBB old - 39MB WorkletTests_TBB new - 18MB libvtkAcceleratorsVTKm old - 48MB libvtkAcceleratorsVTKm new - 19MB	2017-05-25 11:00:01 -04:00
Kitware Robot	4ade5f5770	clang-format: apply to the entire tree	2017-05-25 07:51:37 -04:00
Kitware Robot	efbde1d54b	clang-format: sort include directives	2017-05-18 12:59:33 -04:00
Robert Maynard	022c36fa4f	Add vtkm::exec::TaskBase, and rename WorkletInvokeFunctor to TaskSingular Previously WorkletInvokeFunctor inherited from vtkm::exec::FunctorBase, which is also the base class for all users Worklets and for all functors based to DeviceAdapter::Schedule. This is done for a few reasons. The first is that we reduce the minimum size of user worklets. Previously the users worklet would hold a reference to the error message, and so would the wrapper class added when calling DeviceAdapter::Schedule. Now we only have the users worklet holding a reference. Second, by refactoring to have two base classes we can better improve the documentation on what responsibilities FunctorBase.h has, compared to TaskBase.	2017-05-02 16:38:43 -04:00
Sujin Philip	e9898cc5cf	Merge topic 'virtual-methods' 4049b5b2 Add ClipWithImplicitFunction Filter 82d02e46 Modify ImplicitFunctions to use Virtual Methods 968960c1 Add Virtual Methods Framework Acked-by: Kitware Robot <kwrobot@kitware.com> Acked-by: Kenneth Moreland <kmorel@sandia.gov> Merge-request: !750	2017-05-02 16:12:04 -04:00
Sujin Philip	82d02e46ef	Modify ImplicitFunctions to use Virtual Methods	2017-05-01 16:55:59 -04:00
Sujin Philip	968960c1a1	Add Virtual Methods Framework	2017-05-01 16:51:42 -04:00
Robert Maynard	80b9d74a23	Merge topic 'embed_more_into_vtkm_cont' ec6589d3 Only enable -fPIC on component static libraries when necessary. cbfe5fdd Fix up various issues with ArrayHandles in vtkm_cont. 355eea88 Get the vtkm cont cuda object to compile properly. 6ecc22bb First pass at compiling ArrayHandle into vtkm_cont. Acked-by: Kitware Robot <kwrobot@kitware.com> Merge-request: !715	2017-04-26 13:47:10 -04:00
Li-Ta Lo	0ba9784082	Merge topic 'scanbykey' 5c735a38 this should resovle all the type conversion warnings d8b02329 Merge branch 'scanbykey' of gitlab.kitware.com:ollielo/vtk-m into scanbykey 58ef7c8d one more attemp to get the data type right 22d0e355 attempt to fix warning on type conversion ded4583a attempt to fix warning on type conversion 897b2f0f add tests for 1, 2 and ARRAY_SIZE elements for both ScanInclusiveByKey and ScanExclusiveByKey c05a2c32 Merge branch 'scanbykey' of gitlab.kitware.com:ollielo/vtk-m into scanbykey 0e97fcb9 handle the case of 0 or 1 element in the input for ScanExclusiveByKey ... Acked-by: Kitware Robot <kwrobot@kitware.com> Merge-request: !746	2017-04-25 15:54:25 -04:00
Li-Ta Lo	897b2f0f63	add tests for 1, 2 and ARRAY_SIZE elements for both ScanInclusiveByKey and ScanExclusiveByKey	2017-04-19 13:38:28 -06:00
Li-Ta Lo	a205f21043	make ScanExclusiveByKey return void, rearrange parameter ordering	2017-04-17 16:11:02 -06:00
Li-Ta Lo	7023266585	add both generic and Thrust ScanExclusiveByKey	2017-04-17 15:03:49 -06:00
Li-Ta Lo	e77f9fac6a	add CUDA implementation of ScanInclusiveByKey using Thrust library	2017-04-14 11:25:25 -06:00
David C. Lonie	4807b3c472	Silence warnings about unavoidable weak vtables. - Exception classes cannot be exported due to MSVC's design decisions. See http://stackoverflow.com/questions/24511376. We must leave these classes as header only and silence the warnings. - TransferResource in BufferState.h must remain a header-only class since there is no vtkm_interop library to compile the class into. - The VTKDataSetReader hierarchy must similarly remain header-only since there is no vtkm_io library. - The OptionParser Action classes are part of a header-only utility and cannot be easily compiled into a library. -	2017-04-13 14:06:33 -04:00
David C. Lonie	cbfe5fddd9	Fix up various issues with ArrayHandles in vtkm_cont.	2017-04-05 15:45:11 -07:00
Robert Maynard	355eea887c	Get the vtkm cont cuda object to compile properly.	2017-04-05 15:45:10 -07:00
David C. Lonie	6ecc22bb8c	First pass at compiling ArrayHandle into vtkm_cont.	2017-04-05 15:45:01 -07:00
Li-Ta Lo	2bdc0be5ca	add cuda calls for memory advise as per Tom Fogel	2017-03-14 14:19:01 -06:00
Li-Ta Lo - 194699	6ce8a0135a	Merge branch 'master' into unified-memory	2017-03-09 14:54:03 -07:00
Li-Ta Lo - 194699	b470175f98	new unified memory effort with the new Thrust device	2017-03-09 14:51:45 -07:00
Sujin Philip	9eddce6c99	Rename StreamCompact to CopyIf Plus, removes the version that uses one array as both input and stencil.	2017-03-06 11:08:27 -05:00
Sujin Philip	8c4bbc39ad	Use C++11 =delete keyword	2017-02-24 09:39:22 -05:00
Sujin Philip	a88807fd7e	Catch all exceptions by reference	2017-02-23 13:25:01 -05:00
David Lonie	a0016456cc	Merge topic 'default_device_to_public_header' 7a41621d Move default device selection out of private headers. Acked-by: Kitware Robot <kwrobot@kitware.com> Acked-by: Kenneth Moreland <kmorel@sandia.gov> Merge-request: !693	2017-02-17 08:26:01 -05:00
David C. Lonie	7a41621d82	Move default device selection out of private headers. This will make the librarification of vtk-m easier as we tread that path. Refs #120.	2017-02-16 13:40:35 -05:00
Tom Fogal	31e20859d4	Fix typo.	2017-02-14 10:29:00 -08:00
Li-Ta Lo - 194699	835073dae2	clean up with custom allocator	2017-02-13 11:45:17 -07:00
David C. Lonie	f601e38ba8	Simplify exception hierarchy. Remove the ErrorControl class such that all subclasses now inherit from error. Renamed all exception classes via s/ErrorControl/Error/. See issue #57.	2017-02-07 15:42:38 -05:00
Robert Maynard	6e4e07b378	Merge topic 'all_array_portals_have_set' 629271bc Make sure all ArrayPortals have a Set method. Acked-by: Kitware Robot <kwrobot@kitware.com> Merge-request: !676	2017-02-01 09:08:51 -05:00
Kenneth Moreland	629271bceb	Make sure all ArrayPortals have a Set method. The current design for ArrayPortalVirtual makes it a requirement for all array portals (that it wraps) to have Set defined. Thus, make sure Set is defined for all ArrayPortal. Where Set is invalid, an assert is thrown if something calls it at runtime.	2017-01-31 15:46:39 -05:00
David C. Lonie	575d74d143	Manage cuda device memory with cudaMalloc instead of thrust::vector.	2017-01-30 15:36:37 -05:00
Christopher Sewell	82c40a6374	First support for unified memory	2017-01-18 11:43:49 -07:00
Robert Maynard	bd1ff7a5ac	Allow vtkm errors to properly work with shared libraries.	2017-01-16 09:17:38 -05:00
Robert Maynard	d8d6fd1741	StorageTags are now always exported to resolve future dynamic_cast issues. Class that need to be passed across dynamic library boundaries such as DynamicArrayHandle need to be properly export. One of 'tricks' of this is that templated classes such as PolymorphicArrayHandleContainer need the Type and Storage types to have public visibility. This makes sure that all vtkm storage tags have public visibility so in the future we can transfer dynamic array handles across libraries.	2017-01-16 09:17:38 -05:00
Kenneth Moreland	7ec8c03489	Remove warning about constant conditional in VTKM_ASSUME It is common practice to swallow a semicolon in a macro using a do/ while(false) loop that only executes once. However, the Visual Studio 2013 compiler was stupidly warning about having a constant expression for the loop. Get around the problem by creating our own swallow semicolon macros that disable this warning as necessary.	2017-01-10 10:40:17 -07:00
Kenneth Moreland	98c8cb8657	Do not attempt to execute CUDA kernels with no blocks I noticed that when I attempted to execute a CUDA kernel with 0 blocks, it generated a CUDA error. The error did not really matter since nothing was really supposed to run anyway. However, once we are careful about checking CUDA errors, it will cause test failures and likely other misdiagnoses.	2016-12-15 11:33:48 -07:00
Kenneth Moreland	55c159d6f0	Check error codes from CUDA functions Most functions in the CUDA runtime API return an error code that must be checked to determine whether the operation completed successfully. Most operations in VTK-m just called the function and assumed it completed correctly, which could lead to further errors. This change wraps most CUDA calls in a VTKM_CUDA_CALL macro that checks the error code and throws an exception if the call fails.	2016-12-14 10:43:44 -07:00
Robert Maynard	b97b4cc7ef	Allow thrust::reduce to work when iterator and initial value types differ. natively thrust::reduce is unable to handle the use case where the iterator type and the initial value/return value are two different types. We use array handle cast to work around this problem when we detect this usecase.	2016-11-25 13:11:19 -05:00
Robert Maynard	2cfc9743e3	Reduce can support reduce to a T type that isn't the arrayhandles T type. This has been done so that operations such as computing the Min/Max of an array can be done in a single reduce step.	2016-11-25 11:40:46 -05:00
Kenneth Moreland	fdaccc22db	Remove exports for header-only functions/methods Change the VTKM_CONT_EXPORT to VTKM_CONT. (Likewise for EXEC and EXEC_CONT.) Remove the inline from these macros so that they can be applied to everything, including implementations in a library. Because inline is not declared in these modifies, you have to add the keyword to functions and methods where the implementation is not inlined in the class.	2016-11-15 22:22:13 -07:00
Christopher Sewell	1ebf0c17b6	Attempt 10 to resolve Windows compiler warning with streaming storage	2016-10-21 13:10:49 -06:00
Christopher Sewell	05975a2325	Attempt 3 to resolve Windows compiler warning with streaming storage	2016-10-20 10:32:30 -06:00
Christopher Sewell	c6e15c1240	Merge remote-tracking branch 'upstream/master' into StreamingArray	2016-10-07 18:10:29 -06:00
Robert Maynard	0f58d6fc54	Add vtkm/cont/serial directory for the serial backend.	2016-09-28 14:22:53 -04:00
Robert Maynard	f7a9bbe0e7	All cuda / tbb unit tests follow the same naming pattern.	2016-09-28 14:22:53 -04:00
Christopher Sewell	d92f39df12	Merge branch 'master' into StreamingArray	2016-09-15 17:54:59 -06:00
Christopher Sewell	610f96a831	Adding streaming inclusive scan	2016-09-15 17:46:09 -06:00
Robert Maynard	f81c42b9b4	Replace NULL with nullptr where applicable.	2016-09-01 09:38:25 -04:00
Robert Maynard	51e50d2933	Add DeviceAdapter::CopySubRange to all device adapters. This allows callers to copy a subsection of an array into another array, without clearing the contents of the destination array if a resize is required.	2016-08-24 15:42:51 -04:00
Christopher Sewell	767a5f6792	Clean up for streaming dispatcher	2016-08-03 19:47:12 -06:00
Christopher Sewell	504e6f55df	Working streaming dispatcher	2016-08-03 18:48:31 -06:00
Christopher Sewell	ced9fd32db	Improving streaming dispatcher	2016-08-03 15:34:43 -06:00
Christopher Sewell	2caf81a4af	First commit for ArrayHandleStreaming	2016-08-02 16:54:12 -06:00
Robert Maynard	45ada6b55a	Rework ArrayHandleCuda to make it stop generate warnings	2016-07-20 12:41:13 -04:00
Robert Maynard	4ca6ce2ad6	nvcc doesn't have troubles with boost shared_ptr optimizations	2016-07-19 13:26:23 -04:00
Kenneth Moreland	51a35cb4fe	Fix warnings about type conversions	2016-06-27 07:50:15 -06:00
Kenneth Moreland	fd29c81bde	Fix warnings about redefined macros The build automatically sets some macros when building CUDA files. Some of the CUDA sources were setting the same macros, which was causing warnings. Change the code to be more careful about setting preprocessor macros.	2016-06-27 07:50:13 -06:00
Kenneth Moreland	7ff20c9230	Fix includes for CUDA builds The CMake CUDA build targets do not respect the target_include_directories (yet?). Instead, add the necessary includes to cuda_include_directories().	2016-06-22 12:53:23 -06:00
Kenneth Moreland	b5415169e2	Change Field and related methods to use Range and Bounds First, be more explicit when we mean a range of values in a field or a spacial bounds. Use the Range and Bounds structs in Field and CoordinateSystem to make all of this more clear (and reduce a bit of code as well).	2016-05-29 18:49:36 -06:00
Robert Maynard	e5c3f9c42d	Solve reduce by key bugs with cuda 7.5 + maxwell hardware. The concern is now all architectures are doing a hardware sync on reduce_by_key. This isn't a super serious concern, but it is a downside.	2016-05-12 13:24:59 -04:00
Matt Larsen	eeae5c1352	Adding fast path for radix sort sort_by_key	2016-05-03 08:53:42 -07:00
Kenneth Moreland	cc497e6a1b	Remove cont/Assert.h and exec/Assert.h These asserts are consolidated into the unified Assert.h. Also made some minor edits to add asserts where appropriate and a little bit of reconfiguring as found.	2016-04-20 15:41:14 -06:00
Matt Larsen	e5c4aa3f78	Fixing cuda index error	2016-03-08 12:41:11 -08:00
Matt Larsen	249cce352b	Adding type restrictions to serial atomics	2016-03-08 10:39:23 -08:00
Matt Larsen	40b6db7eee	Inserted missing ,	2016-03-08 09:51:50 -08:00
Matt Larsen	3b46706e1f	Adding compare and swap and removing unsigned atomics	2016-03-08 09:41:02 -08:00
Matt Larsen	e8b08f2e00	Merge branch 'master' into feature/atomics	2016-03-04 08:03:33 -08:00
Robert Maynard	2f98cdf717	Resolves Issue #56 : ChooseCudaDevice functions are in the proper namespace.	2016-02-26 13:51:28 -05:00
Matt Larsen	2baac9cd8b	initial commit of atomic adds	2016-02-10 07:51:31 -08:00
Kenneth Moreland	3f446ad261	Add ErrorControlCuda for better CUDA error checking. Add lots of checks to CUDA calls in the timer to try to identify any problems that might be showing up on the dashboard. Also adding some print statements around the sleep function in the device adapter testing. For some reason the problem just went away with them.	2016-01-12 15:19:54 -07:00
Kenneth Moreland	f582729803	Synchronize the CUDA timer on both the start and end events Previously, the timer for CUDA devices only called cudaEventSynchronize at the end event when asking for the elapsed time. This, however, could allow time to pass from when the timer was reset to when the start event happened that was not recorded in the timer. This added synchronization should make sure that all time spent in CUDA is recorded.	2016-01-12 15:13:56 -07:00
Robert Maynard	c70da5fc23	Extend vtkm::DeviceAdapterTraits to include a unique numeric identifier. Previously each device adapter only had a unique string name. This was not the best when it came to developing data structures to track the status of a given device at runtime. This adds in a unique numeric identifier to each device adapter. This will allow classes to easily create bitmasks / lookup tables for the validity of devices.	2015-12-16 11:18:52 -05:00
Robert Maynard	a7127f0fc3	Adding vtkm::cont::RuntimeDeviceInformation. The RuntimeDeviceInformation class allows developers to check if a given device is supported on a machine at runtime. This allows developers to properly check for CUDA support before running any worklets.	2015-12-15 17:25:27 -05:00

1 2 3 4 5 ...

356 Commits