vtk-m2

Author	SHA1	Message	Date
Robert Maynard	311618a15f	Enable highest level of warnings (W4) under MSVC. This will make VTK-m warning level match the one used by VTK. This commit also resolves the first round of warnings that W4 exposes.	2017-09-22 13:04:28 -04:00
Kenneth Moreland	c3a3184d51	Update copyright for Sandia. Sandia National Laboratories recently changed management from the Sandia Corporation to the National Technology & Engineering Solutions of Sandia, LLC (NTESS). The copyright statements need to be updated accordingly.	2017-09-20 15:33:44 -06:00
Allison Vacanti	00320e5dc0	Use vtkm::UInt64 for byte sizes. vtkm::Id could be just 32 bits, which limits the number of values we would be able to store for large arrays.	2017-09-19 09:21:14 -04:00
Allison Vacanti	28ab480a40	Fix warnings on renar.	2017-09-18 15:33:02 -04:00
Robert Maynard	b9e69217ae	Merge topic 'typedef_to_using_round_4' f6863594 Convert VTK-m over to use 'using' instead of 'typedef' Acked-by: Kitware Robot <kwrobot@kitware.com> Merge-request: !885	2017-08-17 16:38:49 -04:00
Sujin Philip	72a6cf4a21	Change cuda calls to use the per-thread stream.	2017-08-17 11:03:02 -04:00
Robert Maynard	f68635941e	Convert VTK-m over to use 'using' instead of 'typedef'	2017-08-17 10:47:25 -04:00
Robert Maynard	89f439999a	Reduce the number of typedef statements in DeviceAdapters. By using the auto keyword and decltype we can reduce the number of complex typedefs that exist when writing device adapter algorithms. The goal is to make it easier for developers to see the actual algorithms being implemented, by reducing the amount of template 'noise'.	2017-08-16 14:23:21 -04:00
Allison Vacanti	0a828189ad	Reuse user-provided cuda device pointers when possible.	2017-08-03 17:21:13 -04:00
David C. Lonie	bd042ec567	Add CudaAllocator to encapsulate runtime managed memory logic. Unified memory is now used when we detect that the hardware supports it.	2017-07-31 09:08:27 -04:00
Robert Maynard	76532264c3	Correct signed to unsigned warnings in ArrayManagerExecutionCuda.	2017-07-20 10:42:09 -04:00
David C. Lonie	379c3a0fad	Use current device when allocating managed memory.	2017-07-13 12:55:22 -04:00
Li-Ta Lo	65910f139c	put return value back to ScanInclusivePortal	2017-07-10 12:11:51 -06:00
Li-Ta Lo	16b61d8697	Make ScanInclusiveByKey and ScanInclusiveByKey void functions. These two algorithms do not return meaningful return values. Generic interface and implementation are both void. Remove erroneous return type and statement for CUDA backend.	2017-07-07 15:11:42 -06:00
Robert Maynard	09a08fea8d	Correct errors in TaskTuner found from the TestBuilds.	2017-07-07 08:40:03 -04:00
Robert Maynard	c11f29c093	Move the parameter sweeping code to a separate header. The parameter sweeping code is only enabled when tuning for new GPUs, so we should move it to a separate header to make DeviceAdapterAlgorithmThrust easier to read.	2017-07-05 14:58:19 -04:00
David C. Lonie	b2c3e41645	Refactor array transfer logic for basic storage. The old templated array transfer mechanism generated a lot of code that ended up doing a simple, type-agnostic memcpy for most devices. This patch specialized array handles for basic storage and uses a fast-path array transfer implementation. This reduces the size of the vtkm_cont library by 27% on gcc (from 6.2MB to 4.5MB).	2017-06-29 13:18:44 -04:00
Robert Maynard	cfda0593be	ArrayManagerExecutionThrustDevice stops generating casting warnings.	2017-06-16 08:50:32 -04:00
Robert Maynard	5dd346007b	Respect VTK-m convention of parameters all or nothing on a line. clang-format BinPack settings have been disabled to make sure that the VTK-m style guideline is obeyed.	2017-05-26 13:53:28 -04:00
Robert Maynard	60a405ef65	Add TaskTiling1D/3D which use faux virtuals to reduce binary size. Redesigns the TBB and Serial backends and the vtkm::exec::Task concept so that we can re-use the same launching logic for all worklets, instead of generating per-worklet code. To keep the performance the same, the TilingTask is now passed a range of indices to work on, rather than a single index. Binary size reduction: WorkletTests_SERIAL old - 19MB; WorkletTests_SERIAL new - 18MB; WorkletTests_TBB old - 39MB; WorkletTests_TBB new - 18MB; libvtkAcceleratorsVTKm old - 48MB; libvtkAcceleratorsVTKm new - 19MB	2017-05-25 11:00:01 -04:00
Kitware Robot	4ade5f5770	clang-format: apply to the entire tree	2017-05-25 07:51:37 -04:00
Kitware Robot	efbde1d54b	clang-format: sort include directives	2017-05-18 12:59:33 -04:00
Robert Maynard	022c36fa4f	Add vtkm::exec::TaskBase, and rename WorkletInvokeFunctor to TaskSingular. Previously WorkletInvokeFunctor inherited from vtkm::exec::FunctorBase, which is also the base class for all user worklets and for all functors passed to DeviceAdapter::Schedule. This change is made for a few reasons. The first is that we reduce the minimum size of user worklets. Previously the user's worklet would hold a reference to the error message, and so would the wrapper class added when calling DeviceAdapter::Schedule. Now only the user's worklet holds a reference. Second, by refactoring to have two base classes we can better improve the documentation on what responsibilities FunctorBase.h has, compared to TaskBase.	2017-05-02 16:38:43 -04:00
Sujin Philip	e9898cc5cf	Merge topic 'virtual-methods' 4049b5b2 Add ClipWithImplicitFunction Filter 82d02e46 Modify ImplicitFunctions to use Virtual Methods 968960c1 Add Virtual Methods Framework Acked-by: Kitware Robot <kwrobot@kitware.com> Acked-by: Kenneth Moreland <kmorel@sandia.gov> Merge-request: !750	2017-05-02 16:12:04 -04:00
Sujin Philip	968960c1a1	Add Virtual Methods Framework	2017-05-01 16:51:42 -04:00
Robert Maynard	80b9d74a23	Merge topic 'embed_more_into_vtkm_cont' ec6589d3 Only enable -fPIC on component static libraries when necessary. cbfe5fdd Fix up various issues with ArrayHandles in vtkm_cont. 355eea88 Get the vtkm cont cuda object to compile properly. 6ecc22bb First pass at compiling ArrayHandle into vtkm_cont. Acked-by: Kitware Robot <kwrobot@kitware.com> Merge-request: !715	2017-04-26 13:47:10 -04:00
Li-Ta Lo	897b2f0f63	add tests for 1, 2 and ARRAY_SIZE elements for both ScanInclusiveByKey and ScanExclusiveByKey	2017-04-19 13:38:28 -06:00
Li-Ta Lo	a205f21043	make ScanExclusiveByKey return void, rearrange parameter ordering	2017-04-17 16:11:02 -06:00
Li-Ta Lo	7023266585	add both generic and Thrust ScanExclusiveByKey	2017-04-17 15:03:49 -06:00
Li-Ta Lo	e77f9fac6a	add CUDA implementation of ScanInclusiveByKey using Thrust library	2017-04-14 11:25:25 -06:00
David C. Lonie	cbfe5fddd9	Fix up various issues with ArrayHandles in vtkm_cont.	2017-04-05 15:45:11 -07:00
Robert Maynard	355eea887c	Get the vtkm cont cuda object to compile properly.	2017-04-05 15:45:10 -07:00
David C. Lonie	6ecc22bb8c	First pass at compiling ArrayHandle into vtkm_cont.	2017-04-05 15:45:01 -07:00
Li-Ta Lo	2bdc0be5ca	add cuda calls for memory advise as per Tom Fogel	2017-03-14 14:19:01 -06:00
Li-Ta Lo - 194699	6ce8a0135a	Merge branch 'master' into unified-memory	2017-03-09 14:54:03 -07:00
Li-Ta Lo - 194699	b470175f98	new unified memory effort with the new Thrust device	2017-03-09 14:51:45 -07:00
Sujin Philip	9eddce6c99	Rename StreamCompact to CopyIf. Plus, removes the version that uses one array as both input and stencil.	2017-03-06 11:08:27 -05:00
Sujin Philip	8c4bbc39ad	Use C++11 =delete keyword	2017-02-24 09:39:22 -05:00
Sujin Philip	a88807fd7e	Catch all exceptions by reference	2017-02-23 13:25:01 -05:00
David C. Lonie	7a41621d82	Move default device selection out of private headers. This will make the librarification of vtk-m easier as we tread that path. Refs #120.	2017-02-16 13:40:35 -05:00
Li-Ta Lo - 194699	835073dae2	clean up with custom allocator	2017-02-13 11:45:17 -07:00
David C. Lonie	f601e38ba8	Simplify exception hierarchy. Remove the ErrorControl class such that all subclasses now inherit from Error. Renamed all exception classes via s/ErrorControl/Error/. See issue #57.	2017-02-07 15:42:38 -05:00
David C. Lonie	575d74d143	Manage cuda device memory with cudaMalloc instead of thrust::vector.	2017-01-30 15:36:37 -05:00
Christopher Sewell	82c40a6374	First support for unified memory	2017-01-18 11:43:49 -07:00
Kenneth Moreland	98c8cb8657	Do not attempt to execute CUDA kernels with no blocks. I noticed that when I attempted to execute a CUDA kernel with 0 blocks, it generated a CUDA error. The error did not really matter since nothing was really supposed to run anyway. However, once we are careful about checking CUDA errors, it will cause test failures and likely other misdiagnoses.	2016-12-15 11:33:48 -07:00
Kenneth Moreland	55c159d6f0	Check error codes from CUDA functions. Most functions in the CUDA runtime API return an error code that must be checked to determine whether the operation completed successfully. Most operations in VTK-m just called the function and assumed it completed correctly, which could lead to further errors. This change wraps most CUDA calls in a VTKM_CUDA_CALL macro that checks the error code and throws an exception if the call fails.	2016-12-14 10:43:44 -07:00
Robert Maynard	b97b4cc7ef	Allow thrust::reduce to work when iterator and initial value types differ. Natively, thrust::reduce is unable to handle the use case where the iterator type and the initial value/return value are two different types. We use array handle cast to work around this problem when we detect this use case.	2016-11-25 13:11:19 -05:00
Robert Maynard	2cfc9743e3	Reduce can support reducing to a T type that isn't the ArrayHandle's T type. This has been done so that operations such as computing the Min/Max of an array can be done in a single reduce step.	2016-11-25 11:40:46 -05:00
Kenneth Moreland	fdaccc22db	Remove exports for header-only functions/methods. Change the VTKM_CONT_EXPORT to VTKM_CONT. (Likewise for EXEC and EXEC_CONT.) Remove the inline from these macros so that they can be applied to everything, including implementations in a library. Because inline is not declared in these modifiers, you have to add the keyword to functions and methods where the implementation is not inlined in the class.	2016-11-15 22:22:13 -07:00
Christopher Sewell	1ebf0c17b6	Attempt 10 to resolve Windows compiler warning with streaming storage	2016-10-21 13:10:49 -06:00


158 Commits