blender

Author	SHA1	Message	Date
Campbell Barton	6f3f891c58	Rename BLI_rct*_init_pt_size -> radius	2017-03-08 23:23:39 +11:00
Sergey Sharybin	75cb4850f0	Cycles: Use 1-based line number for #line directives AMD CPU platform was complaining about #line 0 directives in the code.	2017-03-08 12:45:18 +01:00
Sergey Sharybin	ecfbfe478b	Cycles: Log which device kernels are being loaded for	2017-03-08 12:33:51 +01:00
Sergey Sharybin	712f7c3640	Cycles: Make it possible to access KernelGlobals from split data initialization function	2017-03-08 11:02:54 +01:00
Sergey Sharybin	ef7c36f5ed	Cycles: Cleanup, remove residue of previous split kernel data This is all in split data state array.	2017-03-08 10:26:29 +01:00
Sergey Sharybin	a095611eb8	Fix T50886: Blender crashes on render Was a mistake in one of the previous TLS commits. See comment in the pool_create to see some details why it was crashing.	2017-03-08 09:41:38 +01:00
meta-androcto	3505be8361	update theme back to black re: T50869	2017-03-08 18:31:24 +11:00
Mai Lavelle	64751552f7	Cycles: Fix indentation	2017-03-08 01:31:32 -05:00
Mai Lavelle	fe7cc94dfa	Cycles: Fix strict warning about unused variable	2017-03-08 01:31:32 -05:00
Mai Lavelle	306034790f	Cycles: Calculate size of split state buffer kernel side By calculating the size of the state buffer in the kernel rather than the host less code is needed and the size actually reflects the requested features. Will also be a little faster in some cases because of larger global work size.	2017-03-08 01:31:30 -05:00
Mai Lavelle	997e345bd2	Cycles: Fix crash after failed kernel build Pointers to kernels were uninitialized leading to freeing of random memory addresses. Another reason it would be good to use smart pointers.	2017-03-08 01:31:09 -05:00
Mai Lavelle	18e50927f7	Cycles: Faster building of split kernel Simple change to make it so that only kernels that have been modified are rebuilt. Might only be useful during development.	2017-03-08 01:31:09 -05:00
Mai Lavelle	223f45818e	Cycles: Initialize rng_state for split kernel Because the split kernel can render multiple samples in parallel it is necessary to have everything initialized before rendering of any samples begins. The code that normally handles initialization of `rng_state` (`kernel_path_trace_setup()`) only does so for the first sample, which was causing artifacts in the split kernel due to uninitialized `rng_state` for some samples. Note that because the split kernel can render samples in parallel this means that the split kernel is incompatible with the LCG.	2017-03-08 01:31:09 -05:00
Mai Lavelle	cd7d5669d1	Cycles: Remove sum_all_radiance kernel This was only needed for the previous implementation of parallel samples. As we don't have that any more it can be removed. Real reason for removal tho is this: `per_sample_output_buffers` was being calculated too small and artifacts resulted. The tile buffer is already the correct size and calculating the size for `per_sample_output_buffers` is a bit difficult with the current layout of the code. As `per_sample_output_buffers` was only needed for `sum_all_radiance`, removing that kernel and writing output to the tile buffer directly fixes the artifacts.	2017-03-08 01:31:07 -05:00
Mai Lavelle	4cf501b835	Cycles: Split path initialization into own kernel This makes it easier to initialize things correctly in the data_init kernel before they are needed by path tracing.	2017-03-08 01:30:43 -05:00
Mai Lavelle	5b8f1c8d34	Cycles: Separate kernel loading time from render time	2017-03-08 01:24:55 -05:00
Mai Lavelle	b78e543af9	Cycles: Add names to buffer allocations This is to help debug and track memory usage for generic buffers. We have similar for textures already since those require a name, but for buffers the name is only for debugging purposes.	2017-03-08 01:24:55 -05:00
Mai Lavelle	817873cc83	Cycles: CUDA implementation of split kernel	2017-03-08 01:24:53 -05:00
Mai Lavelle	0892352bfe	Cycles: CPU implementation of split kernel	2017-03-08 00:52:41 -05:00
Mai Lavelle	352ee7c3ef	Cycles: Remove ccl_fetch and SOA	2017-03-08 00:52:41 -05:00
Sergey Sharybin	a87766416f	Cycles: Report device maximum allocation and detected global size	2017-03-08 00:52:41 -05:00
Mai Lavelle	365a4239c5	Cycles: Workaround for driver hangs Simple workaround for some issues we've been having with AMD drivers hanging and rendering systems unresponsive. Unfortunately this makes things a bit slower, but it's better than having to do hard reboots. Will be removed when drivers have been fixed. Define CYCLES_DISABLE_DRIVER_WORKAROUNDS to disable for testing purposes.	2017-03-08 00:52:41 -05:00
Mai Lavelle	230c00d872	Cycles: OpenCL split kernel refactor This does a few things at once: - Refactors host side split kernel logic into a new device agnostic class `DeviceSplitKernel`. - Removes tile splitting, a new work pool implementation takes its place and allows as many threads as will fit in memory regardless of tile size, which can give performance gains. - Refactors split state buffers into one buffer, as well as reduces the number of arguments passed to kernels. Means there's less code to deal with overall. - Moves kernel logic out of OpenCL kernel files so they can later be used by other device types. - Replaced OpenCL specific APIs with new generic versions - Tiles can now be seen updating during rendering	2017-03-08 00:52:41 -05:00
Mai Lavelle	520b53364c	Cycles: Add OpenCL kernel for zeroing memory buffers Transferring memory to the device was very slow and there's really no need when only zeroing a buffer.	2017-03-08 00:52:41 -05:00
Mai Lavelle	dfd6055eb0	Cycles: Add more atomic operations	2017-03-08 00:52:41 -05:00
Mai Lavelle	bc652766e8	Cycles: Expose passes size to device tasks This is needed so devices can know the size of a tile buffer before any tiles are acquired.	2017-03-08 00:52:41 -05:00
Mai Lavelle	0f56f7a811	Cycles: Allow device_memory to be used directly This is useful for when there's no host-side memory attached to the buffer	2017-03-08 00:52:41 -05:00
Sergey Sharybin	9e566b06e3	Task scheduler: Add concept of suspended pools Suspended pools allow pushing a huge number of initial tasks without any threading synchronization and hence overhead. This gives ~50% speedup of cached rigid body with file from T50027 and seems to have no negative effect in other scenes here.	2017-03-07 17:32:01 +01:00
Sergey Sharybin	347410a322	Depsgraph: Remove workarounds from depsgraph for keeping threads alive This is something that should be done in the task scheduler instead with local thread queues so we handle this in a single place.	2017-03-07 17:32:01 +01:00
Sergey Sharybin	55c2cd85f0	Task scheduler: Initial implementation of local tasks queues The idea is to allow some amount of tasks to be pushed from a working thread to its local queue, so we can acquire some work without doing a whole mutex lock. This should allow us to remove some hacks from depsgraph which were added there to keep threads alive.	2017-03-07 17:32:01 +01:00
Sergey Sharybin	2f722f1a49	Task scheduler: Use real pthread's TLS to access active thread's data This allows us to avoid TLS stored in pool which gives us advantage of using pre-allocated tasks pool for the pools created from non-main thread. Even on systems with slow pthread TLS it should not be a problem because we access it once at a pool construction time. If we want to use this more often (for example, to get rid of push_from_thread) we'll have to do much more accurate benchmark.	2017-03-07 17:32:01 +01:00
Sergey Sharybin	a07ad02156	Task scheduler: Refactor the way we store thread-specific data Basically move all thread-specific data (currently it's only task memory pool) from a dedicated array of taskScheduler to TaskThread. This way we can add more thread-specific data in the future with less of a hassle.	2017-03-07 17:32:01 +01:00
Sergey Sharybin	9522f8acf0	Task scheduler: Remove per-pool threads limit This feature was adding extra complexity to task scheduling, which required yet more variables to be modified atomically and resulted in the following issues: - More complex code to maintain, which increases risks of something going wrong when we modify the code. - Extra barriers and/or locks during task scheduling, which causes extra threading overhead. - Unable to use some other implementation (such as TBB) even for the comparison tests. Notes about other changes. There are two places where we really had to use that limit. One of them is the single threaded dependency graph. This will now construct a single-threaded scheduler at evaluation time. This shouldn't be a problem because it only happens when using debugging command line arguments and the code simply doesn't run in regular Blender operation. The code seems a bit duplicated here across old and new depsgraph, but think it's OK since the old depsgraph is already gone in 2.8 branch and I don't see where else we might want to use such a single-threaded scheduler. When/if we'll want to do so, we can move it to a centralized single-threaded scheduler in threads.c. OpenGL render was a bit more tricky to port, but basically we are using condition variables to wait for the background thread to do all the work.	2017-03-07 17:32:01 +01:00
Aaron Carlisle	35d78121f0	Fix typo in command line arg list	2017-03-07 09:07:58 -05:00
Julian Eisel	af076031d6	Update keymap presets for recent transform manipulator changes Part of T50565.	2017-03-07 11:54:40 +01:00
Julian Eisel	ca796f872e	Once more T50565: Allow using planar constraints for scale manipulator	2017-03-07 11:23:07 +01:00
Clément Foucault	15fa806160	Rigid body: fix viewport not updating on properties change.	2017-03-06 16:25:47 +01:00
raa	f1c764fd8f	Fix width calculation for split layouts	2017-03-06 16:35:56 +03:00
Sergey Sharybin	0e995e0bfe	Cycles: Fix strict -Wpedantic warnings with GCC Patch by Stefan Werner, thanks!	2017-03-06 14:18:26 +01:00
Sergey Sharybin	b498db06eb	Task scheduler: Cleanup, use BLI_assert() instead of assert()	2017-03-06 11:33:27 +01:00
Sergey Sharybin	3623f32b48	FFmpeg: Update for the deprecated API in 3.2.x Should be no functional changes.	2017-03-06 10:34:57 +01:00
Luca Rood	355ad008a2	Surface Deform Modifier: Respect object transforms at bind time This slightly changes SDef behavior, by now respecting object transforms at bind time, thus not requiring the objects to be aligned in their respective local spaces, but instead using world space.	2017-03-06 03:43:26 -03:00
Julian Eisel	80444effc6	Multi-View: Map cursor coordinates to visual coordinates When rendering multi-view in side-by-side or top-bottom mode, we squash the UI to half of its size and draw it twice on screen. That means the cursor coordinates used for UI interaction don't match what's visible on screen. This commit is a little event system hack (tm) to fix this. It has some small glitches with cursor grabbing, but nothing too bad. We'll also use it for viewport HMD support. D1350, thanks for the feedback @dfelinto!	2017-03-06 01:32:35 +01:00
Campbell Barton	e72af060ab	CMake: confine WIN32 options	2017-03-06 04:05:00 +11:00
Campbell Barton	5f98cd6360	Cleanup: typos	2017-03-05 23:36:49 +11:00
Campbell Barton	a461216885	BMesh: Add 'cut' separate mode for intersect tool It was only possible to separate all geometry from an intersection or none. Made this into an enum with a 3rd option to 'Cut', (now default) which keeps each side of the intersection separate without splitting faces in half.	2017-03-05 23:36:46 +11:00
Campbell Barton	3caeb51d7f	Fix T50855: Intersect (knife) w/o separate doesn't select	2017-03-05 22:28:16 +11:00
Jörg Müller	f75b52eca1	Fix T50843: Pitched Audio renders incorrectly in VSE There was a bug in the intended code behaviour to always seek with a pitch of 1.0 regardless of pitch/pitch animation/doppler effects. Check the bug report for a more detailed explanation of problems concerning pitch and seeking.	2017-03-05 12:19:32 +01:00
Campbell Barton	4a4d71414e	BLI_rect: add init from point functions Initialize a rectangle from point+size.	2017-03-05 20:51:23 +11:00
Luca Rood	2089a17f7e	Fix T50838: Surface Deform DM use after free issue Implemented fix suggested by @sergey in T50838.	2017-03-04 03:16:50 -03:00

66907 Commits