blender

Author	SHA1	Message	Date
Mai Lavelle	c837bd5ea5	Cycles: Fix CUDA build error for some compilers Needed to include `util_types.h` before using `uint`.	2017-03-08 16:44:43 -05:00
Sergey Sharybin	97c4c2689f	Cycles: Make the message more obvious about which initialization failed	2017-03-08 13:57:21 +01:00
Sergey Sharybin	75cb4850f0	Cycles: Use 1-based line number for #line directives AMD CPU platform was complaining about #line 0 directives in the code.	2017-03-08 12:45:18 +01:00
Sergey Sharybin	ecfbfe478b	Cycles: Log which device kernels are being loaded for	2017-03-08 12:33:51 +01:00
Sergey Sharybin	712f7c3640	Cycles: Make it possible to access KernelGlobals from split data initialization function	2017-03-08 11:02:54 +01:00
Sergey Sharybin	ef7c36f5ed	Cycles: Cleanup, remove residue of previous split kernel data This is all in split data state array.	2017-03-08 10:26:29 +01:00
Mai Lavelle	64751552f7	Cycles: Fix indentation	2017-03-08 01:31:32 -05:00
Mai Lavelle	fe7cc94dfa	Cycles: Fix strict warning about unused variable	2017-03-08 01:31:32 -05:00
Mai Lavelle	306034790f	Cycles: Calculate size of split state buffer kernel side By calculating the size of the state buffer in the kernel rather than the host less code is needed and the size actually reflects the requested features. Will also be a little faster in some cases because of larger global work size.	2017-03-08 01:31:30 -05:00
Mai Lavelle	997e345bd2	Cycles: Fix crash after failed kernel build Pointers to kernels were uninitialized leading to freeing of random memory addresses. Another reason it would be good to use smart pointers.	2017-03-08 01:31:09 -05:00
Mai Lavelle	18e50927f7	Cycles: Faster building of split kernel Simple change to make it so that only kernels that have been modified are rebuilt. Might only be useful during development.	2017-03-08 01:31:09 -05:00
Mai Lavelle	223f45818e	Cycles: Initialize rng_state for split kernel Because the split kernel can render multiple samples in parallel it is necessary to have everything initialized before rendering of any samples begins. The code that normally handles initialization of `rng_state` (`kernel_path_trace_setup()`) only does so for the first sample, which was causing artifacts in the split kernel due to uninitialized `rng_state` for some samples. Note that because the split kernel can render samples in parallel this means that the split kernel is incompatible with the LCG.	2017-03-08 01:31:09 -05:00
Mai Lavelle	cd7d5669d1	Cycles: Remove sum_all_radiance kernel This was only needed for the previous implementation of parallel samples. As we don't have that any more it can be removed. The real reason for removal, though, is this: `per_sample_output_buffers` was being calculated too small and artifacts resulted. The tile buffer is already the correct size and calculating the size for `per_sample_output_buffers` is a bit difficult with the current layout of the code. As `per_sample_output_buffers` was only needed for `sum_all_radiance`, removing that kernel and writing output to the tile buffer directly fixes the artifacts.	2017-03-08 01:31:07 -05:00
Mai Lavelle	4cf501b835	Cycles: Split path initialization into own kernel This makes it easier to initialize things correctly in the data_init kernel before they are needed by path tracing.	2017-03-08 01:30:43 -05:00
Mai Lavelle	5b8f1c8d34	Cycles: Separate kernel loading time from render time	2017-03-08 01:24:55 -05:00
Mai Lavelle	b78e543af9	Cycles: Add names to buffer allocations This is to help debug and track memory usage for generic buffers. We already have something similar for textures since those require a name, but for buffers the name is only for debugging purposes.	2017-03-08 01:24:55 -05:00
Mai Lavelle	817873cc83	Cycles: CUDA implementation of split kernel	2017-03-08 01:24:53 -05:00
Mai Lavelle	0892352bfe	Cycles: CPU implementation of split kernel	2017-03-08 00:52:41 -05:00
Mai Lavelle	352ee7c3ef	Cycles: Remove ccl_fetch and SOA	2017-03-08 00:52:41 -05:00
Sergey Sharybin	a87766416f	Cycles: Report device maximum allocation and detected global size	2017-03-08 00:52:41 -05:00
Mai Lavelle	365a4239c5	Cycles: Workaround for driver hangs Simple workaround for some issues we've been having with AMD drivers hanging and rendering systems unresponsive. Unfortunately this makes things a bit slower, but it's better than having to do hard reboots. Will be removed when drivers have been fixed. Define CYCLES_DISABLE_DRIVER_WORKAROUNDS to disable for testing purposes.	2017-03-08 00:52:41 -05:00
Mai Lavelle	230c00d872	Cycles: OpenCL split kernel refactor This does a few things at once: - Refactors host side split kernel logic into a new device agnostic class `DeviceSplitKernel`. - Removes tile splitting; a new work pool implementation takes its place and allows as many threads as will fit in memory regardless of tile size, which can give performance gains. - Refactors split state buffers into one buffer, and reduces the number of arguments passed to kernels. Means there's less code to deal with overall. - Moves kernel logic out of OpenCL kernel files so it can later be used by other device types. - Replaces OpenCL specific APIs with new generic versions. - Tiles can now be seen updating during rendering.	2017-03-08 00:52:41 -05:00
Mai Lavelle	520b53364c	Cycles: Add OpenCL kernel for zeroing memory buffers Transferring memory to the device was very slow and there's really no need when only zeroing a buffer.	2017-03-08 00:52:41 -05:00
Mai Lavelle	dfd6055eb0	Cycles: Add more atomic operations	2017-03-08 00:52:41 -05:00
Mai Lavelle	bc652766e8	Cycles: Expose passes size to device tasks This is needed so devices can know the size of a tile buffer before any tiles are acquired.	2017-03-08 00:52:41 -05:00
Mai Lavelle	0f56f7a811	Cycles: Allow device_memory to be used directly This is useful when there's no host side memory attached to the buffer.	2017-03-08 00:52:41 -05:00
Sergey Sharybin	0e995e0bfe	Cycles: Fix strict -Wpedantic warnings with GCC Patch by Stefan Werner, thanks!	2017-03-06 14:18:26 +01:00
Sergey Sharybin	3623f32b48	FFmpeg: Update for the deprecated API in 3.2.x Should be no functional changes.	2017-03-06 10:34:57 +01:00
Jörg Müller	f75b52eca1	Fix T50843: Pitched Audio renders incorrectly in VSE There was a bug in the intended code behaviour to always seek with a pitch of 1.0 regardless of pitch/pitch animation/doppler effects. Check the bug report for a more detailed explanation of problems concerning pitch and seeking.	2017-03-05 12:19:32 +01:00
Sergey Sharybin	810d7d4694	Cycles: Fix possibly uninitialized variable Hopefully this was the reason for randomly disappearing textures in our renders.	2017-03-03 10:10:26 +01:00
Sergey Sharybin	351c9239ed	Cleanup: Use explicit unsigned int in atomics	2017-03-01 12:01:19 +01:00
Sergey Sharybin	87f236cd10	Cycles: Fix division by zero in volume code which was producing -nan	2017-02-28 17:33:06 +01:00
Aaron Carlisle	6d1ac79514	Cleanup: Grey --> Gray	2017-02-27 19:33:57 -05:00
Sergey Sharybin	5acac13eb4	Cycles: Fix compilation error on vanilla Ubuntu 16.10 Patch by @swerner, thanks!	2017-02-27 15:22:51 +01:00
Sergey Sharybin	f1b21d5960	Fix T50634: Hair Primitive as Triangles + Hair shader with a texture = crash Attributes were not resized after pushing new triangles to the mesh.	2017-02-27 15:21:14 +01:00
Sergey Sharybin	209a64111e	Fix part of T50634: Hair Primitive as Triangles + Hair shader with a texture = crash Wrong formula was used to calculate needed verts and tris to be reserved.	2017-02-27 15:21:14 +01:00
Sergey Sharybin	00ceb6d2f4	Cycles: Make it clearer that values never change by using the const qualifier	2017-02-27 15:21:14 +01:00
Sergey Sharybin	cc78690be3	Cycles: Forgot this in previous commit	2017-02-27 12:54:35 +01:00
Sergey Sharybin	238db604c5	Cycles: Add more logs about what's going on in shader optimization	2017-02-27 12:38:24 +01:00
Sergey Sharybin	845ba1a6fb	Cycles: Experiment with replacing Sharp Glossy with GGX when Filter Glossy is used The idea is to make it simpler to remove noise from scenes when some prop uses Sharp glossy closure and causes noise in certain cases. Previously Sharp Glossy was not affected by Filter Glossy at all, which was quite confusing. Here is a file which demonstrates the issue: {F417797} After applying the patch all the noise from the scene is gone. This change also solves fireflies reported in T50700. Reviewers: brecht, lukasstockner97 Differential Revision: https://developer.blender.org/D2416	2017-02-27 12:33:59 +01:00
Brecht Van Lommel	8c5826f59a	Fix T50698: Cycles baking artifacts with transparent surfaces.	2017-02-25 03:12:53 +01:00
Brecht Van Lommel	15f1072ee2	Fix build error with macOS / clang / c++11.	2017-02-25 03:12:53 +01:00
Sergey Sharybin	1e29286c8c	Cycles: Fix compilation warning with CUDA on OSX	2017-02-24 14:33:10 +01:00
Sergey Sharybin	50328b41a7	Cycles: Fix compilation error on 32bit Linux	2017-02-23 17:30:26 +01:00
Sergey Sharybin	4e12113bea	Cycles: Fix wrong render results with texture limit and half-float textures	2017-02-23 14:46:22 +01:00
Sergey Sharybin	13e075600a	Cycles: Add utility function to convert float to half Handles overflow and underflow, but not NaN/inf.	2017-02-23 14:42:06 +01:00
Sergey Sharybin	60592f6778	Fix T50748: Render Time incorrect when refreshing rendered preview in GPU mode	2017-02-23 10:51:06 +01:00
Sergey Sharybin	36c4fc1ea9	Cycles: Fix shading with autosmooth and custom normals New logic of split_faces was leaving mesh in a proper state from Blender's point of view, but Cycles wanted loop normals to be "flushed" to vertex normals. Now we do such a flush from Cycles side again, so we don't leave bad meshes behind. Thanks Bastien for assistance here!	2017-02-22 10:54:36 +01:00
Sergey Sharybin	2c30fd83f1	Cycles: Additionally report all OpenCL cflags This way we can control exact spaces and such added to the cflags which is crucial to troubleshoot certain drivers.	2017-02-22 10:06:02 +01:00
Mai Lavelle	4e9b17da4c	Cycles: Speedup by avoiding extra calculations in noise texture when unneeded Noise texture is now faster when the color socket is unused. Potential for speedup spotted by @nutel. Some performance results (render time before, after, difference): Gooseberry benchmark: 47:51.34, 45:55.57, -4%; Koro: 12:24.92, 12:18.46, -0.8%; Simple cube (Color socket): 48.53, 48.72, +0.3%; Simple cube (Fac socket): 48.74, 32.78, -32.7%; Goethe displacement: 1:21.18, 1:08.47, -15.6%; Cycles brick displacement: 3:02.38, 2:16.76, -25.0%; Large displacement scene: 23:54.12, 20:09.62, -15.6%. Reviewed By: sergey Differential Revision: https://developer.blender.org/D2513	2017-02-21 07:24:33 -05:00

6537 Commits