blender

Author	SHA1	Message	Date
Sergey Sharybin	4e6324dd59	Cycles: Guard memcpy to potentially re-allocating memory with lock Basically, make re-alloc and memcpy from the same lock, otherwise one thread might be re-allocating thread while another one is trying to copy data there. Reported by Mohamed Sakr in IRC, thanks!	2017-08-14 14:55:47 +02:00
Brecht Van Lommel	dc7fcebb33	Code cleanup: make L_transparent part of PathRadiance.	2017-08-13 01:19:07 +02:00
Brecht Van Lommel	7542282c06	Code cleanup: make DebugData part of PathRadiance.	2017-08-13 01:19:07 +02:00
Brecht Van Lommel	fce405059f	Code cleanup: make it easier to test only Sobol, CMJ or Pseudorandom.	2017-08-13 01:19:07 +02:00
Brecht Van Lommel	8f97108353	Cycles: optimize CPU split kernel data init.	2017-08-12 20:43:34 +02:00
Brecht Van Lommel	601f94a3c2	Code cleanup: remove unused Cycles random number code.	2017-08-12 20:40:38 +02:00
Brecht Van Lommel	6919393a51	Fix T52372: CUDA build error after recent changes.	2017-08-12 20:37:06 +02:00
Brecht Van Lommel	d7639d57dc	Fix T52368: OSL trace() crash after recent changes.	2017-08-12 14:32:52 +02:00
Brecht Van Lommel	85ad248c36	Code cleanup: fix warning and improve terminology.	2017-08-12 13:18:05 +02:00
Sergey Sharybin	2e25754ecd	Cycles: Clarify new argument in PathRadiance	2017-08-11 13:49:50 +02:00
Sergey Sharybin	bd069a89aa	Fix T52229: Shadow Catcher artifacts when under transparency Added some extra tirckery to avoid background being tinted dark with transparent surface. Maybe a bit hacky, but seems to work fine.	2017-08-11 13:49:50 +02:00
Brecht Van Lommel	757c24b6bc	Cycles: remove square samples option. It doesn't seem that useful in practice, was mostly added to match some other renderers but also seems to be causing user confusing and accidental long render times. So let's just keep the UI simple and remove this. Differential Revision: https://developer.blender.org/D2768	2017-08-11 01:10:56 +02:00
Brecht Van Lommel	8a7c207f0b	Cycles: change defaults for filter glossy, clamp and branched path AA. We're adding some bias by default, which now I think is the right thing to do from a usability point of view since you really need to use those options anyway to get clean renders in a practical time. Differential Revision: https://developer.blender.org/D2769	2017-08-11 01:10:50 +02:00
Brecht Van Lommel	267e75158a	Fix T52322: denoiser broken on Windows after recent changes. It's not clear why this only happened on Windows, but the code was wrong and should do a bitcast here instead of conversion.	2017-08-11 01:09:35 +02:00
Sergey Sharybin	422fddab87	Cycles: Fix instanced shadow catcher objects influencing each other	2017-08-10 09:22:33 +02:00
Sergey Sharybin	5a618ab737	Cycles: De-duplicate trace-time object visibility calculation We already have enough files to worry about in BVH builders. no need to add yet another copy-paste code which is tempting to be running out of sync.	2017-08-10 09:21:02 +02:00
Sergey Sharybin	176ad9ecdd	Cycles: Remove ulong usage This is a bit confusing, especially when one mixes OpenCL code where ulong equals to uint64_t with CPU side code where ulong is expected to be something else from the naming. This commit makes it so we use explicit name, common on all platforms.	2017-08-09 14:08:58 +02:00
Mai Lavelle	55d28e604e	Cycles: Proper fix for recent OpenCL image crash Problem was that some code checks to see if device_pointer is null or not and the new allocator wasn't even setting the pointer to anything as it tracks memory location separately. Setting the pointer to non null keeps all users of device_pointer happy.	2017-08-09 04:27:39 -04:00
Mai Lavelle	06bf34227b	Revert "Cycles: Fix crash changing image after recent OpenCL changes" This reverts commit f2809ae0a671057caa1005e2b9cc91648c33dd1f.	2017-08-09 04:24:03 -04:00
Sergey Sharybin	99c13519a1	Cycles: More fixes for Windows 32 bit - Apparently MSVC does not support compound literals in C++ (at least by the looks of it). - Not sure how opencl_device_assert was managing to set protected property of the Device class.	2017-08-08 22:32:51 +02:00
Sergey Sharybin	c961737d0f	Cycles: Fix compilation error of filter kernels on 32 bit Windows We don't enable global SSE optimizations in regular kernel, and we keep those disabled on Linux 32bit. One possible workaround would be to pass arguments by ccl_ref, but that is quite a few of code which better be done accurately.	2017-08-08 22:01:17 +02:00
Sergey Sharybin	f2809ae0a6	Cycles: Fix crash changing image after recent OpenCL changes Steps to reproduce: - Create shader Image texture -> Diffuse BSDF -> Output. Do NOT select image yet! - Start viewport render. - Select image from the ID browser of Image Texture node. Thing is: with the memory manager we always need to inform device that memory was freed.	2017-08-08 17:17:04 +02:00
Sergey Sharybin	0e57282999	Cycles: Fix compilation error without C++11 Common folks, nobody considered master a C++11 only branch. Such decision is to be done officially and will involve changes in quite a few infrastructure related areas.	2017-08-08 17:02:26 +02:00
Sergey Sharybin	19d19add1e	Cycles: Cleanup, de-duplicate function parameter list Was only needed to sue const reference on CPU. Now it is done using ccl_ref.	2017-08-08 15:27:25 +02:00
Sergey Sharybin	fd397a7d28	Cycles: Add utility macro ccl_ref It is defined to & for CPU side compilation, and defined to an empty for any GPU platform. The idea here is to use this macro instead of #ifdef block with bunch of duplicated lines just to make it so CPU code is efficient. Eventually we might switch to references on CUDA as well, but that would require some intensive testing.	2017-08-08 15:27:25 +02:00
Mai Lavelle	ec8ae4d5e9	Cycles: Pack kernel textures into buffers for OpenCL Image textures were being packed into a single buffer for OpenCL, which limited the amount of memory available for images to the size of one buffer (usually 4gb on AMD hardware). By packing textures into multiple buffers that limit is removed, while simultaneously reducing the number of buffers that need to be passed to each kernel. Benchmarks were within 2%. Fixes T51554. Differential Revision: https://developer.blender.org/D2745	2017-08-08 07:12:04 -04:00
Sergey Sharybin	451ccf7396	Cycles: Cleanup, move curve intersection functions to own file This way curve file becomes much shorter and it's also easier to write a benchmark application to check performance before/after future changes.	2017-08-07 20:53:30 +02:00
Sergey Sharybin	77a7a7f455	Cycles: Cleanup, trailign whitespace	2017-08-07 20:53:30 +02:00
Sergey Sharybin	95fe9b2617	Cycles: Cleanup, remove bvh prefix from curve functions Those are nothing to do with BVH, and can be used separately.	2017-08-07 20:53:30 +02:00
Sergey Sharybin	a4bbce8949	Cycles: Fix compilation error on NVidia OpenCL after recent refactor Still need to verify this is proper thing to do for AMD OpenCL. At least now i can compile OpenCL kernel on my laptop with sm21 card.	2017-08-07 20:52:24 +02:00
Brecht Van Lommel	fc38276d74	Fix Cycles shadow catcher objects influencing each other. Since all the shadow catchers are already assumed to be in the footage, the shadows they cast on each other are already in the footage too. So don't just let shadow catchers skip self, but all shadow catchers. Another justification is that it should not matter if the shadow catcher is modeled as one object or multiple separate objects, the resulting render should be the same. Differential Revision: https://developer.blender.org/D2763	2017-08-07 17:54:26 +02:00
Brecht Van Lommel	dc4d850d10	Fix Windows build errors with recent Cycles SIMD refactoring.	2017-08-07 17:54:26 +02:00
Sergey Sharybin	580741b317	Cycles: Cleanup, space after keyword	2017-08-07 14:47:51 +02:00
Brecht Van Lommel	ee77c1e917	Code refactor: use float4 instead of intrinsics for CPU denoise filtering. Differential Revision: https://developer.blender.org/D2764	2017-08-07 14:01:24 +02:00
Brecht Van Lommel	a24fbf3323	Code refactor: add, remove, optimize various SSE functions. * Remove some unnecessary SSE emulation defines. * Use full precision float division so we can enable it. * Add sqrt(), sqr(), fabs(), shuffle variations, mask(). * Optimize reduce_add(), select(). Differential Revision: https://developer.blender.org/D2764	2017-08-07 14:01:24 +02:00
Brecht Van Lommel	a8cc0d707e	Code refactor: split defines into separate header, changes to SSE type headers. I need to use some macros defined in util_simd.h for float3/float4, to emulate SSE4 instructions on SSE2. But due to issues with order of header includes this was not possible, this does some refactoring to make it work. Differential Revision: https://developer.blender.org/D2764	2017-08-07 14:01:24 +02:00
Brecht Van Lommel	5e4bad2c00	Cycles: remove option to disable transparent shadows globally. We already detect this automatically based on shading nodes and per shader settings, and performance of this option is ok now all devices. Differential Revision: https://developer.blender.org/D2767	2017-08-07 14:01:24 +02:00
Brecht Van Lommel	2a74f36dac	Fix Cycles CUDA adaptive megakernel build error.	2017-08-07 00:27:08 +02:00
Brecht Van Lommel	45dcd20ca9	Cycles: CUDA split performance tweaks, still far from megakernel. On Pabellon, 25.8s mega, 35.4s split before, 32.7s split after.	2017-08-05 14:32:59 +02:00
Brecht Van Lommel	cd023b6cec	Cycles: remove min bounces, modify RR to terminate less. Differential Revision: https://developer.blender.org/D2766	2017-08-04 23:11:03 +02:00
Sergey Sharybin	0d01cf4488	Cycles: Extra tweaks to performance of header expansion Two main things here: 1. Replace all unsafe for #line directive characters into a single loop, avoiding multiple iterations and multiple temporary strings created. 2. Don't merge token char by char but calculate start and end point and then copy all substring at once. This gives about 15% speedup of source processing time. At this point (with all previous commits from today) we've shrinked down compiled sources size from 108 MB down to ~5.5 MB and lowered processing time from 4.5 sec down to 0.047 sec on my laptop running Linux (this was a constant time which Blender will always spent first time loading kernel, even if we've got compiled clbin).	2017-08-03 08:07:06 +02:00
Campbell Barton	ba98f06acc	mikktspace: minor optimization Add a safe version of normalize since all uses of normalize did zero length checks, move this into a function. Also avoid unnecessary conversion. Gives minor speedup here (approx 3-5%).	2017-08-03 07:03:59 +10:00
Sergey Sharybin	f879cac032	Cycles: Avoid some expensive operations in header expansions Basically gather lines as-is during traversal, avoiding allocating memory for all the lines in headers. Brings additional performance improvement abut 20%.	2017-08-02 20:59:19 +02:00
Sergey Sharybin	a280697e77	Cycles: Support "precompiled" headers in include expansion algorithm The idea here is that it is possible to mark certain include statements as "precompiled" which means all subsequent includes of that file will be replaced with an empty string. This is a way to deal with tricky include pattern happening in single program OpenCL split kernel which was including bunch of headers about 10 times. This brings preprocessing time from ~1sec to ~0.1sec on my laptop.	2017-08-02 20:59:19 +02:00
Sergey Sharybin	4ad39964fd	Cycles: Speed up #include expansion algorithm The idea is to re-use files which were already processed. Gives about 4x speedup of processing time (~4.5sec vs 1.0sec) on my laptop for the whole OpenCL kernel. For users it will mean lower delay before OpenCL rendering might start.	2017-08-02 20:59:19 +02:00
Campbell Barton	5c963128ea	Cleanup: remove check for old GCC&PPC	2017-07-27 07:29:16 +10:00
Jeff Knox	e93804318f	Fix T51450: viewport render time keeps increasing after render is done. Reviewed By: brecht Differential Revision: https://developer.blender.org/D2747	2017-07-25 01:47:04 +02:00
James Fulop	c3c0495b30	Fix T51948: pen pressure not detected with some Wacom tablets. Generalizes current conditions, QT implements it the same way.	2017-07-24 13:54:36 +02:00
Brecht Van Lommel	9e929c911e	Fix Cycles multi scatter GGX different render results with Clang and GCC. The order of evaluation of function arguments is undefined, and the order was reversed between these compilers. This was causing regressions tests to give different results between Linux and macOS.	2017-07-23 23:25:12 +02:00
Brecht Van Lommel	e982ebd6d4	Fix T52152: allow zero roughness for Cycles principled BSDF, don't clamp.	2017-07-22 23:58:51 +02:00

1 2 3 4 5 ...

6912 Commits