blender

Author	SHA1	Message	Date
Sergey Sharybin	a950af8e24	Fix T53012: Shadow catcher creates artifacts on contact area The issue was caused by light sample being evaluated to nan at some point. This is root of the cause which is to be fixed, but is very hard to trace down especially via ssh (the issue only happens on AVX2 release build). Will give it a closer look when back to my AVX2 machine. For until then this is a good check to have anyway, it corresponds to what's happening in regular radiance sum.	2017-10-06 17:27:34 +05:00
Sergey Sharybin	0d3c8d0701	Cycles: Cleanup, indentation and wrapping	2017-10-06 16:54:37 +05:00
Brecht Van Lommel	4537e85584	Fix T53001: more workarounds for crash in AMD compiler with recent drivers.	2017-10-05 17:57:58 +02:00
Brecht Van Lommel	fb99ea79f8	Code refactor: split displace/background into separate kernels, remove luma.	2017-10-05 17:57:58 +02:00
Brecht Van Lommel	49199963bf	Fix incorrect CUDA remaining time estimate after previous commit.	2017-10-04 23:25:51 +02:00
Brecht Van Lommel	6da6f8d33f	Cycles: CUDA faster rendering of small tiles, using multiple samples like OpenCL. The work size is still very conservative, and this doesn't help for progressive refine. For that we will need to render multiple tiles at the same time. But this should already help for denoising renders that require too much memory with big tiles, and just generally soften the performance dropoff with small tiles. Differential Revision: https://developer.blender.org/D2856	2017-10-04 21:58:47 +02:00
Brecht Van Lommel	77f300e2a9	Fix use of uninitialized memory in Cycles normal baking.	2017-10-04 21:11:14 +02:00
Brecht Van Lommel	5bb677e592	Code refactor: zero render buffers outside of kernel. This was originally done with the first sample in the kernel for better performance, but it doesn't work anymore with atomics. Any benefit was very minor anyway, too small to measure it seems.	2017-10-04 21:11:14 +02:00
Brecht Van Lommel	12f4538205	Code refactor: use split variance calculation for mega kernels too. There is no significant difference in denoised benchmark scenes and denoising ctests, so might as well make it all consistent.	2017-10-04 21:11:14 +02:00
Brecht Van Lommel	e3e16cecc4	Code refactor: remove rng_state buffer and compute hash on the fly. A little faster on some benchmark scenes, a little slower on others, seems about performance neutral on average and saves a little memory.	2017-10-04 21:11:14 +02:00
Brecht Van Lommel	5b7d6ea54b	Code refactor: add WorkTile struct for passing work to kernel. This makes sharing some code between mega/split in following commits a bit easier, and also paves the way for rendering multiple tiles later.	2017-10-04 21:11:14 +02:00
Brecht Van Lommel	660e8e59e7	Fix T52645, T52645: AMD OpenCL compiler crash with recent drivers. Work around the bug by reshuffling code.	2017-10-04 21:00:46 +02:00
Ray Molenkamp	57d7e5b6ee	Fix T42489 and T52936: Loading blend with minimized window results in crash or empty screen on windows. Reviewed By: @brecht , @sergey Differential Revision: http://developer.blender.org/D2866	2017-10-04 11:44:22 -06:00
Sergey Sharybin	61d5c5a64f	Fix T52981: 2D Curve shapes do not render untill extruded Regression since 9298c53.	2017-10-03 15:29:39 +05:00
Brecht Van Lommel	f55735e533	CMake: support CUDA 9 toolkit, and automatically disable sm_2x binaries. Fermi cards (GTX 4xx and 5xx) are no longer supported with this version, so we can keep supporting both CUDA 8 and 9 for a while.	2017-10-01 14:14:53 +02:00
Brecht Van Lommel	9298c53e4c	Fix T52943: don't export curves objects with no faces to Cycles. Also skip any objects with zero ray visibility and meshes with zero faces.	2017-09-29 14:54:34 +02:00
Brecht Van Lommel	d2bbd41b4e	Fix Cycles OpenCL compiler error after recent changes.	2017-09-29 14:54:10 +02:00
Campbell Barton	5a1954a5cb	Drop platform support for Solaris & AIX These platforms didn't see maintenance in years. This commit just removes ifdef's & cmake check.	2017-09-29 19:16:34 +10:00
Brecht Van Lommel	c10ac1bb5c	macOS: officially upgrade to 10.9 libraries from lib/darwin. This removes a bunch of code that is no longer needed, and running "make update" will now automatically download the new libraries. Differential Revision: https://developer.blender.org/D2861	2017-09-28 20:53:06 +02:00
Kim Christensen	2a36ee16c1	Fix T52574: make Cycles rendered tile counter more clear. Differential Revision: https://developer.blender.org/D2853	2017-09-28 15:18:53 +02:00
Brecht Van Lommel	400e6f37b8	Cycles: reduce subsurface stack memory usage. This is done by storing only a subset of PathRadiance, and by storing direct light immediately in the main PathRadiance. Saves about 10% of CUDA stack memory, and simplifies subsurface indirect ray code.	2017-09-28 15:18:43 +02:00
Brecht Van Lommel	88520dd5b6	Code refactor: simplify CUDA context push/pop. Makes it possible to call a function like mem_alloc() when the context is already active. Also fixes some missing pops in case of errors.	2017-09-27 13:43:21 +02:00
Sergey Sharybin	0d4e519b74	OpenVDB: Fix compilation error against OpenVDB 4 One crucial thing here: OpenVDB shoudl be compiled WITHOUT OPENVDB_ENABLE_3_ABI_COMPATIBLE flag. This is how OpenVDB's Makefile is configured and it's not really possible to detect this for a compiled library. If we ever want to support that option, we need to add extra CMake argument and use old version 3 API everywhere.	2017-09-25 14:44:17 +05:00
Bastien Montagne	1d8aebaa09	Add an 'atomic cas' wrapper for pointers. Avoids having to repeat obfuscating castings everywhere...	2017-09-25 10:40:50 +02:00
Sergey Sharybin	cb6f07f59e	Cycles: Cleanup, indentation	2017-09-25 11:15:54 +05:00
Sergey Sharybin	c0480bc972	Cycles: Fix compilation error of OpenCL megakernel on Apple	2017-09-23 17:07:19 +05:00
Sergey Sharybin	b460b8fb4a	Cycles: Fix compilation error of megakernel on NVidia device It is more readable to explicitly compare to NULL anyway.	2017-09-23 17:03:02 +05:00
Aaron Carlisle	efd5e3c254	Remove quicktime support It has been deprecated since at least macOS 10.9 and fully removed in 10.12. I am unsure if we should remove it only in 2.8. But you cannot build blender with it supported when using a modern xcode version anyway so I would tend towards just removing it also for 2.79 if that ever happens. Reviewers: mont29, dfelinto, juicyfruit, brecht Reviewed By: mont29, brecht Subscribers: Blendify, brecht Maniphest Tasks: T52807 Differential Revision: https://developer.blender.org/D2333	2017-09-22 16:40:05 -04:00
Brecht Van Lommel	07ec0effb6	Code cleanup: simplify kernel side work stealing code.	2017-09-21 22:29:18 +02:00
Stefan Werner	ee30a4381f	Added extra "const" to satisfy the strict clang version in Xcode 9	2017-09-20 21:47:45 +02:00
Brecht Van Lommel	18a353dd24	Fix T52368: Cycles OSL trace() failing on Windows 32 bit.	2017-09-20 19:38:08 +02:00
Brecht Van Lommel	14223357e5	Fix T52853: harmless Cycles test failure in debug mode.	2017-09-20 19:38:08 +02:00
Brecht Van Lommel	90d4b823d7	Cycles: use defensive sampling for picking BSDFs and BSSRDFs. For the first bounce we now give each BSDF or BSSRDF a minimum sample weight, which helps reduce noise for a typical case where you have a glossy BSDF with a small weight due to Fresnel, but not necessarily small contribution relative to a diffuse or transmission BSDF below. We can probably find a better heuristic that also enables this on further bounces, for example when looking through a perfect mirror, but I wasn't able to find a robust one so far.	2017-09-20 19:38:08 +02:00
Brecht Van Lommel	095a01a73a	Cycles: slightly improve BSDF sample stratification for path tracing. Similar to what we did for area lights previously, this should help preserve stratification when using multiple BSDFs in theory. Improvements are not easily noticeable in practice though, because the number of BSDFs is usually low. Still nice to eliminate one sampling dimension.	2017-09-20 19:38:08 +02:00
Brecht Van Lommel	b3afc8917c	Code cleanup: refactor BSSRDF closure sampling, for next commit.	2017-09-20 19:38:08 +02:00
Brecht Van Lommel	d029399e6b	Code cleanup: remove SOBOL_SKIP hack, seems no longer needed.	2017-09-20 19:38:08 +02:00
Brecht Van Lommel	d750d182e5	Code cleanup: remove hack to avoid seeing transparent objects in noise. Previously the Sobol pattern suffered from some correlation issues that made the outline of objects like a smoke domain visible. This helps simplify the code and also makes some other optimizations possible.	2017-09-20 19:38:08 +02:00
Sergey Sharybin	3241905f40	Fix T52818: Tangent space calculation is really slow for high-density mesh with degenerated topology Now we replace O(N^2) computational complexity with O(N) extra memory penalty. Memory is much cheaper than CPU time. Keep in mind, memory penalty is like 4 megabytes per 1M vertices.	2017-09-19 17:50:09 +05:00
Sergey Sharybin	2dab6f499c	Mikkspace: Cleanup, reduce indentation level	2017-09-19 17:50:09 +05:00
Carlo Andreacchio	ab9079f459	Fix Cycles adaptive compile without volumes broken after recent changes. Differential Revision: https://developer.blender.org/D2847	2017-09-18 12:52:32 +02:00
Hristo Gueorguiev	6798a061b7	Cycles: Fix compilation error with OpenCL split kernel	2017-09-16 12:33:03 +02:00
Sergey Sharybin	7aafa32c09	Fix T51416: Blender Crashes while moving Sliders The issue here was that removing datablock from main database will poke editors update, which includes buttons context to free users of texture. Since Cycles will free datablocks from job thread, it might crash Blender since main thread might be in the middle of drawing. Solved by exposing extra arguments to bpy.data.foo.remove() which indicates whether we want to perform ID user count and interface updates. While scripts shouldn't be using those normally, this is the only way to allow Cycles to skip interface update when removing datablock. Reviewers: mont29 Reviewed By: mont29 Differential Revision: https://developer.blender.org/D2840	2017-09-14 17:03:40 +05:00
Brecht Van Lommel	32449e1b21	Code cleanup: store branch factor in PathState.	2017-09-13 15:24:14 +02:00
Brecht Van Lommel	9e258fc641	Code cleanup: avoid used of uninitialized value in case of precision issue.	2017-09-13 15:24:14 +02:00
Brecht Van Lommel	37d9e65ddf	Code cleanup: abstract shadow catcher logic more into accumulation code.	2017-09-13 15:24:14 +02:00
Brecht Van Lommel	f77cdd1d59	Code cleanup: deduplicate some branched and split kernel code. Benchmarks peformance on GTX 1080 and RX 480 on Linux is the same for bmw27, classroom, pabellon, and about 2% faster on fishy_cat and koro.	2017-09-13 15:24:14 +02:00
Brecht Van Lommel	c4c450045d	Code cleanup: tweak inlining for 2% better CUDA performance with hair.	2017-09-13 15:24:14 +02:00
Mathieu Menuet	659ba012b0	Cycles: change AO bounces approximation to do more glossy and transmission. Rather than treating all ray types equally, we now always render 1 glossy bounce and unlimited transmission bounces. This makes it possible to get good looking results with low AO bounces settings, making it useful to speed up interior renders for example. Reviewed By: brecht Differential Revision: https://developer.blender.org/D2818	2017-09-12 15:37:35 +02:00
Brecht Van Lommel	de6ecc82ed	Fix rare firefly in volume equiangular sampling when sampling short distance.	2017-09-12 12:50:44 +02:00
Brecht Van Lommel	cd6c9e9e5f	Cycles: improve sample stratification on area lights for path tracing. Previously we used a 1D sequence to select a light, and another 2D sequence to sample a point on the light. For multiple lights this meant each light would get a random subset of a 2D stratified sequence, which is not guaranteed to be stratified anymore. Now we use only a 2D sequence, split into segments along the X axis, one for each light. The samples that fall within a segment then each are a stratified sequence, at least in the limit. So for example for two lights, we split up the unit square into two segments [0,0.5[ x [0,1[ and [0.5,1[ x [0,1[. This doesn't make much difference in most scenes, mainly helps if you have a few large area lights or some types of HDR backgrounds.	2017-09-12 12:45:29 +02:00
Brecht Van Lommel	d454a44e96	Fix Cycles bug in RR termination, probability should never be > 1.0. This causes render differences in some scenes, for example fishy_cat and pabellon scenes render brighter in a few spots. This is an old bug, not due to recent RR changes.	2017-09-12 12:43:26 +02:00
Sergey Sharybin	467d92b8f1	Cycles: Tweaks to avoid compilation error of megakernel Also moved code out of deep-inside ifdef block, otherwise it was quite confusing.	2017-09-12 13:33:46 +05:00
Sergey Sharybin	ae41a08288	Cycles: Attempt to work around compilation of sm_20 and sm_21 Disabled forceinline for those architectures, which seems to be compiling successfully more often. There might be ~3% slowdown based on quick tests, but better be rendering something rather than failing to compile kernels again and again. Those architectures will be doomed for abandon once we'll switch to toolkit 9.	2017-09-08 18:37:54 +02:00
Brecht Van Lommel	ce1f2e271d	Cycles: disable fast math flags, only use a subset. Empty BVH nodes are set to NaN which must be preserved all the way to the tnear <= tfar test which can then give false for empty nodes. This needs strict semantices and careful argument ordering for min() and max(), so the second argument is used if either of the arguments is NaN. Fixes T52635: crash in BVH traversal with SSE4.1. Differential Revision: https://developer.blender.org/D2828	2017-09-08 15:12:37 +02:00
Brecht Van Lommel	c10ea88420	Fix T52660: CUDA volume texture rendering not working on Fermi GPUs.	2017-09-06 18:12:45 +02:00
Brecht Van Lommel	2d407fc288	Fix T52661: mesh light shader using backfacing not working, after new sampling.	2017-09-06 13:51:48 +02:00
Brecht Van Lommel	dd8016f708	Fix T52652: Cycles image box mapping has flipped textures. This breaks backwards compatibility some in that 3 sides will be mapped differently now, but difficult to avoid and can be considered a bugfix.	2017-09-06 13:51:45 +02:00
Sergey Sharybin	750e38a526	Cycles: Fix compilation error with CUDA after recent changes	2017-09-05 16:52:45 +02:00
Sergey Sharybin	f01e43fac3	Fix T52433: Volume Absorption color tint Need to exit the volume stack when shadow ray laves the medium. Thanks Brecht for review and help in troubleshooting!	2017-09-05 15:48:34 +02:00
Sergey Sharybin	b0bbb5f34f	Cycles: Cleanup, style	2017-09-05 12:43:02 +02:00
Sergey Sharybin	885c0a5f90	Cycles: Fix compilation warning	2017-09-04 13:28:15 +02:00
Sergey Sharybin	33249f6987	Fix T52533: Blender shuts down when rendering duplicated smoke domain	2017-09-04 13:14:54 +02:00
Joerg Mueller	8d207cdc3b	Fix T52472: VSE Audio Volume not set immediately Audio mixing is done with volume interpolation. A new handle started at volume 1, now starting at volume 0 for a smooth fade in.	2017-09-01 12:27:21 +02:00
Campbell Barton	18b7f05480	Cycles: follow strict class naming convention	2017-09-01 16:08:25 +10:00
Sergey Sharybin	018137f762	Cycles: Cleanup, indentation and trailing whitespace	2017-08-31 14:47:49 +02:00
Sergey Sharybin	8b9e1707a1	Cycles: Fix typo in comment	2017-08-31 13:24:32 +02:00
Stefan Werner	68dfa0f1b7	Fixing T52477 - switching from custom ray/triangle intersection code to the one from util_intersection.h. This fixes the bug and makes the code more readable and maintainable.	2017-08-30 11:48:49 +02:00
Mai Lavelle	124ffb45a6	Cycles: Fix build with networking enabled	2017-08-30 00:19:44 -04:00
Brecht Van Lommel	1457e5ea73	Fix Cycles Windows render errors with BVH2 CPU rendering. One problem is that it was always using __mm_blendv_ps emulation even if the instruction was supported. The other that the emulation function was wrong. Thanks a lot to Ray Molenkamp for tracking this one down.	2017-08-29 22:55:35 +02:00
Sergey Sharybin	f4b3678646	Atomics: Use system headers directly, without bad level dependency to BLI This will make it easier to re-use library as-is in other projects, such as Cycles standalone repo for example.	2017-08-28 11:06:15 +02:00
Sergey Sharybin	5121dacf9d	Fix for fix (tm): Residue of the debug code	2017-08-25 21:33:44 +02:00
Sergey Sharybin	90110d3732	Fix mistake in previous tangent space optimization	2017-08-25 21:30:20 +02:00
Sergey Sharybin	12f627cd9f	Cycles: Cleanup, naming of variable Always use b_ prefix for C++ RNA data.	2017-08-25 21:30:20 +02:00
Sergey Sharybin	ee61a97632	Cycles: Add assert to catch possibly wrong logic	2017-08-25 21:30:20 +02:00
Lukas Stockner	f9a3d01452	Cycles: Mark pixels with negative values as outliers If a pixel has negative components, something already went wrong, so the best option is to just ignore it. Should be good for 2.79.	2017-08-25 17:46:15 +02:00
Sergey Sharybin	d79fa8dc4d	Another optimization of tangent space calculation Don't use quick sort for small arrays, bubble sort works way faster for small arrays due to cache coherency. This is what qsort() from libc is doing actually. We can also experiment unrolling some extra small arrays, for example 3 and 4 element arrays. This reduces tangent space calculation for dragon from 3.1sec to 2.9sec.	2017-08-25 14:54:44 +02:00
Sergey Sharybin	49717d4971	Optimize tangent space calculation by inlining functions Brings tangent space calculation from 4.6sec to 3.1sec for dragon model in BI. Cycles is also somewhat faster, but it has other bottlenecks. Funny thing, using simple `static inline` already gives a lot of speedup here. That's just answering question whether it's OK to leave decision on what to inline up to a compiler..	2017-08-25 14:50:04 +02:00
Sergey Sharybin	90299e4216	Cycles: Add utility function to query current value of scoped timer	2017-08-25 14:27:34 +02:00
Sergey Sharybin	12d527f327	Cycles: Correct logging of sued CPU intrisics	2017-08-25 14:27:34 +02:00
Sergey Sharybin	dfae3de6bd	Cycles: Fix stack overflow during traversal caused by floating overflow Would be nice to be able to catch this with assert as well, will see what would be the best way to do this/.\ Need to verify with Mai that this solves crash for her and maybe consider porting this to 2.79.	2017-08-25 14:27:34 +02:00
Sergey Sharybin	436d1b4e90	Cycles: FIx issue with -0 being considered a non-finite value	2017-08-24 14:32:56 +02:00
Brecht Van Lommel	76b74a93a8	Fix Cycles CUDA transparent shadow error after recent fix in c22b52c. Fishy cat benchmark was rendering with wrong shadows. Cause is unclear, adding printf or rearranging code seems to avoid this issue, possibly a compiler bug. This reverts the fix and solves the OSL bug elsewhere.	2017-08-24 03:43:02 +02:00
Brecht Van Lommel	b85d36d811	Code cleanup: remove shader context. This was needed when we accessed OSL closure memory after shader evaluation, which could get overwritten by another shader evaluation. But all closures are immediatley converted to ShaderClosure now, so no longer needed.	2017-08-24 03:43:02 +02:00
Mai Lavelle	579edb1510	Cycles: Add maximum depth stat to bvh builder	2017-08-23 06:54:26 -04:00
Mai Lavelle	2540741dee	Fix implementation of atomic update max and move to a central location While unlikely to have had any serious effects because of limited use, the previous implementation was not actually atomic due to a data race and incorrectly coded CAS loop. We also had duplicates of this code in a few places, it's now been moved to a single location with all other atomic operations.	2017-08-23 06:54:25 -04:00
Sergey Sharybin	5c60721c9e	Fix T51805: Overlapping volumes renders incorrect on AMD GPU We need to make sure we can store all volume closures for all objects in volume stack. This is a bit tricky to detect what would be the "nestness" level of volumes so for now use maximum possible stack depth. Might cause some slowdown, but better to give reliable render output than to fail quickly. Should be safe for 2.79 after extra eyes.	2017-08-23 12:35:23 +02:00
Brecht Van Lommel	049932c4c3	Fix panorama render crash with split kernel, due to incorrect buffer pointer. Also some refactoring to clarify variable usage scope.	2017-08-22 00:41:07 +02:00
Brecht Van Lommel	296d74c4b1	Cycles: reorganize Performance panel layout, move viewport BVH type to debug.	2017-08-21 19:05:17 +02:00
Brecht Van Lommel	43a6cf1504	Cycles: attempt to recover from crashing CUDA/OpenCL drivers on Windows. I don't know if this will actually work, needs testing. Ref T52064.	2017-08-20 23:18:25 +02:00
Brecht Van Lommel	41e6068c76	Revert "Cycles: remove square samples option." This reverts commit 757c24b6bceaeeae95f743b72b6a7040880a0ebf. We'll revisit this when doing deeper sampling changes.	2017-08-20 23:46:05 +02:00
Brecht Van Lommel	1d1ddd48db	Fix T52470: cycles OpenCL hair rendering not working after recent changes.	2017-08-20 23:32:20 +02:00
Brecht Van Lommel	ce0fce2207	Code cleanup: deduplicate some bsdf node methods.	2017-08-20 17:37:22 +02:00
Brecht Van Lommel	b5f8063fb9	Cycles: support baking normals plugged into BSDFs, averaged with closure weight.	2017-08-20 16:51:53 +02:00
Brecht Van Lommel	0b07c2c8a2	Code cleanup: remove copy of shader graph for bump, no longer needed.	2017-08-20 14:27:51 +02:00
Brecht Van Lommel	c22b52cd36	Fix T52452: OSL trace broken after shadow catcher recent changes. We should only early out with any hit in BVH traversal if the only visibility bits used are opaque shadow. Not when opaque shadow is one of multiple bits.	2017-08-19 18:14:16 +02:00
Brecht Van Lommel	cfa8b762e2	Code cleanup: move rng into path state. Also pass by value and don't write back now that it is just a hash for seeding and no longer an LCG state. Together this makes CUDA a tiny bit faster in my tests, but mainly simplifies code.	2017-08-19 18:14:16 +02:00
Brecht Van Lommel	4d428d14af	Fix T52443: Cycles OpenCL build error after recent mesh lights changes.	2017-08-19 01:02:55 +02:00
Stefan Werner	7a4696197d	Cycles: Fix for a division by zero that could happen with solid angle triangle light sampling	2017-08-17 15:07:59 +02:00
Stefan Werner	8141eac2f8	Improved triangle sampling for mesh lights This implements Arvo's "Stratified sampling of spherical triangles". Similar to how we sample rectangular area lights, this is sampling triangles over their solid angle. It does significantly improve sampling close to the triangle, but doesn't do much for more distant triangles. So I added a simple heuristic to switch between the two methods. Unfortunately, I expect this to add render time in any case, even when it does not make any difference whatsoever. It'll take some benchmarking with various scenes and hardware to estimate how severe the impact is and if it is worth the change. Reviewers: #cycles, brecht Reviewed By: #cycles, brecht Subscribers: Vega-core, brecht, SteffenD Tags: #cycles Differential Revision: https://developer.blender.org/D2730	2017-08-17 12:44:32 +02:00
Lukas Stockner	5492d2cb67	Cycles: Calculate correct remaining time when using a larger pixel size	2017-08-17 02:00:44 +02:00
Lukas Stockner	66c1b23aa1	Cycles/BI: Add a pixel size option for speeding up viewport rendering This patch adds "Pixel Size" to the performance options, which allows to render in a smaller resolution, which is especially useful for displays with high DPI. Reviewers: Severin, dingto, sergey, brecht Reviewed By: brecht Subscribers: Severin, venomgfx, eyecandy, brecht Differential Revision: https://developer.blender.org/D1619	2017-08-15 01:22:40 +02:00
Stefan Werner	86eb8980d3	Cycles: Fixed broken camera motion blur when motion was not set to center on frame Reviewers: #cycles, sergey Reviewed By: #cycles, sergey Subscribers: sergey Differential Revision: https://developer.blender.org/D2787	2017-08-14 20:24:30 +02:00
Sergey Sharybin	4e6324dd59	Cycles: Guard memcpy to potentially re-allocating memory with lock Basically, make re-alloc and memcpy from the same lock, otherwise one thread might be re-allocating thread while another one is trying to copy data there. Reported by Mohamed Sakr in IRC, thanks!	2017-08-14 14:55:47 +02:00
Brecht Van Lommel	dc7fcebb33	Code cleanup: make L_transparent part of PathRadiance.	2017-08-13 01:19:07 +02:00
Brecht Van Lommel	7542282c06	Code cleanup: make DebugData part of PathRadiance.	2017-08-13 01:19:07 +02:00
Brecht Van Lommel	fce405059f	Code cleanup: make it easier to test only Sobol, CMJ or Pseudorandom.	2017-08-13 01:19:07 +02:00
Brecht Van Lommel	8f97108353	Cycles: optimize CPU split kernel data init.	2017-08-12 20:43:34 +02:00
Brecht Van Lommel	601f94a3c2	Code cleanup: remove unused Cycles random number code.	2017-08-12 20:40:38 +02:00
Brecht Van Lommel	6919393a51	Fix T52372: CUDA build error after recent changes.	2017-08-12 20:37:06 +02:00
Brecht Van Lommel	d7639d57dc	Fix T52368: OSL trace() crash after recent changes.	2017-08-12 14:32:52 +02:00
Brecht Van Lommel	85ad248c36	Code cleanup: fix warning and improve terminology.	2017-08-12 13:18:05 +02:00
Sergey Sharybin	2e25754ecd	Cycles: Clarify new argument in PathRadiance	2017-08-11 13:49:50 +02:00
Sergey Sharybin	bd069a89aa	Fix T52229: Shadow Catcher artifacts when under transparency Added some extra tirckery to avoid background being tinted dark with transparent surface. Maybe a bit hacky, but seems to work fine.	2017-08-11 13:49:50 +02:00
Brecht Van Lommel	757c24b6bc	Cycles: remove square samples option. It doesn't seem that useful in practice, was mostly added to match some other renderers but also seems to be causing user confusing and accidental long render times. So let's just keep the UI simple and remove this. Differential Revision: https://developer.blender.org/D2768	2017-08-11 01:10:56 +02:00
Brecht Van Lommel	8a7c207f0b	Cycles: change defaults for filter glossy, clamp and branched path AA. We're adding some bias by default, which now I think is the right thing to do from a usability point of view since you really need to use those options anyway to get clean renders in a practical time. Differential Revision: https://developer.blender.org/D2769	2017-08-11 01:10:50 +02:00
Brecht Van Lommel	267e75158a	Fix T52322: denoiser broken on Windows after recent changes. It's not clear why this only happened on Windows, but the code was wrong and should do a bitcast here instead of conversion.	2017-08-11 01:09:35 +02:00
Sergey Sharybin	422fddab87	Cycles: Fix instanced shadow catcher objects influencing each other	2017-08-10 09:22:33 +02:00
Sergey Sharybin	5a618ab737	Cycles: De-duplicate trace-time object visibility calculation We already have enough files to worry about in BVH builders. no need to add yet another copy-paste code which is tempting to be running out of sync.	2017-08-10 09:21:02 +02:00
Sergey Sharybin	176ad9ecdd	Cycles: Remove ulong usage This is a bit confusing, especially when one mixes OpenCL code where ulong equals to uint64_t with CPU side code where ulong is expected to be something else from the naming. This commit makes it so we use explicit name, common on all platforms.	2017-08-09 14:08:58 +02:00
Mai Lavelle	55d28e604e	Cycles: Proper fix for recent OpenCL image crash Problem was that some code checks to see if device_pointer is null or not and the new allocator wasn't even setting the pointer to anything as it tracks memory location separately. Setting the pointer to non null keeps all users of device_pointer happy.	2017-08-09 04:27:39 -04:00
Mai Lavelle	06bf34227b	Revert "Cycles: Fix crash changing image after recent OpenCL changes" This reverts commit f2809ae0a671057caa1005e2b9cc91648c33dd1f.	2017-08-09 04:24:03 -04:00
Sergey Sharybin	99c13519a1	Cycles: More fixes for Windows 32 bit - Apparently MSVC does not support compound literals in C++ (at least by the looks of it). - Not sure how opencl_device_assert was managing to set protected property of the Device class.	2017-08-08 22:32:51 +02:00
Sergey Sharybin	c961737d0f	Cycles: Fix compilation error of filter kernels on 32 bit Windows We don't enable global SSE optimizations in regular kernel, and we keep those disabled on Linux 32bit. One possible workaround would be to pass arguments by ccl_ref, but that is quite a few of code which better be done accurately.	2017-08-08 22:01:17 +02:00
Sergey Sharybin	f2809ae0a6	Cycles: Fix crash changing image after recent OpenCL changes Steps to reproduce: - Create shader Image texture -> Diffuse BSDF -> Output. Do NOT select image yet! - Start viewport render. - Select image from the ID browser of Image Texture node. Thing is: with the memory manager we always need to inform device that memory was freed.	2017-08-08 17:17:04 +02:00
Sergey Sharybin	0e57282999	Cycles: Fix compilation error without C++11 Common folks, nobody considered master a C++11 only branch. Such decision is to be done officially and will involve changes in quite a few infrastructure related areas.	2017-08-08 17:02:26 +02:00
Sergey Sharybin	19d19add1e	Cycles: Cleanup, de-duplicate function parameter list Was only needed to sue const reference on CPU. Now it is done using ccl_ref.	2017-08-08 15:27:25 +02:00
Sergey Sharybin	fd397a7d28	Cycles: Add utility macro ccl_ref It is defined to & for CPU side compilation, and defined to an empty for any GPU platform. The idea here is to use this macro instead of #ifdef block with bunch of duplicated lines just to make it so CPU code is efficient. Eventually we might switch to references on CUDA as well, but that would require some intensive testing.	2017-08-08 15:27:25 +02:00
Mai Lavelle	ec8ae4d5e9	Cycles: Pack kernel textures into buffers for OpenCL Image textures were being packed into a single buffer for OpenCL, which limited the amount of memory available for images to the size of one buffer (usually 4gb on AMD hardware). By packing textures into multiple buffers that limit is removed, while simultaneously reducing the number of buffers that need to be passed to each kernel. Benchmarks were within 2%. Fixes T51554. Differential Revision: https://developer.blender.org/D2745	2017-08-08 07:12:04 -04:00
Sergey Sharybin	451ccf7396	Cycles: Cleanup, move curve intersection functions to own file This way curve file becomes much shorter and it's also easier to write a benchmark application to check performance before/after future changes.	2017-08-07 20:53:30 +02:00
Sergey Sharybin	77a7a7f455	Cycles: Cleanup, trailign whitespace	2017-08-07 20:53:30 +02:00
Sergey Sharybin	95fe9b2617	Cycles: Cleanup, remove bvh prefix from curve functions Those are nothing to do with BVH, and can be used separately.	2017-08-07 20:53:30 +02:00
Sergey Sharybin	a4bbce8949	Cycles: Fix compilation error on NVidia OpenCL after recent refactor Still need to verify this is proper thing to do for AMD OpenCL. At least now i can compile OpenCL kernel on my laptop with sm21 card.	2017-08-07 20:52:24 +02:00
Brecht Van Lommel	fc38276d74	Fix Cycles shadow catcher objects influencing each other. Since all the shadow catchers are already assumed to be in the footage, the shadows they cast on each other are already in the footage too. So don't just let shadow catchers skip self, but all shadow catchers. Another justification is that it should not matter if the shadow catcher is modeled as one object or multiple separate objects, the resulting render should be the same. Differential Revision: https://developer.blender.org/D2763	2017-08-07 17:54:26 +02:00
Brecht Van Lommel	dc4d850d10	Fix Windows build errors with recent Cycles SIMD refactoring.	2017-08-07 17:54:26 +02:00
Sergey Sharybin	580741b317	Cycles: Cleanup, space after keyword	2017-08-07 14:47:51 +02:00
Brecht Van Lommel	ee77c1e917	Code refactor: use float4 instead of intrinsics for CPU denoise filtering. Differential Revision: https://developer.blender.org/D2764	2017-08-07 14:01:24 +02:00
Brecht Van Lommel	a24fbf3323	Code refactor: add, remove, optimize various SSE functions. * Remove some unnecessary SSE emulation defines. * Use full precision float division so we can enable it. * Add sqrt(), sqr(), fabs(), shuffle variations, mask(). * Optimize reduce_add(), select(). Differential Revision: https://developer.blender.org/D2764	2017-08-07 14:01:24 +02:00
Brecht Van Lommel	a8cc0d707e	Code refactor: split defines into separate header, changes to SSE type headers. I need to use some macros defined in util_simd.h for float3/float4, to emulate SSE4 instructions on SSE2. But due to issues with order of header includes this was not possible, this does some refactoring to make it work. Differential Revision: https://developer.blender.org/D2764	2017-08-07 14:01:24 +02:00
Brecht Van Lommel	5e4bad2c00	Cycles: remove option to disable transparent shadows globally. We already detect this automatically based on shading nodes and per shader settings, and performance of this option is ok now all devices. Differential Revision: https://developer.blender.org/D2767	2017-08-07 14:01:24 +02:00
Brecht Van Lommel	2a74f36dac	Fix Cycles CUDA adaptive megakernel build error.	2017-08-07 00:27:08 +02:00
Brecht Van Lommel	45dcd20ca9	Cycles: CUDA split performance tweaks, still far from megakernel. On Pabellon, 25.8s mega, 35.4s split before, 32.7s split after.	2017-08-05 14:32:59 +02:00
Brecht Van Lommel	cd023b6cec	Cycles: remove min bounces, modify RR to terminate less. Differential Revision: https://developer.blender.org/D2766	2017-08-04 23:11:03 +02:00
Sergey Sharybin	0d01cf4488	Cycles: Extra tweaks to performance of header expansion Two main things here: 1. Replace all unsafe for #line directive characters into a single loop, avoiding multiple iterations and multiple temporary strings created. 2. Don't merge token char by char but calculate start and end point and then copy all substring at once. This gives about 15% speedup of source processing time. At this point (with all previous commits from today) we've shrinked down compiled sources size from 108 MB down to ~5.5 MB and lowered processing time from 4.5 sec down to 0.047 sec on my laptop running Linux (this was a constant time which Blender will always spent first time loading kernel, even if we've got compiled clbin).	2017-08-03 08:07:06 +02:00
Campbell Barton	ba98f06acc	mikktspace: minor optimization Add a safe version of normalize since all uses of normalize did zero length checks, move this into a function. Also avoid unnecessary conversion. Gives minor speedup here (approx 3-5%).	2017-08-03 07:03:59 +10:00
Sergey Sharybin	f879cac032	Cycles: Avoid some expensive operations in header expansions Basically gather lines as-is during traversal, avoiding allocating memory for all the lines in headers. Brings additional performance improvement abut 20%.	2017-08-02 20:59:19 +02:00
Sergey Sharybin	a280697e77	Cycles: Support "precompiled" headers in include expansion algorithm The idea here is that it is possible to mark certain include statements as "precompiled" which means all subsequent includes of that file will be replaced with an empty string. This is a way to deal with tricky include pattern happening in single program OpenCL split kernel which was including bunch of headers about 10 times. This brings preprocessing time from ~1sec to ~0.1sec on my laptop.	2017-08-02 20:59:19 +02:00
Sergey Sharybin	4ad39964fd	Cycles: Speed up #include expansion algorithm The idea is to re-use files which were already processed. Gives about 4x speedup of processing time (~4.5sec vs 1.0sec) on my laptop for the whole OpenCL kernel. For users it will mean lower delay before OpenCL rendering might start.	2017-08-02 20:59:19 +02:00
Campbell Barton	5c963128ea	Cleanup: remove check for old GCC&PPC	2017-07-27 07:29:16 +10:00
Jeff Knox	e93804318f	Fix T51450: viewport render time keeps increasing after render is done. Reviewed By: brecht Differential Revision: https://developer.blender.org/D2747	2017-07-25 01:47:04 +02:00
James Fulop	c3c0495b30	Fix T51948: pen pressure not detected with some Wacom tablets. Generalizes current conditions, QT implements it the same way.	2017-07-24 13:54:36 +02:00
Brecht Van Lommel	9e929c911e	Fix Cycles multi scatter GGX different render results with Clang and GCC. The order of evaluation of function arguments is undefined, and the order was reversed between these compilers. This was causing regressions tests to give different results between Linux and macOS.	2017-07-23 23:25:12 +02:00
Brecht Van Lommel	e982ebd6d4	Fix T52152: allow zero roughness for Cycles principled BSDF, don't clamp.	2017-07-22 23:58:51 +02:00
Brecht Van Lommel	ec831ee7d1	Fix Cycles denoising NaNs with a 1 sample renders. This was causing different render results with different compilers. We can't do much useful with 1 sample, but better for debugging.	2017-07-22 23:58:51 +02:00
Brecht Van Lommel	b4528d8897	Fix use of uninitialized value in Cycles, probably did not cause a bug.	2017-07-22 21:29:10 +02:00
Brecht Van Lommel	db8bc1d982	Fix a few harmless maybe uninitialized warnings with GCC 5.4. GCC seems to detect uninitialized into function calls now, but then isn't always smart enough to see that it is actually initialized. Disabling this warning entirely seems a bit too much, so initialize a bit more now.	2017-07-21 00:54:58 +02:00
Brecht Van Lommel	2b132fc3f7	Fix T52135: Cycles should not keep generated/packed images in memory after render.	2017-07-20 23:47:05 +02:00
Brecht Van Lommel	a4cd7b7297	Fix potential memory leak in Cycles loading of packed/generated images.	2017-07-20 23:46:58 +02:00
Brecht Van Lommel	3b12a71972	Fix T52125: principled BSDF missing with macOS OpenCL.	2017-07-20 15:15:43 +02:00
Stefan Werner	c1ca3c8038	Cycles: fixed the SM_2x CUDA kernel build that I broke in my previous commit	2017-07-20 13:28:34 +02:00
Stefan Werner	4bc6faf9c8	Fix T52107: Color management difference when using multiple and different GPUs together This commit unifies the flattened texture slot names for bindless and regular CUDA textures. Texture indices are now identical across all CUDA architectures, where before Fermi used different indices, which lead to problems when rendering on multi-GPU setups mixing Fermi with newer hardware.	2017-07-20 10:03:27 +02:00
Brecht Van Lommel	8b2785bda5	Fix T49498: continuous grab issues on macOS, particularly with gaming mouses. Change the implementation so it no longer takes over the mouse cursor motion from the OS, instead only move it when warping, similar to Windows and X11. Probably the reason it was not done this way originally is that you then get a 500ms delay after warping, but we can use a trick to avoid that and get much smoother mouse motion than before.	2017-07-18 16:11:33 +02:00
Sergey Sharybin	5f35682f3a	Fix T52021: Shadow catcher renders wrong when catcher object is behind transparent object Tweaked the path radiance summing and alpha to accommodate for possible contribution of light by transparent surface bounces happening prior to shadow catcher intersection. This commit will change the way how shadow catcher results looks when was behind semi transparent object, but the old result seemed to be fully wrong: there were big artifacts when alpha-overing the result on some actual footage.	2017-07-18 09:46:21 +02:00
Sergey Sharybin	d8906f30d3	Cycles: Remove meaningless camera ray check In branched path tracing main loop is always a camera ray, with varying number of transparent bounces.	2017-07-18 09:27:36 +02:00
Mai Lavelle	87164114a3	Cycles: Enable SSS from Principled BSDF only when actually in use This gives speed up for the split kernel in scenes using the principled BSDF but without subsurface scattering.	2017-07-12 04:40:24 -04:00
Mai Lavelle	1f933c94a7	Cycles: Fix comparison in principled BSDF Could have lead to black pixels.	2017-07-11 23:41:22 -04:00
Brecht Van Lommel	29ec0b1162	Fix T52027: OSL getattribute() crash, when optimizer calls it before rendering.	2017-07-11 22:39:51 +02:00
Sergey Sharybin	e26f61a2b5	Cycles: Disable OpenCL clFlush workarounds This is something which was reported to work fine by Mai, Benjamin and confirmed by myself. Disabling this workaround gains us some speedup: Before Now bmw27 04:28.42 04:07.79 classroom 09:26.48 08:54.53 fishy_cat 08:44.01 08:18.70 koro 09:17.98 08:57.18 pavillon_barcelone 12:26.64 11:52.81 Test environment is: - Ubuntu 16.04, with all updates installed - AMD RX 480 GPU - amdgpu pro driver version 17.10-450821	2017-07-11 12:16:58 +02:00
Sergey Sharybin	1dfc4be6ab	Opensubdiv: Fix compilation error with older Opensubdiv versions	2017-07-11 11:05:39 +02:00
Brecht Van Lommel	0584c5ba8e	Fix T51959: Windows + Intel GPU offset between UI drawing and mouse. Unfortunately this means disabling the code that ensures the title bar is properly scaled with DPI, however better to have that as a cosmetic issue than Blender being unusable with a lot of Intel GPUs.	2017-07-08 22:11:07 +02:00
Brecht Van Lommel	3361f2107b	Fix T51967: OSL crash after rendering finished (mainly on Windows).	2017-07-08 02:46:06 +02:00
Sergey Sharybin	fee7f688c3	Cycles: Fix ambiguity in call of min() function	2017-07-07 10:40:19 +02:00
Sergey Sharybin	aaec4ed07d	Fix T51980: Motion Tracking - png image files appear in the Blender program directory when using refine Residue of debug code remained form some older bug fix.	2017-07-07 09:27:24 +02:00
Mai Lavelle	9c3f1ad003	Cycles: Add artificial memory limit debug option for OpenCL	2017-07-06 05:25:46 -04:00
Mai Lavelle	95b345b2fe	Revert "Cycles: use std::min and max for extra overloads" We already have this in util_algorithm.h This reverts commit cff172c7621d89773baa99a9460f19056efb5f1e.	2017-07-06 04:21:29 -04:00
Mai Lavelle	f9963f29e8	Cycles: Dont allow global size to fall to zero	2017-07-05 20:19:15 -04:00
Mai Lavelle	222b96e5c7	Cycles: Detect out of memory before buffer allocation in OpenCL devices	2017-07-05 20:19:12 -04:00
Mai Lavelle	cff172c762	Cycles: use std::min and max for extra overloads	2017-07-05 19:43:34 -04:00
Sergey Sharybin	31f8ca5034	Cycles: Fix compilation error after recent logging changes This file uses std::ostream for helper << operators, so need to make sure corresponding header is included.	2017-07-05 20:40:55 +02:00
Sergey Sharybin	d37dd97e45	Cycles: Pass string by const reference rather than by value Some of the functions might have been inlined, but others i don't see how that was possible (don't think virtual functions can be inlined here). In any case, better be explicitly optimal in the code.	2017-07-05 12:27:41 +02:00
Sergey Sharybin	58c456b12d	Cycles: Fix compilation error when building without Glog and no C++11	2017-07-05 12:01:12 +02:00
Lukas Stockner	15fd758bd6	Fix T51950: Abnormally long Cycles OpenCL GPU render times with certain panoramic camera settings The problem here was that when a "invalid" path is generated by the panoramic camera, it was tagged as RAY_TO_REGENERATE with the intention of generating a new path in kernel_buffer_update. However, since that state was not handled in kernel_queue_enqueue, kernel_buffer_update did not process the path which resulted in an infinite loop.	2017-07-03 18:26:19 +02:00
Lukas Stockner	6782a6076c	Cycles: Add missing split kernel to CPUDevice	2017-07-03 18:26:18 +02:00
Luca Rood	d48a9528ca	Fix missing return error introduced by last commit End of non-void function was being reached since f5535fcb83fd7c1374697923b43565c9e303d225	2017-07-03 12:12:27 +02:00
Brecht Van Lommel	f5535fcb83	Fi T51023: MixRGB constant folding not effective with clamp option.	2017-07-03 05:25:27 +02:00
Brecht Van Lommel	cda24d0853	Fix T51855: Cycles emssive objects with NaN transform break lighting.	2017-07-03 05:04:43 +02:00
Lucas Veber	405121d613	Fix T51759: fluid simulation particles not remoevd when colliding with outflow objects. Reviewed By: brecht Differential Revision: https://developer.blender.org/D2719	2017-07-02 22:42:08 +02:00
Brecht Van Lommel	29c8c50442	Fix T51956: color noise with principled sss, radius 0 and branched path.	2017-07-02 19:21:08 +02:00
Brecht Van Lommel	52b9516e03	Fix principled BSDF incorrectly missing subsurface component with base color black.	2017-07-02 18:22:24 +02:00
Howard Trickey	76eefa5c0d	Better fix for isfinite problem - works in older gcc's too. Previous fix relying on __cplusplus value didn't fix for older gcc's on linux, but this fix does.	2017-06-30 09:04:01 -04:00
Howard Trickey	1938a81e42	Fix compile error after recent 9c2bbfb6 commit. Older C++ compilers use finite instead of isfinite.	2017-06-30 07:47:37 -04:00
Mai Lavelle	c8fa716c06	Cycles: Use float constants instead of double	2017-06-29 23:07:18 -04:00
Mai Lavelle	56dcfcce05	Cycles: Disable baking in mega kernel when not in use to improve build times	2017-06-29 23:07:18 -04:00
Campbell Barton	9c2bbfb6ce	Fix T50887: Holes in fluid mesh on Windows D2556 by @chrisr	2017-06-30 11:48:12 +10:00
Lukas Stockner	1f3fd8e60a	Fix T51909: Cycles: Uninitialized closure normals for the Hair BSDF As the title says, the normal wasn't set for the Hair BSDF because it wasn't needed before. However, the denoiser uses it to store the feature passes, so it needs to be set now.	2017-06-28 21:32:02 +02:00
Lukas Stockner	1979176088	Cycles: Fix excessive sampling weight of glossy Principled BSDF components If there was any specularity in the Principled BSDF, it would get a sampling weight of one regardless of its actual impact. This commit makes Cycles estimate the contribution of the component and adjust the weighting accordingly, which greatly improves the noise characteristics of the Principled BSDF in many cases. Note that this commit might slightly change the brightness of areas when using MultiGGX and high roughnesses, but the new brightness is more accurate and closer to the result of Branched Path Tracing. See T51836 for details. Differential Revision: https://developer.blender.org/D2677	2017-06-22 00:09:56 +02:00
Lukas Stockner	8cb741a598	Fix T51836: Cycles: Fix incorrect PDF approximations of the MultiGGX closures The PDF of the MultiGGX sampling is approximated by the singlescattering GGX term as well as a scaled diffuse term that makes up for the energy in the multiscattering component that's missed by GGX. However, there were two problems with the glossy terms: The diffuse term missed a normalization factor, and the singlescattering term was not properly scaled down based on the albedo estimate. The glass term was completely wrong and has been rewritten. It uses the fresnel factor to weight reflection vs. refraction and uses the glossy MultiGGX model for reflection. For refraction, the correct singlescattering term is now used, and a new albedo approximation is used that was derived by evaluating GGX albedo for roughnesses from 0 to 1 and IORs from 1 to 3 and fitting numerical approximations to it. The resulting model has a mean relative error of 9e-5, but could probably be simplified without losing noticable accuracy in the final render. The improved PDFs help with glossy highlights (due to better light sampling vs. closure sampling MIS) and fix the situation described in T51836 where mixing MultiGGX with other closures (as it happens in e.g. the Principled BSDF) causes incorrect darkening.	2017-06-22 00:09:56 +02:00
Brecht Van Lommel	14ea0c5fcc	Fix T51849: change Cycles clearcoat gloss to roughness. This is compatible with UE4 and more consistent with specular and transmission roughness, even if it deviates from the original Disney BRDF.	2017-06-21 19:55:20 +02:00
Campbell Barton	72c9141a7a	Cleanup: doxygen comments Also remove duplicate & mismatching comments from grease-pencil header. Keep comments close to implementation to avoid getting out of sync.	2017-06-19 10:04:30 +10:00
Sergey Sharybin	8bf108dd48	Guarded allocator: Fix type in macro definition The crash did not happen yet because we always had proper vmemh defined in the parent scope. Patch by Ivan Ivanov (aka obiwanus), thanks! Differential Revision: https://developer.blender.org/D2715	2017-06-17 16:13:30 +02:00
Sergey Sharybin	794311c92b	Cycles: Fix race condition happening in progress utility This is not enough to mutex-guard modification code of integer values, since this operation is NOT atomic. This is not even safe for a single byte data types. For now guarded the getter functions, similar to other functions in this module. Ideally we want to switch modification to an atomic operations, so we wouldn't need any locks in the getters.	2017-06-16 10:22:35 +02:00

... 2 3 4 5 6 ...

7164 Commits