This problem was introduced in commit 983cbafd1877f8dbaae60b064a14e27b5b640f18.
The issue is that the baking routine was not generating a unique index for the
RNG (random number generator); a sketch of what such an index can look like
follows after this message.
Reviewers: sergey
Differential Revision: https://developer.blender.org/D749
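
As an illustration only (these names are hypothetical, not Cycles' actual API),
a unique per-texel RNG index could be derived from the texel coordinates with a
small integer hash, so every baked texel draws a different random sequence:

    #include <cstdint>

    /* Hypothetical sketch, not Cycles' code: without a unique index, all
     * texels sample the same random sequence and the noise becomes
     * visibly correlated. */
    static inline uint32_t hash_uint2(uint32_t kx, uint32_t ky)
    {
      uint32_t h = kx * 0x9e3779b1u ^ ky * 0x85ebca6bu;
      h ^= h >> 16;
      h *= 0x7feb352du;
      h ^= h >> 15;
      return h;
    }

    static inline uint32_t bake_rng_index(int x, int y, int width)
    {
      /* Deterministic and unique per texel of the baked image. */
      return hash_uint2((uint32_t)(y * width + x), 0u);
    }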
Baking now does one AA sample at a time, just like the final render. There is
also some code for shader antialiasing that solves T40369, but it is disabled
for now because it may have unpredictable side effects.
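
As a rough sketch of that per-sample flow (the names below are illustrative,
not the actual Cycles host code):

    /* Illustrative sketch: bake one AA sample at a time and accumulate,
     * the way final render loops over samples for a tile. */
    struct BakeTile {
      float progress = 0.0f;
      bool cancelled = false;
    };

    /* Stub standing in for the device call that evaluates one AA sample
     * for every pixel in the tile. */
    static void device_run_bake_kernel(BakeTile &, int /*sample*/) {}

    static void bake_tile(BakeTile &tile, int num_aa_samples)
    {
      for (int sample = 0; sample < num_aa_samples; sample++) {
        device_run_bake_kernel(tile, sample);
        tile.progress = (sample + 1) / (float)num_aa_samples;
        if (tile.cancelled)
          break;
      }
    }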
The kernel for baking the world texture used to be the same as the one used for
regular baking; it is now a separate kernel, which allows it to reserve much
less memory.
* 32-bit GCC builds now have the SSE BVH optimizations turned off, but still
compile with SSE flags for better performance.
* The white color when rendering on Windows seems to have been unrelated to SSE;
rather, it was the graphics driver not supporting half float textures. A check
for that has been added now.
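
A minimal sketch of such a capability check, assuming a GLEW-based setup and
that glewInit() has already run (Blender's actual GPU code may query this
differently):

    #include <GL/glew.h>

    /* Sketch only: fall back to the non-half-float display path when the
     * driver exposes neither GL 3.0 nor the ARB float texture extensions. */
    static bool gpu_supports_half_float_textures()
    {
      return GLEW_VERSION_3_0 ||
             glewIsSupported("GL_ARB_texture_float GL_ARB_half_float_pixel");
    }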
Color management is now handled by the OpenColorIO GLSL shader, except for
curves, which are still missing from that shader.
The pixels are stored in a half float texture, converted from full float with
native GPU instructions and SIMD on the CPU, so it should be pretty quick.
Using a GLSL shader is useful for GPU rendering because it avoids a copy
through CPU memory.
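
On CPUs with the F16C instruction set the conversion can be done four floats at
a time; a minimal sketch (compile with -mf16c; Cycles' actual conversion code
may differ, and a scalar fallback is needed on CPUs without F16C):

    #include <immintrin.h>
    #include <cstdint>

    /* Convert 4 full floats to 4 half floats with the F16C instructions. */
    static inline void float4_to_half4(const float *src, uint16_t *dst)
    {
      __m128 f = _mm_loadu_ps(src);
      __m128i h = _mm_cvtps_ph(f, _MM_FROUND_TO_NEAREST_INT | _MM_FROUND_NO_EXC);
      _mm_storel_epi64((__m128i *)dst, h);
    }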
* The GPU kernel can now be compiled without __NON_PROGRESSIVE__ again; this
was broken after my last commit. Also added a check for have_error(), in case
the GPU kernel is built without the Non-Progressive integrator, to avoid a
crash.
* Don't compile the progressive kernel twice on the CPU if __NON_PROGRESSIVE__
is disabled there.
* The Non-Progressive integrator is now available on the GPU (CUDA, sm_20 and
above).
Implementation details:
* kernel_path_trace() has been split up into two functions:
kernel_path_trace_non_progressive() and kernel_path_trace_progressive().
* We compile two CUDA kernel entry functions (in kernel.cu) for the two
integrators. They are still inside one .cubin file, but thanks to the kernel
separation there should be no performance problem. I tested with the BMW file
on my GeForce 540M and the render times were the same for 100 samples
(1.57 min in my case).
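
The rough shape of the split, as a simplified sketch rather than the actual
kernel code (on CUDA, the two __global__ entry points each call one of these):

    /* Simplified sketch, not the actual kernel code: two separate integrator
     * functions plus a thin dispatcher; the real signatures take
     * KernelGlobals, the render buffer, RNG state, sample index and pixel
     * coordinates. */
    static void kernel_path_trace_progressive(/* ... */)
    {
      /* one path tracing sample for one pixel */
    }

    #ifdef __NON_PROGRESSIVE__
    static void kernel_path_trace_non_progressive(/* ... */)
    {
      /* non-progressive sampling (all AA/light samples) for one pixel */
    }
    #endif

    static void kernel_path_trace(bool use_non_progressive)
    {
    #ifdef __NON_PROGRESSIVE__
      if (use_non_progressive) {
        kernel_path_trace_non_progressive();
        return;
      }
    #endif
      (void)use_non_progressive;
      kernel_path_trace_progressive();
    }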
This is part of my GSoC project: an SVN merge of r59032 plus a manual merge of
the UI changes for this from my branch.
On the BMW scene, this gives roughly a 10% overall speedup with clang/gcc, and
a 30% speedup with Visual Studio (2008). It turns out Visual Studio was
optimizing the existing code quite poorly compared to the fairly good
autovectorization by clang/gcc, but hand-written SSE code still gives a smaller
speed boost there as well.
This code isn't enabled yet when using the hair minimum width feature; that
still needs to be made to work with the SSE code.
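
For context, the kind of hand-written SSE this refers to is the ray/box slab
test used during traversal; below is a generic sketch (not the actual Cycles
BVH code) that tests one node's bounds with all three axes in SSE registers:

    #include <xmmintrin.h>
    #include <algorithm>

    /* Generic SSE slab test sketch, not Cycles' BVH code. origin/inv_dir hold
     * (x, y, z, unused); inv_dir is 1/direction, precomputed once per ray.
     * Zero direction components need extra handling that is omitted here. */
    struct SSERay {
      __m128 origin;
      __m128 inv_dir;
      float tmin, tmax;
    };

    static inline bool ray_aabb_hit(const SSERay &ray, __m128 bb_min, __m128 bb_max)
    {
      __m128 t0 = _mm_mul_ps(_mm_sub_ps(bb_min, ray.origin), ray.inv_dir);
      __m128 t1 = _mm_mul_ps(_mm_sub_ps(bb_max, ray.origin), ray.inv_dir);

      /* Per-axis entry/exit distances, reduced to scalar for the final test. */
      float tnear[4], tfar[4];
      _mm_storeu_ps(tnear, _mm_min_ps(t0, t1));
      _mm_storeu_ps(tfar, _mm_max_ps(t0, t1));

      float t_entry = std::max({tnear[0], tnear[1], tnear[2], ray.tmin});
      float t_exit  = std::min({tfar[0], tfar[1], tfar[2], ray.tmax});
      return t_entry <= t_exit;
    }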