blender

Author	SHA1	Message	Date
Brecht Van Lommel	030a023eb2	Cleanup: add zero bit counting functions	2019-08-26 16:07:01 +02:00
Campbell Barton	e12c08e8d1	ClangFormat: apply to source, most of intern Apply clang format as proposed in T53211. For details on usage and instructions for migrating branches without conflicts, see: https://wiki.blender.org/wiki/Tools/ClangFormat	2019-04-17 06:21:24 +02:00
Campbell Barton	1daa20ad9f	Cleanup: strip trailing space for cycles	2018-07-06 10:17:58 +02:00
Sergey Sharybin	4d38932cb4	Cycles: Use more stable version of integer square root function Old code was working quite unreliable in combination with fast math flag, especially when compiling with Clang. It seems we were hitting result of the following bug submitted to Clang [1]. Basically, it was happening so that (int)sqrtf(64) was 7 when Cycles is built with Clang but was correct 8 when built with GCC. This commit works this around. Annoying, but don't see other way to keep sampling pattern the same for Clang and GCC. [1] https://bugs.llvm.org//show_bug.cgi?id=24063	2017-05-09 17:07:17 +02:00
Sergey Sharybin	9d50175b6c	Cycles: Fix correlation issues in certain cases There were two cases where correlation issues were obvious: - File from T38710 was giving issues in 2.78a again - File from T50116 was having totally different shadow between sample 1 and sample 32. Use some more simplified version of CMJ hash which seems to give nice randomized value which solves the correlation. This commit will break all unit test files, but it's a bug fix so perhaps OK to commit this. This also fixes T41143: Sobol gives nonuniform noise Proper science paper about hash function is coming. Reviewers: brecht Reviewed By: brecht Subscribers: lukasstockner97 Differential Revision: https://developer.blender.org/D2385	2016-12-01 14:19:15 +01:00
Brecht Van Lommel	636195e402	Fix T48301: Cycles incorrect render with CMJ and viewport samples 0. Max samples 2147483647 was causing integer overflow.	2016-04-28 23:57:20 +02:00
Sergey Sharybin	b6d9cbe654	Cycles: Fix bug in CMJ pattern when number of samples is 1 It was wrongly considering 1 is a power of 2. While it is a correct thing (1 == 2^0) it's not what the math in some later formulas expects.	2016-02-24 14:23:45 +01:00
Sergey Sharybin	537bd0eb51	Fix T46671: Cycles assert with CMJ sample function With current formulation of cmj_fast_div_pow2() it should always return 0 in the case of first argument is zero and no assert really needed anymore.	2015-11-03 18:49:27 +05:00
Sergey Sharybin	5ff132182d	Cycles: Code cleanup, spaces around keywords This inconsistency drove me totally crazy, it's really confusing when it's inconsistent especially when you work on both Cycles and Blender sides. Shouldn;t cause merge PITA, it's whitespace changes only, Git should be able to merge it nicely.	2015-03-28 00:15:15 +05:00
Sergey Sharybin	61eab743f1	Cycles: Optimization for CMJ in CUDA kernels Two things: - Use intrinsics for clz/ctz (ctz is implemented via ffs()). - Use faster sqrt() function which precision is enough for integer values.	2015-03-13 12:38:14 +05:00
Thomas Dinges	ee36e75b85	Cleanup: Fix Cycles Apache header. This was already mixed a bit, but the dot belongs there.	2014-12-25 02:50:24 +01:00
Sergey Sharybin	d84c15696b	Fix T41601: Correlated multi-jitter with high samples "hangs" Issue was caused by the precision issues which made sdivm by 1 under it's actual value. We can try to do some eps magic, but from the tests on laptop and desktop doing integer division is not slower than using floats here.	2014-08-28 15:15:59 +06:00
Thomas Dinges	24ea3ab1a9	Cleanup: intrin.h is already included via util_optimization.h.	2014-08-27 03:22:36 +02:00
Sergey Sharybin	e827d904ae	Move include outside of the CCL namespace	2014-08-27 00:11:06 +06:00
Sergey Sharybin	44fc0ddee9	Cycles: Use compiler intrinsics for clz/ctz in CMJ code for MSVC	2014-08-26 14:22:08 +06:00
Brecht Van Lommel	c18712e868	Cycles: change __device and similar qualifiers to ccl_device in kernel code. This to avoids build conflicts with libc++ on FreeBSD, these __ prefixed values are reserved for compilers. I apologize to anyone who has patches or branches and has to go through the pain of merging this change, it may be easiest to do these same replacements in your code and then apply/merge the patch. Ref T37477.	2013-11-18 08:48:15 +01:00
Brecht Van Lommel	e25ad0778f	Fix #36545 : crash with branched path tracing, correlated multi-jittered sampling and subsurface scattering.	2013-08-23 23:04:50 +00:00
Brecht Van Lommel	b9ce231060	Cycles: relicense GNU GPL source code to Apache version 2.0. More information in this post: http://code.blender.org/ Thanks to all contributes for giving their permission!	2013-08-18 14:16:15 +00:00
Brecht Van Lommel	16204bd647	Cycles: prepare to make CUDA 5.0 the official version we use * Add CUDA compiler version detection to cmake/scons/runtime * Remove noinline in kernel_shader.h and reenable --use_fast_math if CUDA 5.x is used, these were workarounds for CUDA 4.2 bugs * Change max number of registers to 32 for sm 2.x (based on performance tests from Martijn Berger and confirmed here), and also for NVidia OpenCL. Overall it seems that with these changes and the latest CUDA 5.0 download, that performance is as good as or better than the 2.67b release with the scenes and graphics cards I tested.	2013-06-19 17:54:23 +00:00
Brecht Van Lommel	484d765bd4	Cycles: attempt to fix internal compile error with some visual studio builds	2013-06-18 13:19:16 +00:00
Brecht Van Lommel	37f92119e4	Fix #35665 : more CUDA issues with recent kernel changes, tested on sm_20, sm_21 and sm_30 cards, so hopefully it should all work now. Also includes some warnings fixes related to nvcc compiler arguments, should make no difference otherwise.	2013-06-11 21:58:48 +00:00
Brecht Van Lommel	b20a7e01d0	Cycles: experimental correlated multi-jittered sampling pattern that can be used instead of sobol. So far one doesn't seem to be consistently better or worse than the other for the same number of samples but more testing is needed. The random number generator itself is slower than sobol for most number of samples, except 16, 64, 256, .. because they can be computed faster. This can probably be optimized, but we can do that when/if this actually turns out to be useful. Paper this implementation is based on: http://graphics.pixar.com/library/MultiJitteredSampling/ Also includes some refactoring of RNG code, fixing a Sobol correlation issue with the first BSDF and < 16 samples, skipping some unneeded RNG calls and using a simpler unit square to unit disk function.	2013-06-07 16:06:22 +00:00

22 Commits