blender/intern/cycles/kernel
Thomas Dinges 866c7fb6e6 Cycles: Add an AVX2 CPU kernel.
This kernel is compiled with AVX2, FMA3, and BMI compiler flags. At the moment only Intel Haswell benefits from this, but future AMD CPUs will have these instructions as well.

Makes rendering on Haswell CPUs a few percent faster, only benchmarked with clang on OS X though.

Part of my GSoC 2014.
2014-06-13 22:26:20 +02:00
..
closure Style cleanup: indentation, braces 2014-05-05 02:19:08 +10:00
geom Cycles: Calculate face normal on the fly. 2014-06-13 21:59:13 +02:00
osl Cycles: Support builtin images for OSL shading backend 2014-06-13 20:42:28 +06:00
shaders Cycles: Add dedicated nodes to split/combine vectors. 2014-06-13 21:59:14 +02:00
svm Cycles: Add dedicated nodes to split/combine vectors. 2014-06-13 21:59:14 +02:00
CMakeLists.txt Cycles: Add an AVX2 CPU kernel. 2014-06-13 22:26:20 +02:00
kernel_accumulate.h Fix cycles baking code build errors with OpenCL on some platforms. 2014-05-11 16:36:51 +02:00
kernel_avx2.cpp Cycles: Add an AVX2 CPU kernel. 2014-06-13 22:26:20 +02:00
kernel_avx.cpp Cycles Refactor: Add SSE Utility code from Embree for cleaner SSE code. 2014-06-13 21:59:12 +02:00
kernel_bake.h Fix T40370: cycles CUDA baking timeout with high number of AA samples. 2014-06-06 15:39:04 +02:00
kernel_camera.h Cycles: new camera_direction_from_point 2014-05-27 11:09:36 -03:00
kernel_compat_cpu.h Cycles Refactor: Add SSE Utility code from Embree for cleaner SSE code. 2014-06-13 21:59:12 +02:00
kernel_compat_cuda.h Cycles CUDA: make CUDA toolkit 6.0 the official supported version. 2014-04-30 16:07:27 +02:00
kernel_compat_opencl.h OpenCL + AMD adapt kernel to newer driver 2014-04-07 14:01:31 +02:00
kernel_differential.h Cycles: change __device and similar qualifiers to ccl_device in kernel code. 2013-11-18 08:48:15 +01:00
kernel_emission.h Style cleanup: indentation, braces 2014-05-05 02:19:08 +10:00
kernel_film.h Cycles: fix crash in SSE hair and half-floats on x86+vc2008 2014-02-27 15:01:20 +04:00
kernel_globals.h Fix T37264: cycles CPU render had limited number of float images, bumped to 1024. 2013-12-14 15:07:46 +01:00
kernel_jitter.h Cycles: change __device and similar qualifiers to ccl_device in kernel code. 2013-11-18 08:48:15 +01:00
kernel_light.h Cycles: MIS for lamps now loops over all lamps instead of picking one. 2014-05-01 19:25:13 +02:00
kernel_math.h Cycles: relicense GNU GPL source code to Apache version 2.0. 2013-08-18 14:16:15 +00:00
kernel_montecarlo.h Cycles code refactor: minor refactoring and comments for volume code. 2014-03-29 13:03:49 +01:00
kernel_passes.h Style cleanup: indentation, braces 2014-05-05 02:19:08 +10:00
kernel_path_state.h Fix T39804: cycles smoke domain visible in rendering. 2014-05-29 14:51:02 +02:00
kernel_path.h Fix T40456: cycles bug with branched path + sss + no sample all direct lights. 2014-06-04 17:57:19 +02:00
kernel_projection.h Fix cycles using acosf in panorama render when it should use safe_acosf. 2014-04-18 13:51:29 +02:00
kernel_random.h Fix T39804: cycles smoke domain visible in rendering. 2014-05-29 14:51:02 +02:00
kernel_shader.h Cycles: Calculate face normal on the fly. 2014-06-13 21:59:13 +02:00
kernel_shadow.h Fix T40289: Cycles leaking memory 2014-05-21 16:00:20 +10:00
kernel_sse2.cpp Fix T40370: cycles CUDA baking timeout with high number of AA samples. 2014-06-06 15:39:04 +02:00
kernel_sse3.cpp Fix T40370: cycles CUDA baking timeout with high number of AA samples. 2014-06-06 15:39:04 +02:00
kernel_sse41.cpp Fix T40370: cycles CUDA baking timeout with high number of AA samples. 2014-06-06 15:39:04 +02:00
kernel_subsurface.h Code cleanup: white space and cmake was broken on all platforms 2014-02-03 13:56:34 +11:00
kernel_textures.h Cycles: Calculate face normal on the fly. 2014-06-13 21:59:13 +02:00
kernel_types.h Fix T39804: cycles smoke domain visible in rendering. 2014-05-29 14:51:02 +02:00
kernel_volume.h Fix T39804: cycles smoke domain visible in rendering. 2014-05-29 14:51:02 +02:00
kernel.cl Fix T40370: cycles CUDA baking timeout with high number of AA samples. 2014-06-06 15:39:04 +02:00
kernel.cpp Fix T40370: cycles CUDA baking timeout with high number of AA samples. 2014-06-06 15:39:04 +02:00
kernel.cu Fix T40370: cycles CUDA baking timeout with high number of AA samples. 2014-06-06 15:39:04 +02:00
kernel.h Cycles: Add an AVX2 CPU kernel. 2014-06-13 22:26:20 +02:00
SConscript Attempted fix for T40363: CUDA 30% slowdown in testbuilds compared to 2.70. 2014-05-26 16:52:28 +02:00