blender

Author	SHA1	Message	Date
Brecht Van Lommel	2851ed4a55	Cycles code refactor: use __launch_bounds__ instead of -maxrregcount for CUDA. This makes it easier to have per kernel number of registers. Also, all the tunable parameters for this are now in kernel.cu, rather than spread over cmake, scons and device_cuda.cpp.	2014-04-16 21:05:04 +02:00
Thomas Dinges	297a2223b5	Cycles / CUDA: Increase sm_2x registers to 40. This fixes the ptaxs "ACCESS_VIOLATION" error and should allow our Linux and Windows build bots to compile again. Unfortunately this comes with a performance penalty on sm_2x cards, so this is only a workaround for now. Branched Path is still globally disabled on GPU.	2014-04-08 23:25:54 +02:00
Brecht Van Lommel	27043b8e40	Cycles code internals: add support for mesh voxel grid attributes. These are internally stored as a 3D image textures, but accessible like e.g. UV coordinates though the attribute node and getattribute(). This is convenient for rendering e.g. smoke objects where data like density is really a property of the mesh, and it avoids having to specify the smoke object in a texture node, instead the material will work with any smoke domain.	2014-03-29 13:03:48 +01:00
Brecht Van Lommel	393216a6df	Cycles code refactor: move more code to geom folder, add some comments.	2014-03-29 13:03:48 +01:00
Brecht Van Lommel	e2184c653e	Cycles: add support for curve deformation motion blur.	2014-03-29 13:03:47 +01:00
Brecht Van Lommel	6020d00990	Cycles: add support for mesh deformation motion blur.	2014-03-29 13:03:47 +01:00
Brecht Van Lommel	84470a1190	Cycles code refactor: move geometry related kernel files into own directory.	2014-03-29 13:03:45 +01:00
Campbell Barton	66671f1aae	Cycles: fix for building with cmake when gcc refuses sse args	2014-03-27 10:40:14 +11:00
Campbell Barton	23fd670c39	Code cleanup: cmake	2014-03-13 23:31:06 +11:00
Thomas Dinges	da523185fb	Fix compilation of Cycles AVX kernel with cmake.	2014-01-16 18:32:54 +01:00
Thomas Dinges	de28a4d4b2	Cycles: Add an AVX kernel for CPU rendering. * AVX is available on Intel Sandy Bridge and newer and AMD Bulldozer and newer. * We don't use dedicated AVX intrinsics yet, but gcc auto vectorization gives a 3% performance improvement for Caminandes. Tested on an i5-3570, Linux x64. * No change for Windows yet, MSVC 2008 does not support AVX. Reviewed by: brecht Differential Revision: https://developer.blender.org/D216	2014-01-16 17:04:11 +01:00
Brecht Van Lommel	d9e52ac98b	Code cleanup: move half float functions to separate header file.	2014-01-15 15:29:22 +01:00
Martijn Berger	993b946681	DingTo forgot to make sure kernel_sse41 is compiled in even when empty	2014-01-14 21:49:48 +01:00
Thomas Dinges	9351ac0d85	Cycles: Skip the compilation of the dedicated SSE2 kernel on x86-64, we can assume SSE2 here, so just re-use the regular one. Saves 500kb in the blender binary. Reviewed by: brecht Differential Revision: https://developer.blender.org/D199	2014-01-14 20:39:54 +01:00
Jens Verwiebe	a0b424aa4c	Take back last header copy, due it is for native only, must be a runtime solution, todo: do by definitions	2014-01-06 20:43:54 +01:00
Jens Verwiebe	48d8faeb79	Cmake: fix kernelcompile after introduction of util_simd.h	2014-01-06 20:26:02 +01:00
Campbell Barton	c3bc2fd941	CMake: cleanup and add include	2014-01-04 13:17:07 +11:00
Brecht Van Lommel	e369a5c485	Cycles Volume Render: support for rendering of homogeneous volume with absorption. This is the simplest possible volume rendering case, constant density inside the volume and no scattering or emission. My plan is to tweak, verify and commit more volume rendering effects one by one, doing it all at once makes it difficult to verify correctness and track down bugs. Documentation is here: http://wiki.blender.org/index.php/Doc:2.6/Manual/Render/Cycles/Materials/Volume Currently this hooks into path tracing in 3 ways, which should get us pretty far until we add more advanced light sampling. These 3 hooks are repeated in the path tracing, branched path tracing and transparent shadow code: * Determine active volume shader at start of the path * Change active volume shader on transmission through a surface * Light attenuation over line segments between camera, surfaces and background This is work by "storm", Stuart Broadfoot, Thomas Dinges and myself.	2013-12-28 16:57:10 +01:00
Brecht Van Lommel	133f770ab3	Code cleanup: move shadow_blocked function into separate file.	2013-12-28 16:57:10 +01:00
Martijn Berger	e3a79258d1	Cycles: test code for sse 4.1 kernel and alignment for some vector types. This is mostly work towards enabling the __KERNEL_SSE__ option to start using SIMD operations for vector math operations. This 4.1 kernel performes about 8% faster with that option but overall is still slower than without the option. WITH_CYCLES_OPTIMIZED_KERNEL_SSE41 is the cmake flag for testing this kernel. Alignment of int3, int4, float3, float4 to 16 bytes seems to give a slight 1-2% speedup on tested systems with the current kernel already, so is enabled now.	2013-11-22 14:42:41 +01:00
Thomas Dinges	b5a5773fa9	Cycles / CUDA: * Remove support for CUDA Toolkit 4.x, only Toolkit 5.0 and above are supported now. * Remove support for sm_1x cards (< Fermi) for good. We didn't officially support those cards for a few releases already, now remove some special code that was still there.	2013-10-08 15:29:28 +00:00
Stuart Broadfoot	3306afac87	Cycles Hair: Two basic bair shaders added A new hair bsdf node, with two closure options, is added. These closures allow the generation of the reflective and transmission components of hair. The node allows control of the highlight colour, roughness and angular shift. Llimitations include: -No glint or fresnel adjustments. -The 'offset' is un-used when triangle primitives are used.	2013-09-15 23:58:00 +00:00
Thomas Dinges	5a6bcd1d42	Cycles: * Refactor PathState struct and functions into its own file.	2013-09-08 18:59:39 +00:00
Campbell Barton	b97334f992	add GPL header to treehash.c and add missing includes to cmake.	2013-08-24 03:17:28 +00:00
Thomas Dinges	285ef99931	Cycles: * Added 2 new nodes to combine and separate HSV colors. Screenshot: http://www.pasteall.org/pic/show.php?id=54828	2013-07-03 23:46:56 +00:00
Thomas Dinges	00234dab2f	Merged revision(s) 57587-57670 from trunk/blender into soc-2013-dingto	2013-06-23 18:04:13 +00:00
Thomas Dinges	e4ef608020	Cycles / Vector Transform Node: * Implementation of Vector Transform Node into Cycles. * OSL backend is done, SVM needs the matrices still.	2013-06-23 17:51:08 +00:00
Brecht Van Lommel	8d6e5e2fee	Cycles: update build configurations to include CUDA sm_35 architecture. When using a compiler older than CUDA 5.0 it will give a warning and skip this architecture.	2013-06-20 13:10:47 +00:00
Thomas Dinges	e6fc174152	Merged revision(s) 57499-57586 from trunk/blender into soc-2013-dingto	2013-06-19 20:40:54 +00:00
Brecht Van Lommel	16204bd647	Cycles: prepare to make CUDA 5.0 the official version we use * Add CUDA compiler version detection to cmake/scons/runtime * Remove noinline in kernel_shader.h and reenable --use_fast_math if CUDA 5.x is used, these were workarounds for CUDA 4.2 bugs * Change max number of registers to 32 for sm 2.x (based on performance tests from Martijn Berger and confirmed here), and also for NVidia OpenCL. Overall it seems that with these changes and the latest CUDA 5.0 download, that performance is as good as or better than the 2.67b release with the scenes and graphics cards I tested.	2013-06-19 17:54:23 +00:00
Thomas Dinges	9e16c5a9e4	Cycles / Blackbody node: * First (brute force) implementation for SVM. This works and delivers the same result as OSL, but it's slow. * Code inside svm_blackbody.h inspired by a patch by Philipp Oeser (#35698), thanks. Ideas: * Use a lookup table to perform the calculations on render/ level. * Implement it as a RNA property only, and do the calculation like Sun/Sky precompute.	2013-06-15 23:47:09 +00:00
Brecht Van Lommel	37f92119e4	Fix #35665 : more CUDA issues with recent kernel changes, tested on sm_20, sm_21 and sm_30 cards, so hopefully it should all work now. Also includes some warnings fixes related to nvcc compiler arguments, should make no difference otherwise.	2013-06-11 21:58:48 +00:00
Thomas Dinges	cf359f6c7f	Cycles / Wavelength to RGB node: * Added a node to convert wavelength (in nanometer, from 380nm to 780nm) to RGB values. This can be useful to match real world colors easier. Example render: http://www.pasteall.org/pic/show.php?id=53202 ToDo: * Move some functions into an util file, maybe a common util_color.h or so. * Test GPU, unfortunately sm_21 doesn't work for me yet.	2013-06-09 20:46:22 +00:00
Brecht Van Lommel	b20a7e01d0	Cycles: experimental correlated multi-jittered sampling pattern that can be used instead of sobol. So far one doesn't seem to be consistently better or worse than the other for the same number of samples but more testing is needed. The random number generator itself is slower than sobol for most number of samples, except 16, 64, 256, .. because they can be computed faster. This can probably be optimized, but we can do that when/if this actually turns out to be useful. Paper this implementation is based on: http://graphics.pixar.com/library/MultiJitteredSampling/ Also includes some refactoring of RNG code, fixing a Sobol correlation issue with the first BSDF and < 16 samples, skipping some unneeded RNG calls and using a simpler unit square to unit disk function.	2013-06-07 16:06:22 +00:00
Thomas Dinges	3758193c18	Cycles / Wireframe node: * Added a wireframe node (Input category) to get access to Mesh wireframe data. The thickness can be controlled via a "Size" parameter, and is available in world units (default) and screen pixel size. * Only the triangulated mesh is available now, quads is for later. Documentation: http://wiki.blender.org/index.php/Doc:2.6/Manual/Render/Cycles/Nodes/More#Wireframe Render and Example file: http://www.pasteall.org/pic/show.php?id=51731 http://www.pasteall.org/blend/21510	2013-05-20 15:58:37 +00:00
Brecht Van Lommel	ed1a08382f	Cycles: code refactoring to deduplicate the various BVH traversal variations. Now there is a single BVH traversal code with #ifdefs for various features. At runtime it will then select the appropriate variation to use depending if instancing, hair or motion blur is in use. This makes scenes without hair render a bit faster, especially after the minimum width feature was added. It's not the most beautiful code, but we can't use c++ templates and there were already 4 copies, adding 4 more to handle the hair case separately would be too much.	2013-04-17 20:07:22 +00:00
Brecht Van Lommel	de9dffc61e	Cycles: initial subsurface multiple scattering support. It's not working as well as I would like, but it works, just add a subsurface scattering node and you can use it like any other BSDF. It is using fully raytraced sampling compatible with progressive rendering and other more advanced rendering algorithms we might used in the future, and it uses no extra memory so it's suitable for complex scenes. Disadvantage is that it can be quite noisy and slow. Two limitations that will be solved are that it does not work with bump mapping yet, and that the falloff function used is a simple cubic function, it's not using the real BSSRDF falloff function yet. The node has a color input, along with a scattering radius for each RGB color channel along with an overall scale factor for the radii. There is also no GPU support yet, will test if I can get that working later. Node Documentation: http://wiki.blender.org/index.php/Doc:2.6/Manual/Render/Cycles/Nodes/Shaders#BSSRDF Implementation notes: http://wiki.blender.org/index.php/Dev:2.6/Source/Render/Cycles/Subsurface_Scattering	2013-04-01 20:26:52 +00:00
Brecht Van Lommel	7c9d993347	Fix cycles intersection issue with overlapping faces on windows 32 bit and CPU without SSE3 support, due to 80 bit precision float register being used for one bounding box but not the one next to it.	2013-02-04 16:12:37 +00:00
Brecht Van Lommel	57cf48e7c6	Cycles Hair: refactoring to support generic attributes for hair curves. There should be no functional changes yet. UV, tangent and intercept are now stored as attributes, with the intention to add more like multiple uv's, vertex colors, generated coordinates and motion vectors later. Things got a bit messy due to having both triangle and curve data in the same mesh data structure, which also gives us two sets of attributes. This will get cleaned up when we split the mesh class.	2013-01-03 12:08:54 +00:00
Brecht Van Lommel	54729df020	Cycles OSL: diffuse_toon and specular_toon closures. These are toon shaders with a size parameter between 0.0 and 1.0 that gives a angle of reflection between 0° and 90°, and a smooth parameter that gives and angle over which a smooth transition from full to no reflection happens. These work with global illumination and do importance sampling of the area within the angle. Note that unlike most other BSDF's these are not energy conserving in general, in particular if their weight is 1.0 and size > 2/3 (or 60°) they will add more energy in each bounce. Diffuse: http://www.pasteall.org/pic/show.php?id=42119 Specular: http://www.pasteall.org/pic/show.php?id=42120	2012-12-19 21:17:16 +00:00
Brecht Van Lommel	06888b7beb	Cycles OSL minor optimizations: recycle shading context, don't do memory allocations for trace data, avoid some virtual function calls. Only helps a few percentages.	2012-12-15 10:18:42 +00:00
Brecht Van Lommel	8d4bd2cf3b	Cycles OSL: add diffuse_ramp closure in addition to phong_ramp.	2012-12-11 14:39:41 +00:00
Brecht Van Lommel	209cd25745	Cycles OSL: phong_ramp(N, exponent, colors[8]) closure added, which works like a specular ramp shader. Note this is OSL only still, for experimenting. Patch by Thomas.	2012-11-06 19:59:07 +00:00
Brecht Van Lommel	615fe0295f	Cycles OSL: refactoring and fixes * Moved kernel/osl/nodes to kernel/shaders * Renamed standard attributes to use geom:, particle:, object: prefixes * Update stdosl.h to properly reflect the closures we support * Fix the wrong stdosl.h being used for building shaders * Add geom:numpolyvertices, geom:trianglevertices, geom:polyvertices attributes	2012-11-03 14:32:13 +00:00
Thomas Dinges	950524722c	Cycles: * Build system fixes for closure refactor.	2012-10-20 14:08:49 +00:00
Brecht Van Lommel	9a1c1f132d	Cycles OSL: most closure code is now shared between OSL and SVM. Also fix transmission pass and filter glossy option. The BSDF closure class is now more similar to the SVM closures, and includes some flags and labels that are needed to properly categorize the BSDF's for render passes. Phong closure is gone for the moment, needs to be adapated to the new structure still.	2012-10-20 12:18:00 +00:00
Campbell Barton	536d9fec80	code cleanup: - move object_iterators.c --> view3d_iterators. (ED_object.h had to include ED_view3d.h which isn't so nice) - move projection functions from view3d_view.c --> view3d_project.c (view3d_view was becoming a mishmash of utility functions and operators). - some some cmake includes as system-includes.	2012-10-17 04:13:03 +00:00
Brecht Van Lommel	fe16b26206	Cycles: fix some update issues with camera motion blur, and do some more work for getting object motion blur ready.	2012-10-15 21:12:58 +00:00
Campbell Barton	b0c7c8756f	code cleanup: cycles now uses system includes for boost/oiio.. etc, so we dont get warnings from system headers.	2012-09-20 09:04:43 +00:00
Lukas Toenne	a9105a7dea	Fix for Cycles (CUDA) compilation (again ...). Moved the AttributeStandard enum typedef and the attribute_standard_name mapping function to util_attribute/util_types headers, so they can properly be used by kernel and render files alike. This should avoid any std C includes which are not available in CUDA. Thanks to Sergey for help!	2012-09-07 11:06:45 +00:00

1 2

82 Commits