blender

Author	SHA1	Message	Date
Brecht Van Lommel	7c0a0bae79	Fix #33375 : OSL geom:trianglevertices gave wrong coordinates for static BVH. Also some simple OSL optimization, passing thread data pointer directly instead of via thread local storage, and creating ustrings for attribute lookup.	2012-12-01 19:15:05 +00:00
Brecht Van Lommel	fdadfde5c5	Fix #33158 : motion vector pass wrong in cycles in some scenes, wrong vectors due to float precision problem in matrix inverse.	2012-11-21 01:00:03 +00:00
Brecht Van Lommel	a80b0915c7	Fix #33243 : cycles CUDA going missing sometimes, disabled the new code now that can detect if a device becomes available while Blender runs, appears to be unreliable for some reason.	2012-11-20 17:39:56 +00:00
Campbell Barton	353ad46e10	code cleanup: quiet double promotion warnings	2012-11-07 23:52:33 +00:00
Brecht Van Lommel	204113b791	Fix #33107 : cycles fixed threads 1 was still having two cores do work, because main thread works as well.	2012-11-07 21:00:49 +00:00
Sergey Sharybin	6eec49ed20	Cycles: memory usage report This commit adds memory usage information while rendering. It reports memory used by device, meaning: - For CPU it'll report real memory consumption - For GPU rendering it'll report GPU memory consumption, but it'll also mean the same memory is used from host side. This information displays information about memory requested by Cycles, not memory really allocated on a device. Real memory usage might be higher because of memory fragmentation or optimistic memory allocator. There's really nothing we can do against this. Also in contrast with blender internal's render cycles memory usage does not include memory used by scene, only memory needed by cycles itself will be displayed. So don't freak out if memory usage reported by cycles would be much lower than blender internal's. This commit also adds RenderEngine.update_memory_stats callback which is used to tell memory consumption from external engine to blender. This information is used to generate information line after rendering is finished.	2012-11-05 08:04:57 +00:00
Brecht Van Lommel	57b7f405a4	Fix related to #32929 : update list of available devices for cycles rendering while Blender is running, not only on load. It might help a case when Blender is started before the CUDA driver is fully initialized.	2012-10-22 14:04:44 +00:00
Campbell Barton	536d9fec80	code cleanup: - move object_iterators.c --> view3d_iterators. (ED_object.h had to include ED_view3d.h which isn't so nice) - move projection functions from view3d_view.c --> view3d_project.c (view3d_view was becoming a mishmash of utility functions and operators). - some some cmake includes as system-includes.	2012-10-17 04:13:03 +00:00
Sergey Sharybin	3b88a29abf	Cycles: progressive refine option Just makes progressive refine :) This means the whole image would be refined gradually using as much threads as it's set in performance settings. Having enough tiles is required to have this option working as it's expected. Technically it's implemented by repeatedly computing next sample for all the tiles before switching to next sample. This works around 7-12% slower than regular tile-based rendering, so use this option only if you really need it. This commit also fixes progressive update of image when Save Buffers option is enabled. And one more thing this commit fixes is handling display buffer with Save Buffers option enabled. If this option is enabled image buffer wouldn't have neither byte nor float buffer until image is fully rendered which could backfire in missing image while rendering in cases color management cache became full. This issue solved by allocating byte buffer for image buffer from tile update callback. Patch was reviewed by Brecht. He also made some minor edits to original version to patch. Thanks, man!	2012-10-13 12:38:32 +00:00
Campbell Barton	b0c7c8756f	code cleanup: cycles now uses system includes for boost/oiio.. etc, so we dont get warnings from system headers.	2012-09-20 09:04:43 +00:00
Lukas Toenne	efaf512406	Revert r50528: "Performance fix for Cycles: Don't wait in the main UI thread when resetting devices." This commit leads to random freezes in Cycles rendering: https://projects.blender.org/tracker/index.php?func=detail&aid=32545&group_id=9&atid=498 The goal of this commit was to remove UI lag for OSL, but since that is not officially supported yet, better revert it until a proper fix can be implemented in 2.65.	2012-09-17 12:07:06 +00:00
Brecht Van Lommel	3d38ad1b17	Attempted fix for #32415 : tighten up cycles opencl initialization checks to try to avoid crashes. Don't think these should be needed but maybe it helps.	2012-09-12 11:25:47 +00:00
Lukas Toenne	31ed71cb6b	Performance fix for Cycles: Don't wait in the main UI thread when resetting devices. When the scene is updated Cycles resets the renderer device, cancelling all existing tasks. The main thread would wait for all running tasks to finish before continuing. This is ok when tasks can actually cancel in a timely fashion. For OSL however, this does not work, since the OSL shader group optimization takes quite a bit of time and can not be easily be cancelled once running (on my crappy machine in full debug mode: ~0.12 seconds for simple node trees). This would lead to very laggy UI behavior and make it difficult to accurately control elements such as sliders. This patch removes the wait condition from the device->task_cancel method. Instead it just sets the do_cancel flag and returns. To avoid backlog in the task pool of the device it will return early from the BlenderSession::sync function while the reset is going on (tested in Session::resetting). Once all existing tasks have finished the do_cancel flag is finally cleared again (checked in TaskPool::num_decrease). Care has to be taken to avoid race conditions on the do_cancel flag, since it can now be modified outside the TaskPool::cancel function itself. For this purpose the scope of the TaskPool::num_mutex locks has been extended, in most cases the mutex is now locked by the TaskPool itself before calling TaskScheduler methods, instead of only locking inside the num_increase/num_decrease functions themselves. The only occurrence of a lock outside of the TaskPool methods is in TaskScheduler::thread_run. This patch is most useful in combination with the OSL renderer mode, so it can probably wait until after the 2.64 release. SVM tasks tend to be cancelled quickly, so the effect is less noticeable.	2012-09-11 11:41:51 +00:00
Brecht Van Lommel	adea12cb01	Cycles: merge of changes from tomato branch. Regular rendering now works tiled, and supports save buffers to save memory during render and cache render results. Brick texture node by Thomas. http://wiki.blender.org/index.php/Doc:2.6/Manual/Render/Cycles/Nodes/Textures#Brick_Texture Image texture Blended Box Mapping. http://wiki.blender.org/index.php/Doc:2.6/Manual/Render/Cycles/Nodes/Textures#Image_Texture http://mango.blender.org/production/blended_box/ Various bug fixes by Sergey and Campbell. * Fix for reading freed memory in some node setups. * Fix incorrect memory read when synchronizing mesh motion. * Fix crash appearing when direct light usage is different on different layers. * Fix for vector pass gives wrong result in some circumstances. * Fix for wrong resolution used for rendering Render Layer node. * Option to cancel rendering when doing initial synchronization. * No more texture limit when using CPU render. * Many fixes for new tiled rendering.	2012-09-04 13:29:07 +00:00
Thomas Dinges	2fcd6827bf	Cycles: * Removed outdated OpenCL comments, kernel features are defined in kernel_types.h now.	2012-08-01 14:56:15 +00:00
Campbell Barton	c6cffe98fa	code cleanup: removed/renamed shadow & duplicate variable definitions.	2012-06-09 18:20:40 +00:00
Campbell Barton	0fbb6bff27	style cleanup: block comments	2012-06-09 17:22:52 +00:00
Brecht Van Lommel	131de4352b	Cycles: fixes to make CUDA 4.2 work, compiling gave errors in shadows and other places, was mainly due to instancing not working, but also found issues in procedural textures. The problem was with --use_fast_math, this seems to now have way lower precision for some operations. Disabled this flag and selectively use fast math functions. Did not find performance regression on GTX 460 after doing this.	2012-05-28 19:21:13 +00:00
Campbell Barton	857dedbc58	style cleanup	2012-05-27 00:36:50 +00:00
Brecht Van Lommel	dd9c1b7fbf	Cycles: OpenCL image texture support, fix an attribute node issue and refactor feature enabling #defines a bit.	2012-05-13 12:32:44 +00:00
Brecht Van Lommel	8103381ded	Cycles: threading optimizations * Multithreaded image loading, each thread can load a separate image. * Better multithreading for multiple instanced meshes, different threads can now build BVH's for different meshes, rather than all cooperating on the same mesh. Especially noticeable for dynamic BVH building for the viewport, gave about 2x faster build on 8 core in fairly complex scene with many objects. * The main thread waiting for worker threads can now also work itself, so (num_cores + 1) threads will be working, this supposedly gives better performance on some operating systems, but did not measure performance for this very detailed yet.	2012-05-05 19:44:33 +00:00
Brecht Van Lommel	a899ce19d0	Cycles: tweak ATI OpenGL/CUDA fix more with extra error check.	2012-05-04 08:00:58 +00:00
Brecht Van Lommel	0fcf17fc72	Possible fix for #31054 : cycles viewport rendering not working with CUDA for computation and ATI card for OpenGL.	2012-05-03 23:39:42 +00:00
Brecht Van Lommel	07b2241fb1	Cycles: merging features from tomato branch. === BVH build time optimizations === * BVH building was multithreaded. Not all building is multithreaded, packing and the initial bounding/splitting is still single threaded, but recursive splitting is, which was the main bottleneck. * Object splitting now uses binning rather than sorting of all elements, using code from the Embree raytracer from Intel. http://software.intel.com/en-us/articles/embree-photo-realistic-ray-tracing-kernels/ * Other small changes to avoid allocations, pack memory more tightly, avoid some unnecessary operations, ... These optimizations do not work yet when Spatial Splits are enabled, for that more work is needed. There's also other optimizations still needed, in particular for the case of many low poly objects, the packing step and node memory allocation. BVH raytracing time should remain about the same, but BVH build time should be significantly reduced, test here show speedup of about 5x to 10x on a dual core and 5x to 25x on an 8-core machine, depending on the scene. === Threads === Centralized task scheduler for multithreading, which is basically the CPU device threading code wrapped into something reusable. Basic idea is that there is a single TaskScheduler that keeps a pool of threads, one for each core. Other places in the code can then create a TaskPool that they can drop Tasks in to be executed by the scheduler, and wait for them to complete or cancel them early. === Normal ==== Added a Normal output to the texture coordinate node. This currently gives the object space normal, which is the same under object animation. In the future this might become a "generated" normal so it's also stable for deforming objects, but for now it's already useful for non-deforming objects. === Render Layers === Per render layer Samples control, leaving it to 0 will use the common scene setting. Environment pass will now render environment even if film is set to transparent. Exclude Layers" added. Scene layers (all object that influence the render, directly or indirectly) are shared between all render layers. However sometimes it's useful to leave out some object influence for a particular render layer. That's what this option allows you to do. === Filter Glossy === When using a value higher than 0.0, this will blur glossy reflections after blurry bounces, to reduce noise at the cost of accuracy. 1.0 is a good starting value to tweak. Some light paths have a low probability of being found while contributing much light to the pixel. As a result these light paths will be found in some pixels and not in others, causing fireflies. An example of such a difficult path might be a small light that is causing a small specular highlight on a sharp glossy material, which we are seeing through a rough glossy material. With path tracing it is difficult to find the specular highlight, but if we increase the roughness on the material the highlight gets bigger and softer, and so easier to find. Often this blurring will be hardly noticeable, because we are seeing it through a blurry material anyway, but there are also cases where this will lead to a loss of detail in lighting.	2012-04-28 08:53:59 +00:00
Thomas Dinges	ed47be3bf2	Cycles/OpenCL: * Reverted the general activation of __KERNEL_SHADING__. Better to handle this in the device file. This way each platform gets specifically what it is capable of atm. * Nvidia has Shading + Multi Closure * AMD (Apple) has only Clay Render * AMD (non Apple) has Basic Shading	2012-04-09 17:44:33 +00:00
Thomas Dinges	d024238fb2	Cycles / OpenCL: * Enable __KERNEL_SHADING__ per default for OpenCL. This enables basic shading (color, emission, textures...) for AMD cards. You need the latest AMD catalyst driver in order to have this work.	2012-04-05 16:19:51 +00:00
Campbell Barton	6ca7d82932	code cleanup: white space, spelling & ';;' end of lines.	2012-02-25 16:04:03 +00:00
Brecht Van Lommel	f4bb31f26b	Cycles: tweak for AMD opencl compile of advanced shading, from Daniel Genrich, still does not work correct but should compile if you have enough memory.	2012-02-24 15:53:19 +00:00
Brecht Van Lommel	0cf38faea4	Fix #30140 : cycles multi GPU rendering with one device supporting full shading and the other not can't work correct, disabled that now.	2012-02-23 20:27:17 +00:00
Brecht Van Lommel	dc181ea7e4	Fix: cycles crash with multiple OpenCL platforms installed, tracked down by Sergey.	2012-02-20 14:19:34 +00:00
Brecht Van Lommel	b023665551	Cycles: another fix for CUDA render passes, needed to align float4 passes.	2012-01-27 13:58:32 +00:00
Brecht Van Lommel	803286dde8	Cycles: render passes for CUDA cards with compute model >= 2.x.	2012-01-26 19:07:01 +00:00
Brecht Van Lommel	f99343d3b8	Cycles: Render Passes Currently supported passes: * Combined, Z, Normal, Object Index, Material Index, Emission, Environment, Diffuse/Glossy/Transmission x Direct/Indirect/Color Not supported yet: * UV, Vector, Mist Only enabled for CPU devices at the moment, will do GPU tweaks tommorrow, also for environment importance sampling. Documentation: http://wiki.blender.org/index.php/Doc:2.6/Manual/Render/Cycles/Passes	2012-01-25 17:23:52 +00:00
Brecht Van Lommel	5873301257	Sample as Lamp option for world shaders, to enable multiple importance sampling. By default lighting from the world is computed solely with indirect light sampling. However for more complex environment maps this can be too noisy, as sampling the BSDF may not easily find the highlights in the environment map image. By enabling this option, the world background will be sampled as a lamp, with lighter parts automatically given more samples. Map Resolution specifies the size of the importance map (res x res). Before rendering starts, an importance map is generated by "baking" a grayscale image from the world shader. This will then be used to determine which parts of the background are light and so should receive more samples than darker parts. Higher resolutions will result in more accurate sampling but take more setup time and memory. Patch by Mike Farnsworth, thanks!	2012-01-20 17:49:17 +00:00
Brecht Van Lommel	778774ba16	Fix: cycles CPU device not being used when it should be on some multi-GPU configurations.	2012-01-11 13:18:06 +00:00
Brecht Van Lommel	d7932ceea8	Cycles: multi GPU rendering support. The rendering device is now set in User Preferences > System, where you can choose between OpenCL/CUDA and devices. Per scene you can then still choose to use CPU or GPU rendering. Load balancing still needs to be improved, now it just splits the entire render in two, that will be done in a separate commit.	2012-01-09 16:58:01 +00:00
Brecht Van Lommel	049ab98469	Cycles: device code refactoring, no functional changes.	2012-01-04 18:06:32 +00:00
Brecht Van Lommel	b5595298d3	Cycles code refactoring: change displace kernel into more generic shader evaluate kernel, added background shader evaluate.	2011-12-31 15:18:13 +00:00
Thomas Dinges	2d1de2e78d	Cycles/CUDA: * Rename shader model to compute capability in error messages.	2011-12-20 18:59:10 +00:00
Brecht Van Lommel	690de79580	Cycles: some tweaks for apple opencl with ATI cards, to get it working up to the level of ambient occlusion render, shaders still fail. Fixes found with much help from Jens and Dalai.	2011-12-20 17:36:56 +00:00
Brecht Van Lommel	72d2d05770	Cycles: border rendering support, includes some refactoring in how pixels are accessed on devices.	2011-12-20 12:25:37 +00:00
Miika Hamalainen	5466befb38	Fix cycles compile for win32.	2011-12-13 10:17:17 +00:00
Brecht Van Lommel	9e01abf777	Cycles: require Experimental to be set to enable CUDA on cards with shader model lower than 1.3, since we're not officially supporting these. We're already not providing CUDA binaries for these, so better make it clear when compiling from source too.	2011-12-12 22:51:35 +00:00
Brecht Van Lommel	45de380771	Cycles * Compile all of cycles with -ffast-math again * Add scons compilation of cuda binaries, tested on mac/linux. * Add UI option for supported/experimental features, to make it more clear what is supported, opencl/subdivision is experimental. * Remove cycles xml exporter, was just for testing.	2011-12-01 16:33:21 +00:00
Brecht Van Lommel	086e4ed825	Cycles: improve error reporting for opencl and cuda, showing error messages in viewport instead of only console.	2011-11-22 20:49:33 +00:00
Brecht Van Lommel	eb2baf9abc	Fix #29274 : problem compiling cycles opencl kernel from directory with spaces. Some drivers don't support passing include paths with spaces in them, nor does the opencl spec specify anything about how to quote/escape such paths, so for now we just resolved #includes ourselves. Alternative would have been to use c preprocessor, but this also resolves all #ifdefs, which we do not want.	2011-11-22 16:38:58 +00:00
Brecht Van Lommel	47853bf6f6	Cycles: OpenCL tweaks * Reduce kernel arguments size, helps compile for apple nvidia. * Fix use of unitialized variable in displace kernel. * Use build flags in opencl kernel md5 hash. * Reorganize code for kernel feature #defines a bit.	2011-11-22 13:15:19 +00:00
Brecht Van Lommel	db8024f4b5	Fix #29259 : cycles issues on certain processors. Now two versions of the kernel are compiled, one SSE optimized and the other not, and it will choose between them at runtime.	2011-11-15 15:13:38 +00:00
Thomas Dinges	880225db77	OpenCL/Nvidia: * Enable OpenCL Full Shading on NVIDIA cards. Notes: It makes not much sense to use OpenCL on a nVidia card (as it is slower compared to CUDA), but as OpenCL comes without dependencies, it's an good alternative if you don't want to install the CUDA toolkit or the build comes without CUDA kernels.	2011-11-12 22:22:00 +00:00
Brecht Van Lommel	c42772fc95	Cycles: * Add back option to bundle CUDA kernel binaries with builds. * Disable runtime CUDA kernel compilation on Windows, couldn't get this working, since it seems to depend on visual studio being installed, even though for this particular case it shouldn't be needed. CMake only at the moment. * Runtime compilation on linux/mac should now work if nvcc is not installed in the default location, but available in PATH.	2011-11-10 12:52:17 +00:00

1 2

82 Commits