blender

Author	SHA1	Message	Date
Brecht Van Lommel	f75b09e7e6	Cycles: abort rendering when --cycles-device not found Rather than just printing a message and falling back to the CPU. For render farms it's better to avoid a potentially slow render on the CPU if the intent was to render on the GPU. Ref T82193, D9086	2020-10-29 16:01:38 +01:00
Stefan Werner	009971ba7a	Cycles: Separate Embree device for each CPU Device. Before, Cycles was using a shared Embree device across all instances. This could result in crashes when viewport rendering and material preview were using Cycles simultaneously. Fixes issue T80042 Maniphest Tasks: T80042 Differential Revision: https://developer.blender.org/D8772	2020-09-01 21:00:55 +02:00
Kévin Dietrich	c82166ffcd	Cycles: move some Scene related methods out of Session This moves `Session::get_requested_device_features`, `Session::load_kernels`, and `Session::update_scene` out of `Session` and into `Scene`, as mentioned in D8544. Reviewed By: brecht Differential Revision: https://developer.blender.org/D8590	2020-08-18 11:50:37 +02:00
Brecht Van Lommel	93791381fe	Cleanup: reduce hardcoded numbers in denoising neighbor tiles code	2020-07-10 17:10:05 +02:00
Brecht Van Lommel	0a3bde6300	Cycles: add denoising settings to the render properties Enabling render and viewport denoising is now both done from the render properties. View layers still can individually be enabled/disabled for denoising and have their own denoising parameters. Note that the denoising engine also affects how denoising data passes are output even if no denoising happens on the render itself, to make the passes compatible with the engine. This includes internal refactoring for how denoising parameters are passed along, trying to avoid code duplication and unclear naming. Ref T76259	2020-06-24 15:17:36 +02:00
Brecht Van Lommel	207338bb58	Cycles: port curve-ray intersection from Embree for use in Cycles GPU This keeps render results compatible for combined CPU + GPU rendering. Performance and quality of the primitives are quite different than before. There are now two options: * Rounded Ribbon: render hair as flat ribbon with (fake) rounded normals, for fast rendering. Hair curves are subdivided with a fixed number of user specified subdivisions. This gives relatively good results, especially when used with the Principled Hair BSDF and hair viewed from a typical distance. There are artifacts when viewed close up, though this was also the case with all previous primitives (but different ones). * 3D Curve: render hair as 3D curve, for accurate results when viewing hair close up. This automatically subdivides the curve until it is smooth. This gives higher quality than any of the previous primitives, but does come at a performance cost and is somewhat slower than our previous Thick curves. The main problem here is performance. For CPU and OpenCL rendering performance seems usually quite close or better for similar quality results. However for CUDA and Optix, performance of 3D curve intersection is problematic, with e.g. 1.45x longer render time in Koro (though there is no equivalent quality and rounded ribbons seem fine for that scene). Any help or ideas to optimize this are welcome. Ref T73778 Depends on D8012 Maniphest Tasks: T73778 Differential Revision: https://developer.blender.org/D8013	2020-06-22 13:28:01 +02:00
Brecht Van Lommel	e50f1ddc65	Cycles: use TBB for task pools and task scheduler No significant performance improvement is expected, but it means we have a single thread pool throughout Blender. And it should make adding more parallelization in the future easier. After previous refactoring commits this is basically a drop-in replacement. One difference is that the task pool had a mechanism for scheduling tasks to the front of the queue to minimize memory usage. TBB has a smarter algorithm to balance depth-first and breadth-first scheduling of tasks and we assume that removes the need to manually provide hints to the scheduler. Fixes T77533	2020-06-22 13:27:37 +02:00
Patrick Mours	9f7d84b656	Cycles: Add support for P2P memory distribution (e.g. via NVLink) This change modifies the multi-device implementation to support memory distribution across devices, to reduce the overall memory footprint of large scenes and allow scenes to fit entirely into combined GPU memory that previously had to fall back to host memory. Reviewed By: brecht Differential Revision: https://developer.blender.org/D7426	2020-06-08 17:55:49 +02:00
Brecht Van Lommel	53981c7fb6	Cleanup: refactor adaptive sampling to more easily change some parameters No functional changes yet, this is work towards making CPU and GPU results match more closely.	2020-04-07 20:29:48 +02:00
Dalai Felinto	2d1cce8331	Cleanup: `make format` after SortedIncludes change	2020-03-19 09:33:58 +01:00
Patrick Mours	38589de10c	Cycles: Add support for denoising in the viewport The OptiX denoiser can be a great help when rendering in the viewport, since it is really fast and needs few samples to produce convincing results. This patch therefore adds support for using any Cycles denoiser in the viewport also (but only the OptiX one is selectable because the NLM one is too slow to be usable currently). It also adds support for denoising on a different device than rendering (so one can e.g. render with the CPU but denoise with OptiX). Reviewed By: #cycles, brecht Differential Revision: https://developer.blender.org/D6554	2020-02-11 18:03:43 +01:00
Patrick Mours	70a32adfeb	Fix assert in Cycles memory statistics when using OptiX on multiple GPUs The acceleration structure built by OptiX may be different between GPUs, so cannot assume the memory size is the same for all. This fixes that by moving the memory management for all OptiX acceleration structures into the responsibility of each device (was already the case for BLAS previously, now for TLAS too).	2019-11-28 13:57:02 +01:00
Patrick Mours	a2b52dc571	Cycles: add Optix device backend This uses hardware-accelerated raytracing on NVIDIA RTX graphics cards. It is still currently experimental. Most features are supported, but a few are still missing like baking, branched path tracing and using CPU memory. https://wiki.blender.org/wiki/Reference/Release_Notes/2.81/Cycles#NVIDIA_RTX For building with Optix support, the Optix SDK must be installed. See here for build instructions: https://wiki.blender.org/wiki/Building_Blender/CUDA Differential Revision: https://developer.blender.org/D5363	2019-09-13 11:50:11 +02:00
Campbell Barton	e12c08e8d1	ClangFormat: apply to source, most of intern Apply clang format as proposed in T53211. For details on usage and instructions for migrating branches without conflicts, see: https://wiki.blender.org/wiki/Tools/ClangFormat	2019-04-17 06:21:24 +02:00
Brecht Van Lommel	e691929686	Merge branch 'blender2.7'	2019-03-17 12:54:19 +01:00
Brecht Van Lommel	e17f7af0ce	Cleanup: remove Cycles advanced shading features toggle. It's effectively always enabled, only not on some unsupported OpenCL devices. For testing those it's not useful to disable these features. This is replaced by the more fine grained feature toggles that we have now.	2019-03-17 01:58:39 +01:00
Jeroen Bakker	5051e580e4	Merge branch 'blender2.7'	2019-03-15 16:28:33 +01:00
Jeroen Bakker	2f6257fd7f	Cycles/OpenCL: Compile Kernels During Scene Update The main goal of this change is faster startup when using foreground rendering. This patch will build kernels in parallel to the update process of the scene. When these optimized kernels are not available (yet) an AO kernel will be used. These AO kernels are fast to compile (3-7 seconds) and can be reused by all scenes. When the final kernels become available we will switch to these kernels. In background mode the AO kernels will not be used. Some kernels are being used during Scene update (displace, background light). When these kernels are being used the process can halt until these become available. Reviewed By: brecht, #cycles Maniphest Tasks: T61752 Differential Revision: https://developer.blender.org/D4428	2019-03-15 16:18:21 +01:00
Jeroen Bakker	15edae617f	Merge branch 'blender2.7'	2019-02-26 14:07:57 +01:00
Jeroen Bakker	dabe5cd31a	T61971: Compilation Displacement/Background Kernel Displacement and Background kernels are selectively used, but always compiled. This patch will not compile these kernels when they are not needed. Displacement kernel is only used for true displacement. Background kernel is only used when there is a (Cycles)Light of type `LIGHT_BACKGROUND`. Reviewed By: brecht, #cycles Tags: #cycles Maniphest Tasks: T61971 Differential Revision: https://developer.blender.org/D4412	2019-02-26 14:06:25 +01:00
Brecht Van Lommel	f4b1f1f0be	Merge branch 'blender2.7'	2019-01-30 18:36:54 +01:00
Brecht Van Lommel	001414fb2f	Cycles: delay CUDA and OpenCL initialization to avoid driver crashes. We've had many reported crashes on Windows where we suspect there is a corrupted OpenCL driver. The purpose here is to keep Blender generally usable in such cases. Now it always shows None / CUDA / OpenCL in the preferences, and only when selecting one will it reveal if there are any GPUs available. This should avoid crashes when opening the preferences or on startup. Differential Revision: https://developer.blender.org/D4265	2019-01-29 17:00:02 +01:00
Brecht Van Lommel	63c0653170	Merge branch 'master' into blender2.8	2018-11-29 23:54:30 +01:00
Brecht Van Lommel	a8b8da5567	Fix T58183: crash with CPU + GPU rendering after profiling changes. Multi-device was not passing along profiler to the CPU.	2018-11-29 23:43:27 +01:00
Campbell Barton	9893fee4e6	Merge branch 'master' into blender2.8	2018-11-29 12:55:58 +11:00
Lukas Stockner	7fa6f72084	Cycles: Add sample-based runtime profiler that measures time spent in various parts of the CPU kernel This commit adds a sample-based profiler that runs during CPU rendering and collects statistics on time spent in different parts of the kernel (ray intersection, shader evaluation etc.) as well as time spent per material and object. The results are currently not exposed in the user interface or per Python yet, to see the stats on the console pass the "--cycles-print-stats" argument to Cycles (e.g. "./blender -- --cycles-print-stats"). Unfortunately, there is no clear way to extend this functionality to CUDA or OpenCL, so it is CPU-only for now. Reviewers: brecht, sergey, swerner Reviewed By: brecht, swerner Differential Revision: https://developer.blender.org/D3892	2018-11-29 02:45:24 +01:00
Sergey Sharybin	78a6689aea	Merge branch 'master' into blender2.8	2018-11-09 14:34:33 +01:00
Sergey Sharybin	2330cadb0f	Cycles: Cleanup, don't use strict C prototypes Those are more like a legacy of language, which is not needed in C++.	2018-11-09 12:04:41 +01:00
Sergey Sharybin	cb4b5e12ab	Cycles: Cleanup, spacing after preprocessor It is supposed to be two spaces before comment stating which if else/endif statements corresponds to. Was mainly violated in the header guards.	2018-11-09 11:34:54 +01:00
Sergey Sharybin	fc12a736bb	Merge branch 'master' into blender2.8	2018-10-31 11:49:04 +01:00
Sergey Sharybin	e0cc3e9809	Cycles: Fix wrong BVH used when disabling AVX2 in debug settings Mainly useful for debugging. Previously, when AVX2 was disabled in the debug panel but BVH layout was kept on BVH8 nothing was rendered. Needed to make it so supported BVH layout mask for devices is queried in "dynamic", so it is possible to use DebugFlags there.	2018-10-31 11:46:52 +01:00
Campbell Barton	de777ad9e6	Merge branch 'master' into blender2.8	2018-07-06 10:18:52 +02:00
Campbell Barton	1daa20ad9f	Cleanup: strip trailing space for cycles	2018-07-06 10:17:58 +02:00
Campbell Barton	2bc952fdb6	Merge branch 'master' into blender2.8	2018-02-18 22:33:05 +11:00
Thomas Dinges	9e717c0495	Cycles: Remove Fermi texture code. This should be the last Fermi removal commit, unless I missed something. It's been a pleasure Fermi!	2018-02-17 22:56:58 +01:00
Campbell Barton	5376c739f5	Merge branch 'master' into blender2.8	2018-02-06 23:06:23 +11:00
Brecht Van Lommel	ce3e0afe59	Fix T54001: AMD OpenCL fails with certain resolutions, after recent changes. We should actually be using CL_DEVICE_MEM_BASE_ADDR_ALIGN for sub buffers, previous change in this code was incorrect. Renamed the function now to make the specific purpose of this alignment clear, it's not required for data types in general.	2018-02-05 22:19:49 +01:00
Campbell Barton	fc1fd2704a	Merge branch 'master' into blender2.8	2018-01-23 11:45:39 +11:00
Sergey Sharybin	2f79d1c058	Cycles: Replace use_qbvh boolean flag with an enum-based property This way we can introduce other types of BVH, for example, wider ones, without causing too much mess around boolean flags. Thoughts: - Ideally device info should probably return bitflag of what BVH types it supports. It is possible to implement based on simple logic in device/ and mesh.cpp, rest of the changes will stay the same. - Not happy with workarounds in util_debug and duplicated enum in kernel. Maybe enum should be stored in kernel, but then it's kind of weird to include kernel types from utils. Sounds like some cyclic dependency. Reviewers: brecht, maxim_d33 Reviewed By: brecht Differential Revision: https://developer.blender.org/D3011	2018-01-22 17:19:20 +01:00
Dalai Felinto	d9858d5897	Merge remote-tracking branch 'origin/master' into blender2.8	2018-01-19 12:46:23 -02:00
Brecht Van Lommel	0fe41009f0	Fix T53830: Cycles OpenCL debug assert on macOS, This was probably harmless besides some unnecessary memory usage due to aligning allocations too much.	2018-01-19 11:35:07 +01:00
Sergey Sharybin	c99481b632	Merge branch 'master' into blender2.8	2017-11-09 10:59:15 +01:00
Mai Lavelle	087331c495	Cycles: Replace __MAX_CLOSURE__ build option with runtime integrator variable Goal is to reduce OpenCL kernel recompilations. Currently viewport renders are still set to use 64 closures as this seems to be faster and we don't want to cause a performance regression there. Needs to be investigated. Reviewed By: brecht Differential Revision: https://developer.blender.org/D2775	2017-11-09 01:04:06 -05:00
Brecht Van Lommel	7b1d707481	Merge branch 'master' into blender2.8	2017-11-08 00:20:59 +01:00
Brecht Van Lommel	f79f386731	Code refactor: rename subsurface to local traversal, for reuse.	2017-11-07 22:35:12 +01:00
Campbell Barton	d4fe083b35	Merge branch 'master' into blender2.8	2017-11-04 21:45:52 +11:00
Brecht Van Lommel	6ec599c682	Fix T53247: mixed CPU + GPU render wrong texture limits.	2017-11-03 20:32:29 +01:00
Brecht Van Lommel	f5456df095	Merge branch 'master' into blender2.8	2017-10-24 02:05:41 +02:00
Brecht Van Lommel	070a668d04	Code refactor: move more memory allocation logic into device API. * Remove tex_* and pixels_* functions, replace by mem_. Add MEM_TEXTURE and MEM_PIXELS as memory types recognized by devices. * No longer create device_memory and call mem_* directly, always go through device_only_memory, device_vector and device_pixels.	2017-10-24 01:25:19 +02:00
Brecht Van Lommel	7ad9333fad	Code refactor: store device/interp/extension/type in each device_memory.	2017-10-24 01:03:59 +02:00

124 Commits