blender

Author	SHA1	Message	Date
Antony Riakiotakis	4fc3188112	Cycles: Get rid of one more OpenGL matrix manipulation/push/pop.	2015-05-11 16:41:18 +02:00
Antony Riakiotakis	e38f914421	Cycles: use vertex buffers when possible to draw tiles on the screen. Not terribly necessary in this case, since we are just drawing a quad, but makes blender overall more GL 3.x core ready.	2015-05-11 16:28:41 +02:00
Antony Riakiotakis	5588a51c9c	Cycles OpenGL: Don't use full matrix transform when we can just use simple addition.	2015-05-11 13:10:19 +02:00
Sergey Sharybin	3a2c0ccdd0	Cycles: Correction to opencl whitelist check Was using platform as a device id accidentally.	2015-05-10 20:02:06 +05:00
Sergey Sharybin	136d7a4f62	Cycles: Only whitelist AMD GPU devices in the OpenCL section Only those ones are priority for now, all the rest are still testable if CYCLES_OPENCL_TEST or CYCLES_OPENCL_SPLIT_KERNEL_TEST environment variables are set.	2015-05-09 23:40:26 +05:00
George Kyriazis	7f4479da42	Cycles: OpenCL kernel split This commit contains all the work related on the AMD megakernel split work which was mainly done by Varun Sundar, George Kyriazis and Lenny Wang, plus some help from Sergey Sharybin, Martijn Berger, Thomas Dinges and likely someone else which we're forgetting to mention. Currently only AMD cards are enabled for the new split kernel, but it is possible to force split opencl kernel to be used by setting the following environment variable: CYCLES_OPENCL_SPLIT_KERNEL_TEST=1. Not all the features are supported yet, and that being said no motion blur, camera blur, SSS and volumetrics for now. Also transparent shadows are disabled on AMD device because of some compiler bug. This kernel is also only implements regular path tracing and supporting branched one will take a bit. Branched path tracing is exposed to the interface still, which is a bit misleading and will be hidden there soon. More feature will be enabled once they're ported to the split kernel and tested. Neither regular CPU nor CUDA has any difference, they're generating the same exact code, which means no regressions/improvements there. Based on the research paper: https://research.nvidia.com/sites/default/files/publications/laine2013hpg_paper.pdf Here's the documentation: https://docs.google.com/document/d/1LuXW-CV-sVJkQaEGZlMJ86jZ8FmoPfecaMdR-oiWbUY/edit Design discussion of the patch: https://developer.blender.org/T44197 Differential Revision: https://developer.blender.org/D1200	2015-05-09 19:52:40 +05:00
Sergey Sharybin	f680c1b54a	Cycles: Communicate number of closures and nodes feature set to the device This way device can actually make a decision of how it can optimize the kernel in order to make it most efficient.	2015-05-09 19:28:00 +05:00
Sergey Sharybin	b3299bace0	Cycles: Pass requested tile size to the device via device task This is currently unused but crucial for things like calculating amount of device memory required to deal with the tasks. Maybe not really best place to store it, but consider it good enough for now.	2015-05-09 19:09:07 +05:00
Sergey Sharybin	0e4ddaadd4	Cycles: Change the way how we pass requested capabilities to the device Previously we only had experimental flag passed to device's load_kernel() which was all fine. But since we're gonna to have some extra parameters passed there it makes sense to wrap them into a single struct, which will make it easier to pass stuff around.	2015-05-09 19:05:49 +05:00
Sergey Sharybin	35812e65f4	Cycles: Fix compilation error on windows after recent logging changes	2015-04-10 22:35:10 +05:00
Sergey Sharybin	2f5dd83759	Cycles: Add some statistics logging Covers number of entities in the scene (objects, meshes etc), also reports sizes of textures being allocated.	2015-04-10 15:37:49 +05:00
Martijn Berger	f01456aaa4	Optionally use c++11 stuff instead of boost in cycles where possible. We do and continue to depend on boost though Reviewers: dingto, sergey Reviewed By: sergey Subscribers: #cycles Differential Revision: https://developer.blender.org/D1185	2015-03-29 22:12:40 +02:00
Sergey Sharybin	5ff132182d	Cycles: Code cleanup, spaces around keywords This inconsistency drove me totally crazy, it's really confusing when it's inconsistent especially when you work on both Cycles and Blender sides. Shouldn;t cause merge PITA, it's whitespace changes only, Git should be able to merge it nicely.	2015-03-28 00:15:15 +05:00
Sergey Sharybin	585dd26120	Cycles: Code cleanup, prepare for strict C++ flags	2015-03-27 18:23:31 +05:00
Sergey Sharybin	7f406a53c7	Cycles: Cleanup for indentation in device_cpu.cpp Perhaps became broken after rather recent change about which entry point to kernel to use.	2015-02-19 19:05:04 +05:00
Sergey Sharybin	7ea7c2aab2	Cycles: Fix inconsistent command line used for runtime kernel compilation Basically build-time compiled kernels were using --fast-math (which is correct) but run-time compiled did not.	2015-02-02 15:00:21 +05:00
Sergey Sharybin	77e6f2212f	Cycles: Allow paths customization via environment variables This is for development and test environment setup only, not for regular users usage hence no mentioning in the man page needed.	2015-02-02 02:02:10 +05:00
Sergey Sharybin	a922be9270	Cycles: Repot CPU and CUDA capabilities to system info operator For CPU it gives available instructions set (SSE, AVX and so). For GPU CUDA it reports most of the attribute values returned by cuDeviceGetAttribute(). Ideally we need to only use set of those which are driver-specific (so we don't clutter system info with values which we can get from GPU specifications and be sure they stay the same because driver can't affect on them).	2015-01-06 14:13:21 +05:00
Sergey Sharybin	1369bd562c	Cycles: Fix compilation error on AVX platforms with -arch-native Was a conflict in headers between clew and util_optimization.h.	2015-01-03 00:11:28 +05:00
Sergey Sharybin	9e2e408323	Cycles: Add logging to OSL and CUDA initialization/compilation This is what was handy troubleshooting issues in the studio, plus this is exactly the same thing which would be helpful when solving issues with paths to compiled shaders and cubins for standalone repository.	2015-01-01 01:31:08 +05:00
Sergey Sharybin	4497b6ac84	Cycles: Synchronize changes with standalone repository This changes were done in original commit of the standalone Cycles repository and needed here for easier patch synchronization.	2015-01-01 01:31:07 +05:00
Thomas Dinges	ee36e75b85	Cleanup: Fix Cycles Apache header. This was already mixed a bit, but the dot belongs there.	2014-12-25 02:50:24 +01:00
Campbell Barton	c07f6c02b3	Docs: reference the new manual	2014-12-08 11:18:58 +01:00
Thomas Dinges	e3a6f1c152	Cycles: Remove workaround for missing sm_52 kernel, now we require it for Maxwell cards.	2014-12-02 13:45:39 +01:00
Bastien Montagne	c14d34322b	Fix typo breaking compilation with SSE2. Spotted by sybrenstuvel (Sybren Stüvel), thanks!	2014-11-02 23:01:09 +01:00
Thomas Dinges	4ff8744669	Cycles / CUDA: Better fix for missing sm_52 kernel, in case user compiles himself.	2014-10-30 11:42:59 +01:00
Martijn Berger	4b33667b93	Deduplicate some code by using a function pointer to the real kernel This has no performance impact what so ever and is already used in the adaptive sampling patch	2014-10-30 10:23:44 +01:00
Sergey Sharybin	e556670b36	Cycles: Do cuda pointer arithmetic in integers, don't use pointer arithmetic This should hopefully fix https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=765187	2014-10-14 17:54:41 +02:00
Jason Wilkins	8d084e8c8f	Ghost Context Refactor https://developer.blender.org/D643 Separates graphics context creation from window code in Ghost so that they can vary separately.	2014-10-07 15:47:32 -05:00
Sergey Sharybin	cd6129d1ff	Cycles: Workaround dead-slow expf() on 64bit linux Single precision exponent on 64bit linux tends to be order of magnitude slower than double precision version even with single<->double precision conversion. Some feedback in the mailing lists also suggests that logf() is also slow, but this i didn't confirm here in the studio yet. Depending on the shader setup it gives ~3% with the secret agent shot and up to around 15% with the bmw scene here.	2014-10-06 12:36:46 +02:00
Sergey Sharybin	68f2066602	Cycles: Make OpenCL folks happy to use __KERNEL_DEBUG__ Quite straightforward change, the only annoying thing is that we can't use indentation for include directive just because of the way headers inlineing works for OpenCL. Might do smarter job in path_source_replace_includes() but don't want to spend time on this yet.	2014-10-05 16:00:23 +06:00
Sergey Sharybin	0106b94f9d	Cycles: Fix for debug kernel not working with CUDA	2014-10-05 15:31:48 +06:00
Thomas Dinges	a613290775	Cycles / CUDA: Workaround to make sm_52 (Maxwell) cards work. * sm_52 can run a sm_50 kernel, so tell runtime detection to use that until we build a dedicated sm_52 kernel.	2014-10-05 04:13:40 +02:00
Sergey Sharybin	a654512356	Cycles: Implement preliminary test for volume stack update from SSS This adds an AABB collision check for objects with volumes and if there's a collision detected then the object will have SD_OBJECT_INTERSECTS_VOLUME flag. This solves a speed regression introduced by the fix for T39823 by skipping volume stack update in cases no volumes intersects the current SSS object.	2014-10-03 10:52:04 +02:00
Sergey Sharybin	fbed2047c8	Fix wrong track of the memory when doing device vector resize before freeing it This is rather legit case which happens i.e. when having persistent images enabled and session is updating the lookup tables. Now device_memory keeps track of amount of memory being allocated on the device, which makes freeing using the proper allocated size, not the CPU side buffer size.	2014-09-04 17:25:12 +06:00
Thomas Dinges	fb3f32760d	Cycles: Add an experimental CUDA kernel. Now we build 2 .cubins per architecture (e.g. kernel_sm_21.cubin, kernel_experimental_sm_21.cubin). The experimental kernel can be used by switching to the Experimental Feature Set: http://wiki.blender.org/index.php/Doc:2.6/Manual/Render/Cycles/Experimental_Features This enables Subsurface Scattering and Correlated Multi Jitter Sampling on GPU, while keeping the stability and performance of the regular kernel. Differential Revision: https://developer.blender.org/D762 Patch by Sergey and myself. Developer / Builder Note: CUDA Toolkit 6.5 is highly recommended for this, also note that building the experimental kernel requires a lot of system memory (~7-8GB).	2014-08-26 17:02:26 +02:00
Thomas Dinges	603348c56e	Cycles: Drop support for CUDA 5.0 Toolkit, only 6.0 and 6.5 (recommended) are supported now.	2014-08-21 23:35:20 +02:00
Dalai Felinto	8d3cc431d7	Fix T41471 Cycles Bake: Setting small tile size results in wrong bake with stripes rather than the expected noise pattern This problem was introduced in 983cbafd1877f8dbaae60b064a14e27b5b640f18 Basically the issue is that we were not getting a unique index in the baking routine for the RNG (random number generator). Reviewers: sergey Differential Revision: https://developer.blender.org/D749	2014-08-19 11:40:33 +02:00
Dalai Felinto	2c5b6859d9	Revert "Fix T41222 Blender gives weird output when baking (4096*4096) resolution on GPU" This reverts commit a48b372b04421b00644a0660bfdf42229b5ffceb. Leaving only the part that fix device_multi.cpp	2014-08-15 11:27:42 +02:00
Martijn Berger	c020bd2e73	Cycles OpenCL error to string removed in favour of the same function in clew.	2014-08-09 14:27:40 +02:00
Dalai Felinto	a48b372b04	Fix T41222 Blender gives weird output when baking (4096*4096) resolution on GPU In collaboration with Sergey Sharybin. Also thanks to Wolfgang Faehnle (mib2berlin) for help testing the solutions. Reviewers: sergey Differential Revision: https://developer.blender.org/D690	2014-08-05 13:50:50 -03:00
Sergey Sharybin	77b7e1fe9a	Deduplicate CUDA and OpenCL wranglers For now it was mainly about OpenCL wrangler being duplicated between Cycles and Compositor, but with OpenSubdiv work those wranglers were gonna to be duplicated just once again. This commit makes it so Cycles and Compositor uses wranglers from this repositories: - https://github.com/CudaWrangler/cuew - https://github.com/OpenCLWrangler/clew This repositories are based on the wranglers we used before and they'll be likely continued maintaining by us plus some more players in the market. Pretty much straightforward change with some tricks in the CMake/SCons to make this libs being passed to the linker after all other libraries in order to make OpenSubdiv linked against those wranglers in the future. For those who're worrying about Cycles being less standalone, it's not truth, it's rather more flexible now and in the future different wranglers might be used in Cycles. For now it'll just mean those libs would need to be put into Cycles repository together with some other libs from Blender such as mikkspace. This is mainly platform maintenance commit, should not be any changes to the user space. Reviewers: juicyfruit, dingto, campbellbarton Reviewed By: juicyfruit, dingto, campbellbarton Differential Revision: https://developer.blender.org/D707	2014-08-05 13:57:50 +06:00
Campbell Barton	9c3025cd26	Spelling	2014-08-02 16:53:52 +10:00
Martijn Berger	65bf694331	Implement get_split_task_count to make device_network compile again.	2014-07-29 07:40:04 +02:00
Dalai Felinto	fc55c41bba	Cycles Bake: show progress bar during bake Baking progress preview is not possible, in parts due to the way the API was designed. But at least you get to see the progress bar while baking. Reviewers: sergey Differential Revision: https://developer.blender.org/D656	2014-07-25 11:42:53 -03:00
Martijn Berger	bae2b3a688	Switch to Cuda 4.0 style api for kernel invocation. This is a small clean-up that has no functional changes but makes code a bit more readable. Differential revision: https://developer.blender.org/D659 Reviewed by: Sergey Sharybin, Thomas Dinges	2014-07-25 13:33:19 +02:00
Thomas Dinges	9acabc13de	Cleanup: Typo fixes.	2014-07-05 14:25:34 +02:00
Thomas Dinges	5898abe99d	Cycles: Update CUDA error messages, based on Toolkit 6.0. * Removed deprecated erros, and added some new ones, which might help to figure out problems in the future.	2014-07-02 01:50:42 +02:00
Thomas Dinges	4800c52700	Cleanup: Remove unused checks in CUDA device code.	2014-07-02 01:12:13 +02:00
Thomas Dinges	866c7fb6e6	Cycles: Add an AVX2 CPU kernel. This kernel is compiled with AVX2, FMA3, and BMI compiler flags. At the moment only Intel Haswell benefits from this, but future AMD CPUs will have these instructions as well. Makes rendering on Haswell CPUs a few percent faster, only benchmarked with clang on OS X though. Part of my GSoC 2014.	2014-06-13 22:26:20 +02:00

1 2 3 4 5

217 Commits