blender/intern/cycles/device
Martijn Berger 84f9587540 Cuda use streams and async to avoid busywaiting
This is my first stab at this and is based on this IRC converstation:

<mib2berlin> brecht: this is meaning as reminder only, I know you have other things to do > http://openvidia.sourceforge.net/index.php/Optimization_Notes#avoiding_busy_waits
<brecht> mib2berlin: thanks, bookmarked

only tested on Ubuntu 14.04 / cuda 5.0 but ill do some more testing tomorrow.

Also unsure about the placement and the lifetime of the stream and the event. But creating / deleting these seems to incur a non trivial cost.

Reviewers: brecht

Reviewed By: brecht

CC: mib2berlin, dingto

Differential Revision: https://developer.blender.org/D262
2014-01-28 18:40:08 +01:00
..
CMakeLists.txt Cycles: Build SSE41 kernel per default, remove build option. This hopefully also fixes some compile errors on various systems. 2014-01-14 22:04:32 +01:00
device_cpu.cpp Cycles: Add an AVX kernel for CPU rendering. 2014-01-16 17:04:11 +01:00
device_cuda.cpp Cuda use streams and async to avoid busywaiting 2014-01-28 18:40:08 +01:00
device_intern.h Cycles: network render code updated for latest changes and improved 2013-12-07 12:26:58 +01:00
device_memory.h Code cleanup: move half float functions to separate header file. 2014-01-15 15:29:22 +01:00
device_multi.cpp Cycles: network render code updated for latest changes and improved 2013-12-07 12:26:58 +01:00
device_network.cpp Cycles: network render code updated for latest changes and improved 2013-12-07 12:26:58 +01:00
device_network.h Cycles: network render code updated for latest changes and improved 2013-12-07 12:26:58 +01:00
device_opencl.cpp Cycles / OpenCL: Let the OpenCL runtime determine its optimal work-group size automatically, by passing a NULL pointer here. 2013-12-24 20:20:57 +01:00
device_task.cpp Cycles: viewport render now takes scene color management settings into account, 2013-08-30 23:49:38 +00:00
device_task.h Cycles: viewport render now takes scene color management settings into account, 2013-08-30 23:49:38 +00:00
device.cpp Code cleanup: move half float functions to separate header file. 2014-01-15 15:29:22 +01:00
device.h Cycles: network render code updated for latest changes and improved 2013-12-07 12:26:58 +01:00