Sergey Sharybin a87fb34eda Use advantage of SSE2 instructions in gaussian blur node
This gives a speedup of around 30% for the gaussian blur node.

The implementation inside the node itself is pretty much
straightforward, but it needed some additional pieces:

- Aligned malloc. It's needed to load data into SSE registers
  faster. It is based on aligned_malloc() from Libmv, with
  some additional trickery to support arbitrary
  alignment (this magic is needed because of MemHead).

  In practice only 16-byte alignment is supported because
  OSX lacks an aligned malloc with arbitrary alignment.
  Not a big deal for now because we only need 16-byte
  alignment at the moment. Could be tweaked further
  later.

- Memory buffers in the compositor are now aligned to 16 bytes.
  This should be harmless for non-SSE code paths too.
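
As an illustration of the aligned malloc idea above, here is a minimal sketch in C. It is not the code from this commit: the names sketch_aligned_malloc() and sketch_aligned_free() are hypothetical, and storing a back-pointer just before the aligned block is an assumed, generic technique for getting arbitrary power-of-two alignment when an allocator puts its own header (such as MemHead) in front of the data.

```c
#include <assert.h>
#include <stdint.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical helper (not Blender's API): allocate `size` bytes whose
 * address is a multiple of `alignment`, a nonzero power of two. */
static void *sketch_aligned_malloc(size_t size, size_t alignment)
{
    void *base;
    uintptr_t start, aligned;
    size_t extra;

    /* Reject zero and non-power-of-two alignments. */
    if (alignment == 0 || (alignment & (alignment - 1)) != 0) {
        return NULL;
    }

    /* Extra room: worst-case padding plus a slot for the base pointer. */
    extra = (alignment - 1) + sizeof(void *);
    if (size > SIZE_MAX - extra) {
        return NULL;
    }

    base = malloc(size + extra);
    if (base == NULL) {
        return NULL;
    }

    /* Skip the pointer slot, then round up to the requested alignment. */
    start = (uintptr_t)base + sizeof(void *);
    aligned = (start + alignment - 1) & ~(uintptr_t)(alignment - 1);

    /* Remember the pointer malloc() returned, right before the block. */
    memcpy((unsigned char *)aligned - sizeof(void *), &base, sizeof(base));
    return (void *)aligned;
}

/* Hypothetical counterpart: free a block from sketch_aligned_malloc(). */
static void sketch_aligned_free(void *ptr)
{
    void *base;

    if (ptr == NULL) {
        return;
    }
    /* Recover the original malloc() pointer stored before the block. */
    memcpy(&base, (unsigned char *)ptr - sizeof(void *), sizeof(base));
    free(base);
}
```

A buffer obtained with sketch_aligned_malloc(n * sizeof(float), 16) would then meet the 16-byte requirement of aligned SSE loads, and must be released with sketch_aligned_free() rather than free().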

Reviewers: campbellbarton, lukastoenne, jbakker

Reviewed By: campbellbarton

CC: lockal

Differential Revision: https://developer.blender.org/D564
2014-06-14 00:38:07 +06:00
build_files OSX: Set Blender.app and Blenderplayer.app creation and modification date to compiletime, else we would always have the time from checkout here 2014-06-02 14:20:07 +02:00
doc Automatically generate blender.1 man page during build process 2014-06-14 01:19:58 +10:00
extern Add patchset to recent Carve changes 2014-06-11 13:13:36 +06:00
intern Use advantage of SSE2 instructions in gaussian blur node 2014-06-14 00:38:07 +06:00
release UI: Add back ability to select a custom interface font 2014-06-14 02:32:38 +10:00
scons@2d6ebcb239 Basic support for UNC paths on Windows 2014-04-21 17:06:09 +02:00
source Use advantage of SSE2 instructions in gaussian blur node 2014-06-14 00:38:07 +06:00
.arcconfig Use HTTPS protocol for arc 2013-12-24 22:57:27 +06:00
.gitignore .gitignore: Adding CMakeLists.txt.user for QtCreator users. 2014-01-26 17:52:14 -08:00
.gitmodules Initialize git submodules for addons, locales and scons 2013-11-15 12:19:08 +06:00
CMakeLists.txt CMake: quite warnings with 3.0 (without breaking < 3.0 CMakes) 2014-06-11 03:00:57 -03:00
COPYING == docs == 2010-10-13 14:44:22 +00:00
GNUmakefile OSX: fix T40413, automatic threads for compile throw wrong output 2014-05-29 11:15:32 +02:00
SConstruct CMake/SCons: finish removing MSVC2012 references 2014-05-15 16:16:25 +10:00