History log of /external/libjpeg-turbo/simd/jfdctint-sse2-64.asm
Revision Date Author Comments
55e328ecb83a767d88203930e84e44bf6cc883f5 22-Aug-2014 DRC <dcommander@users.sourceforge.net> Revert r1335 and r1336. It was a valiant effort, but on Windows, xmm8-xmm15 are non-volatile, and the overhead of pushing them onto the stack at the beginning of each function and popping them at the end was causing worse performance (in the neighborhood of 3-5%) than just using the work areas and limiting the register usage to xmm0-xmm7. Best to leave the SSE2 code alone. We can optimize the register usage for AVX2, once that port takes place.


git-svn-id: svn+ssh://svn.code.sf.net/p/libjpeg-turbo/code/trunk@1382 632fc199-4ca6-4c93-a231-07263d6284db
8a74848a2374385c20ea7197b0821d2257fc52c8 10-Aug-2014 DRC <dcommander@users.sourceforge.net> Oops. The Windows version of collect_args/uncollect_args uses rsp, so we still need the rsp prologue/epilogue, despite the fact that we aren't using the stack as a work area. This fixes a segfault on Windows caused by r1335.


git-svn-id: svn+ssh://svn.code.sf.net/p/libjpeg-turbo/code/trunk@1336 632fc199-4ca6-4c93-a231-07263d6284db
a8ab3424c70595f8815aa614253b034170817553 09-Aug-2014 DRC <dcommander@users.sourceforge.net> Attempt to improve performance by refactoring the compression-side color conversion and DCT algorithms so that they take full advantage of the additional registers available with 64-bit SSE2. This produces a somewhat yawn-worthy speedup of 2-3%, but at least the code is a lot more readable now.


git-svn-id: svn+ssh://svn.code.sf.net/p/libjpeg-turbo/code/trunk@1335 632fc199-4ca6-4c93-a231-07263d6284db
24e92e9fc0e3d3c0b3107d5dfcb49649e3b67a49 10-May-2014 DRC <dcommander@users.sourceforge.net> Using subdirectories unfortunately opened up a can of worms. In order to prevent object name conflicts, it is necessary to use the subdir-objects automake directive, but it simply doesn't work right on some of the versions of automake we still have to support. Another option would be to add a separate Makefile.am file to each subdirectory, but that requires maintaining a completely different set of build rules for each one. Fortunately, however, we're in the 21st century now, so we can use filenames longer than 8.3.


git-svn-id: svn+ssh://svn.code.sf.net/p/libjpeg-turbo/code/trunk@1282 632fc199-4ca6-4c93-a231-07263d6284db