History log of /external/libjpeg-turbo/simd/jsimd_arm_neon.S
Revision Date Author Comments
6eb7d3798b5a79347c62825fc4c16f7ce673bdd0 16-Oct-2016 Alex Naidis <alex.naidis@linux.com> libjpeg-turbo: Upgrade to 1.5.1

The changes from 1.4.2 to 1.5.1 include
a large number of fixes and major performance improvements.
Highlights include a full ARM 64-bit (ARMv8) NEON SIMD
implementation, which improves compression of full-color JPEGs
by about 2-2.5x on average on Cortex-A53 and Cortex-A57 cores.
SIMD acceleration for Huffman encoding on NEON-capable
ARM 32-bit and 64-bit platforms was also added.
Performance on x86/x86_64 was improved as well by
adding better-optimized SSE2 routines.

For the full changelog, please see the ChangeLog.md
file.

Partial decoding optimizations, the security fix
to address b/27494207, and the fix for the AARCH64
conformance issue are present in the release.
The README.android file was edited to reflect this.

The configuration files were regenerated
and all Android specific changes were applied.

BUG:28268702

Change-Id: I538291d894df1da01d3f733771647df1fb61ec42
Signed-off-by: Alex Naidis <alex.naidis@linux.com>
/external/libjpeg-turbo/simd/jsimd_arm_neon.S
a6efae148889d42d60c2d2294cda7f03c6fbb490 25-Aug-2014 DRC <dcommander@users.sourceforge.net> Reformat code per Siarhei's original patch (to clearly indicate that the offset instructions are completely independent) and add Siarhei as an individual author (he no longer works for Nokia.)


git-svn-id: svn+ssh://svn.code.sf.net/p/libjpeg-turbo/code/trunk@1388 632fc199-4ca6-4c93-a231-07263d6284db
/external/libjpeg-turbo/simd/jsimd_arm_neon.S
d729f4da9c86b7212912a7d59e49d061d0e61d5f 23-Aug-2014 DRC <dcommander@users.sourceforge.net> ARM NEON SIMD support for YCC-to-RGB565 conversion, and optimizations to the existing YCC-to-RGB conversion code:

-----

https://github.com/ssvb/libjpeg-turbo/commit/aee36252be20054afce371a92406fc66ba6627b5.patch

From aee36252be20054afce371a92406fc66ba6627b5 Mon Sep 17 00:00:00 2001
From: Siarhei Siamashka <siarhei.siamashka@gmail.com>
Date: Wed, 13 Aug 2014 03:50:22 +0300
Subject: [PATCH] ARM: Faster NEON yuv->rgb conversion for Krait and Cortex-A15

The older code was developed and tested only on ARM Cortex-A8 and ARM Cortex-A9.
Tuning it for newer ARM processors can introduce some speed-up (up to 20%).

The performance of the inner loop (conversion of 8 pixels) improves from
~27 cycles down to ~22 cycles on Qualcomm Krait 300, and from ~20 cycles
down to ~18 cycles on ARM Cortex-A15.

The performance remains exactly the same on ARM Cortex-A7 (~58 cycles),
ARM Cortex-A8 (~25 cycles) and ARM Cortex-A9 (~30 cycles) processors.

Also use larger indentation in the source code for separating two independent
instruction streams.

-----

https://github.com/ssvb/libjpeg-turbo/commit/a5efdbf22ce9c1acd4b14a353cec863c2c57557e.patch

From a5efdbf22ce9c1acd4b14a353cec863c2c57557e Mon Sep 17 00:00:00 2001
From: Siarhei Siamashka <siarhei.siamashka@gmail.com>
Date: Wed, 13 Aug 2014 07:23:09 +0300
Subject: [PATCH] ARM: NEON optimized yuv->rgb565 conversion

The performance of the inner loop (conversion of 8 pixels):
* ARM Cortex-A7: ~55 cycles
* ARM Cortex-A8: ~28 cycles
* ARM Cortex-A9: ~32 cycles
* ARM Cortex-A15: ~20 cycles
* Qualcomm Krait: ~24 cycles

Based on the Linaro rgb565 patch from
https://sourceforge.net/p/libjpeg-turbo/patches/24/
but implements better instruction scheduling.


git-svn-id: svn+ssh://svn.code.sf.net/p/libjpeg-turbo/code/trunk@1385 632fc199-4ca6-4c93-a231-07263d6284db
/external/libjpeg-turbo/simd/jsimd_arm_neon.S
2e2ce5a1da365fd1dee9cf1f3d196df058ea54db 22-Aug-2014 DRC <dcommander@users.sourceforge.net> .func/.endfunc are only necessary when generating STABS debug info, which basically went out of style with parachute pants and Rick Astley. At any rate, none of the platforms for which we're building the ARM code use it (DWARF is the common format these days), and the .func/.endfunc directives cause the clang integrated assembler to fail (http://llvm.org/bugs/show_bug.cgi?id=20424).


git-svn-id: svn+ssh://svn.code.sf.net/p/libjpeg-turbo/code/trunk@1375 632fc199-4ca6-4c93-a231-07263d6284db
/external/libjpeg-turbo/simd/jsimd_arm_neon.S
3e00f03aea551192233e143bbc63c435e09a3afe 05-Feb-2014 DRC <dcommander@users.sourceforge.net> Formatting tweaks


git-svn-id: svn+ssh://svn.code.sf.net/p/libjpeg-turbo/code/branches/1.3.x@1107 632fc199-4ca6-4c93-a231-07263d6284db
/external/libjpeg-turbo/simd/jsimd_arm_neon.S
316617faf4a9c5f00bf76f4a0e9c9864d65ec97f 13-Jun-2012 DRC <dcommander@users.sourceforge.net> Accelerated 4:2:2 upsampling routine for ARM (improves performance ~20-30% when decompressing 4:2:2 JPEGs using fancy upsampling)


git-svn-id: svn+ssh://svn.code.sf.net/p/libjpeg-turbo/code/branches/1.2.x@837 632fc199-4ca6-4c93-a231-07263d6284db
/external/libjpeg-turbo/simd/jsimd_arm_neon.S
b071f01cbb989139b2f4dd68092839c4acbfa46e 06-Sep-2011 DRC <dcommander@users.sourceforge.net> Update Nokia contact info


git-svn-id: svn+ssh://svn.code.sf.net/p/libjpeg-turbo/code/trunk@694 632fc199-4ca6-4c93-a231-07263d6284db
/external/libjpeg-turbo/simd/jsimd_arm_neon.S
ad6955d46aa329edda9ae87279751f65408a667b 06-Sep-2011 DRC <dcommander@users.sourceforge.net> Improve performance of IFAST iDCT by changing the order of transpose and descale steps


git-svn-id: svn+ssh://svn.code.sf.net/p/libjpeg-turbo/code/trunk@693 632fc199-4ca6-4c93-a231-07263d6284db
/external/libjpeg-turbo/simd/jsimd_arm_neon.S
5129e3960fbb7cb0a5348bdd542c07ad6c57ab35 06-Sep-2011 DRC <dcommander@users.sourceforge.net> Make ARM ISLOW iDCT faster on typical cases, and eliminate the possibility of 16-bit overflows when handling arbitrary coefficients.


git-svn-id: svn+ssh://svn.code.sf.net/p/libjpeg-turbo/code/trunk@692 632fc199-4ca6-4c93-a231-07263d6284db
/external/libjpeg-turbo/simd/jsimd_arm_neon.S
98a44fe07b2f9694f4f74c320744613b9624f323 25-Aug-2011 DRC <dcommander@users.sourceforge.net> Improve the performance of YCbCr to RGB conversion on ARM


git-svn-id: svn+ssh://svn.code.sf.net/p/libjpeg-turbo/code/trunk@691 632fc199-4ca6-4c93-a231-07263d6284db
/external/libjpeg-turbo/simd/jsimd_arm_neon.S
ce4e3e8690faf320437e8d128287578738c74c8b 22-Aug-2011 DRC <dcommander@users.sourceforge.net> NEON-accelerated slow integer inverse DCT


git-svn-id: svn+ssh://svn.code.sf.net/p/libjpeg-turbo/code/trunk@690 632fc199-4ca6-4c93-a231-07263d6284db
/external/libjpeg-turbo/simd/jsimd_arm_neon.S
82bd52196d0ee3eda47bfbb3da8fb886739484bf 17-Aug-2011 DRC <dcommander@users.sourceforge.net> NEON-accelerated quantization


git-svn-id: svn+ssh://svn.code.sf.net/p/libjpeg-turbo/code/trunk@689 632fc199-4ca6-4c93-a231-07263d6284db
/external/libjpeg-turbo/simd/jsimd_arm_neon.S
4b024a62b2e22248345b9934052db340ee58e0f6 15-Aug-2011 DRC <dcommander@users.sourceforge.net> Improve performance of ARM NEON IFAST iDCT


git-svn-id: svn+ssh://svn.code.sf.net/p/libjpeg-turbo/code/trunk@686 632fc199-4ca6-4c93-a231-07263d6284db
/external/libjpeg-turbo/simd/jsimd_arm_neon.S
7a9376c11475d381d20b1a002ec9207f694f1b23 12-Aug-2011 DRC <dcommander@users.sourceforge.net> ARM NEON-accelerated RGB-to-YCbCr conversion


git-svn-id: svn+ssh://svn.code.sf.net/p/libjpeg-turbo/code/trunk@682 632fc199-4ca6-4c93-a231-07263d6284db
/external/libjpeg-turbo/simd/jsimd_arm_neon.S
b740054f3c5a611e6685c911b63c216823a66c23 11-Aug-2011 DRC <dcommander@users.sourceforge.net> Support for accelerated forward DCT using ARM NEON instructions


git-svn-id: svn+ssh://svn.code.sf.net/p/libjpeg-turbo/code/trunk@678 632fc199-4ca6-4c93-a231-07263d6284db
/external/libjpeg-turbo/simd/jsimd_arm_neon.S
8c60d22ff51486afacf772b6f6b8b44630ffbff8 17-Jun-2011 DRC <dcommander@users.sourceforge.net> NEON-optimized 2x2 and 4x4 scaled iDCTs


git-svn-id: svn+ssh://svn.code.sf.net/p/libjpeg-turbo/code/trunk@662 632fc199-4ca6-4c93-a231-07263d6284db
/external/libjpeg-turbo/simd/jsimd_arm_neon.S
4346f91fcb3b576fb037cef7dc94eac2e5a06250 15-Jun-2011 DRC <dcommander@users.sourceforge.net> iOS ARM support


git-svn-id: svn+ssh://svn.code.sf.net/p/libjpeg-turbo/code/trunk@659 632fc199-4ca6-4c93-a231-07263d6284db
/external/libjpeg-turbo/simd/jsimd_arm_neon.S
321e0686011df0973c8237e2a6cc374e7146f93e 03-May-2011 DRC <dcommander@users.sourceforge.net> ARM NEON support


git-svn-id: svn+ssh://svn.code.sf.net/p/libjpeg-turbo/code/trunk@607 632fc199-4ca6-4c93-a231-07263d6284db
/external/libjpeg-turbo/simd/jsimd_arm_neon.S