8b5a0093ee14f1a977c94f2566d5aa2ffebfe5e3 |
|
21-Jan-2015 |
DRC <dcommander@users.sourceforge.net> |
Oops. The MIPS SIMD implementations of h2v1 and h2v2 upsampling were not checking for DSPr2 support, so running 'djpeg -nosmooth' on a non-DSPr2-enabled platform caused an "illegal instruction" error. git-svn-id: svn+ssh://svn.code.sf.net/p/libjpeg-turbo/code/branches/1.4.x@1523 632fc199-4ca6-4c93-a231-07263d6284db
/external/libjpeg-turbo/simd/jsimd_mips.c
|
d729f4da9c86b7212912a7d59e49d061d0e61d5f |
|
23-Aug-2014 |
DRC <dcommander@users.sourceforge.net> |
ARM NEON SIMD support for YCC-to-RGB565 conversion, and optimizations to the existing YCC-to-RGB conversion code: ----- https://github.com/ssvb/libjpeg-turbo/commit/aee36252be20054afce371a92406fc66ba6627b5.patch From aee36252be20054afce371a92406fc66ba6627b5 Mon Sep 17 00:00:00 2001 From: Siarhei Siamashka <siarhei.siamashka@gmail.com> Date: Wed, 13 Aug 2014 03:50:22 +0300 Subject: [PATCH] ARM: Faster NEON yuv->rgb conversion for Krait and Cortex-A15 The older code was developed and tested only on ARM Cortex-A8 and ARM Cortex-A9. Tuning it for newer ARM processors can introduce some speed-up (up to 20%). The performance of the inner loop (conversion of 8 pixels) improves from ~27 cycles down to ~22 cycles on Qualcomm Krait 300, and from ~20 cycles down to ~18 cycles on ARM Cortex-A15. The performance remains exactly the same on ARM Cortex-A7 (~58 cycles), ARM Cortex-A8 (~25 cycles) and ARM Cortex-A9 (~30 cycles) processors. Also use larger indentation in the source code for separating two independent instruction streams. ----- https://github.com/ssvb/libjpeg-turbo/commit/a5efdbf22ce9c1acd4b14a353cec863c2c57557e.patch From a5efdbf22ce9c1acd4b14a353cec863c2c57557e Mon Sep 17 00:00:00 2001 From: Siarhei Siamashka <siarhei.siamashka@gmail.com> Date: Wed, 13 Aug 2014 07:23:09 +0300 Subject: [PATCH] ARM: NEON optimized yuv->rgb565 conversion The performance of the inner loop (conversion of 8 pixels): * ARM Cortex-A7: ~55 cycles * ARM Cortex-A8: ~28 cycles * ARM Cortex-A9: ~32 cycles * ARM Cortex-A15: ~20 cycles * Qualcomm Krait: ~24 cycles Based on the Linaro rgb565 patch from https://sourceforge.net/p/libjpeg-turbo/patches/24/ but implements better instructions scheduling. git-svn-id: svn+ssh://svn.code.sf.net/p/libjpeg-turbo/code/trunk@1385 632fc199-4ca6-4c93-a231-07263d6284db
/external/libjpeg-turbo/simd/jsimd_mips.c
|
495e43426e901bc95c54324050bc3ad8bc128f90 |
|
19-May-2014 |
DRC <dcommander@users.sourceforge.net> |
Allow for building the MIPS DSPr2 extensions if the host is mips-* as well as mipsel-*. The DSPr2 extensions are little endian, so we still have to check that the compiler defines __MIPSEL__ before enabling them. This paves the way for supporting big-endian MIPS, and in the near term, it allows the SIMD extensions to be built with Sourcery CodeBench. git-svn-id: svn+ssh://svn.code.sf.net/p/libjpeg-turbo/code/trunk@1316 632fc199-4ca6-4c93-a231-07263d6284db
/external/libjpeg-turbo/simd/jsimd_mips.c
|
5ef463056ae22f24c3915ba7ab03eefd5bb6fde7 |
|
18-May-2014 |
DRC <dcommander@users.sourceforge.net> |
SIMD-accelerated int upsample routine for MIPS DSPr2 git-svn-id: svn+ssh://svn.code.sf.net/p/libjpeg-turbo/code/trunk@1315 632fc199-4ca6-4c93-a231-07263d6284db
/external/libjpeg-turbo/simd/jsimd_mips.c
|
c728cfd8f236929808cf081232e0702f3382d80b |
|
18-May-2014 |
DRC <dcommander@users.sourceforge.net> |
Fix MIPS build git-svn-id: svn+ssh://svn.code.sf.net/p/libjpeg-turbo/code/trunk@1314 632fc199-4ca6-4c93-a231-07263d6284db
/external/libjpeg-turbo/simd/jsimd_mips.c
|
1419852c42a0b8d8779fa89419cf2824b9eb5866 |
|
15-May-2014 |
DRC <dcommander@users.sourceforge.net> |
Clean up code formatting in the SIMD interface functions git-svn-id: svn+ssh://svn.code.sf.net/p/libjpeg-turbo/code/trunk@1305 632fc199-4ca6-4c93-a231-07263d6284db
/external/libjpeg-turbo/simd/jsimd_mips.c
|
1b3fd7eead3ae2c83f88df43078878f0883aeea1 |
|
15-May-2014 |
DRC <dcommander@users.sourceforge.net> |
SIMD-accelerated NULL convert routine for MIPS DSPr2 git-svn-id: svn+ssh://svn.code.sf.net/p/libjpeg-turbo/code/trunk@1304 632fc199-4ca6-4c93-a231-07263d6284db
/external/libjpeg-turbo/simd/jsimd_mips.c
|
6a61c1e6dc0263148e8e8e1a13da8624cc5a065f |
|
14-May-2014 |
DRC <dcommander@users.sourceforge.net> |
SIMD-accelerated h2v2 smooth downsampling routine for MIPS DSPr2 git-svn-id: svn+ssh://svn.code.sf.net/p/libjpeg-turbo/code/trunk@1301 632fc199-4ca6-4c93-a231-07263d6284db
/external/libjpeg-turbo/simd/jsimd_mips.c
|
b844eaa36016305392ff63ae1ebae32dc893b15b |
|
13-May-2014 |
DRC <dcommander@users.sourceforge.net> |
SIMD-accelerated merged upsampling routines for MIPS DSPr2 git-svn-id: svn+ssh://svn.code.sf.net/p/libjpeg-turbo/code/trunk@1297 632fc199-4ca6-4c93-a231-07263d6284db
/external/libjpeg-turbo/simd/jsimd_mips.c
|
343478622debf6883c562c9718267495628d14f2 |
|
06-May-2014 |
DRC <dcommander@users.sourceforge.net> |
SIMD-accelerated slow integer IDCT routine for MIPS DSPr2 git-svn-id: svn+ssh://svn.code.sf.net/p/libjpeg-turbo/code/trunk@1269 632fc199-4ca6-4c93-a231-07263d6284db
/external/libjpeg-turbo/simd/jsimd_mips.c
|
fff6c23a65f1359b9e80e469f4fa36e10f869288 |
|
12-Oct-2013 |
DRC <dcommander@users.sourceforge.net> |
SIMD-accelerated integer convsamp routine for MIPS DSPr2 git-svn-id: svn+ssh://svn.code.sf.net/p/libjpeg-turbo/code/trunk@1059 632fc199-4ca6-4c93-a231-07263d6284db
/external/libjpeg-turbo/simd/jsimd_mips.c
|
3d727281696a197a12ba35ceb0beb36bd3938db3 |
|
09-Oct-2013 |
DRC <dcommander@users.sourceforge.net> |
SIMD-accelerated floating point quantize and convsamp routines for MIPS DSPr2 git-svn-id: svn+ssh://svn.code.sf.net/p/libjpeg-turbo/code/trunk@1058 632fc199-4ca6-4c93-a231-07263d6284db
/external/libjpeg-turbo/simd/jsimd_mips.c
|
d3131c1b3dfde62bebf7fb52d22c1ce54c8cea8a |
|
08-Oct-2013 |
DRC <dcommander@users.sourceforge.net> |
SIMD-accelerated fast integer inverse DCT routine for MIPS DSPr2 git-svn-id: svn+ssh://svn.code.sf.net/p/libjpeg-turbo/code/trunk@1056 632fc199-4ca6-4c93-a231-07263d6284db
/external/libjpeg-turbo/simd/jsimd_mips.c
|
71e06a7d81a9034c1b8dfce2ac1f04420bb373c5 |
|
08-Oct-2013 |
DRC <dcommander@users.sourceforge.net> |
SIMD-accelerated fast integer forward DCT routine for MIPS DSPr2 git-svn-id: svn+ssh://svn.code.sf.net/p/libjpeg-turbo/code/trunk@1055 632fc199-4ca6-4c93-a231-07263d6284db
/external/libjpeg-turbo/simd/jsimd_mips.c
|
a6b7fbd3521a88305897cbea7db4d0eef9e0ec55 |
|
30-Sep-2013 |
DRC <dcommander@users.sourceforge.net> |
SIMD-accelerated slow integer forward DCT and quantize routines for MIPS DSPr2 git-svn-id: svn+ssh://svn.code.sf.net/p/libjpeg-turbo/code/trunk@1054 632fc199-4ca6-4c93-a231-07263d6284db
/external/libjpeg-turbo/simd/jsimd_mips.c
|
e500591710cd1ee2a8dabb0e291a31448ace7077 |
|
27-Sep-2013 |
DRC <dcommander@users.sourceforge.net> |
SIMD-accelerated 3/4 and 3/2 decompression scaling for MIPS DSPr2 git-svn-id: svn+ssh://svn.code.sf.net/p/libjpeg-turbo/code/trunk@1047 632fc199-4ca6-4c93-a231-07263d6284db
/external/libjpeg-turbo/simd/jsimd_mips.c
|
2ccf4d1a70d6311484f2f8aeb386b2ed747ad95c |
|
27-Sep-2013 |
DRC <dcommander@users.sourceforge.net> |
SIMD-accelerated 1/2 and 1/4 decompression scaling for MIPS DSPr2 git-svn-id: svn+ssh://svn.code.sf.net/p/libjpeg-turbo/code/trunk@1046 632fc199-4ca6-4c93-a231-07263d6284db
/external/libjpeg-turbo/simd/jsimd_mips.c
|
49eaa7572d490471e0417c10a79fea086d956f62 |
|
27-Sep-2013 |
DRC <dcommander@users.sourceforge.net> |
SIMD-optimized RGB-to-grayscale conversion for MIPS DSPr2 git-svn-id: svn+ssh://svn.code.sf.net/p/libjpeg-turbo/code/trunk@1045 632fc199-4ca6-4c93-a231-07263d6284db
/external/libjpeg-turbo/simd/jsimd_mips.c
|
16962c11327f24f272b1ecfcd8daf7374e547eca |
|
27-Jul-2013 |
DRC <dcommander@users.sourceforge.net> |
SIMD support for performing upsampling using MIPS DSPr2 instructions git-svn-id: svn+ssh://svn.code.sf.net/p/libjpeg-turbo/code/trunk@996 632fc199-4ca6-4c93-a231-07263d6284db
/external/libjpeg-turbo/simd/jsimd_mips.c
|
6f2d3c2c97f32100fce098ef8a3dfac1569c796a |
|
27-Jul-2013 |
DRC <dcommander@users.sourceforge.net> |
SIMD support for performing downsampling using MIPS DSPr2 instructions git-svn-id: svn+ssh://svn.code.sf.net/p/libjpeg-turbo/code/trunk@995 632fc199-4ca6-4c93-a231-07263d6284db
/external/libjpeg-turbo/simd/jsimd_mips.c
|
86fbf35fb6010aa3d7c373a1cc7e3557e4b35ab9 |
|
27-Jul-2013 |
DRC <dcommander@users.sourceforge.net> |
SIMD support for performing fancy upsampling using MIPS DSPr2 instructions git-svn-id: svn+ssh://svn.code.sf.net/p/libjpeg-turbo/code/trunk@994 632fc199-4ca6-4c93-a231-07263d6284db
/external/libjpeg-turbo/simd/jsimd_mips.c
|
0be9fa57352e89cb24d5352b0ef96f42a80a7ecc |
|
24-Jul-2013 |
DRC <dcommander@users.sourceforge.net> |
SIMD support for performing color conversion using MIPS DSPr2 instructions git-svn-id: svn+ssh://svn.code.sf.net/p/libjpeg-turbo/code/trunk@993 632fc199-4ca6-4c93-a231-07263d6284db
/external/libjpeg-turbo/simd/jsimd_mips.c
|