History log of /art/compiler/optimizing/code_generator_arm64.h
Revision Date Author Comments
3d21bdf8894e780d349c481e5c9e29fe1556051c 22-Apr-2015 Mathieu Chartier <mathieuc@google.com> Move mirror::ArtMethod to native

Optimizing + quick tests are passing, devices boot.

TODO: Test and fix bugs in mips64.

Saves 16 bytes for most ArtMethods; 7.5MB reduction in system PSS.
Some of the savings come from the removal of the virtual methods and
direct methods object arrays.

Bug: 19264997

(cherry picked from commit e401d146407d61eeb99f8d6176b2ac13c4df1e33)

Change-Id: I622469a0cfa0e7082a2119f3d6a9491eb61e3f3d

Fix some ArtMethod related bugs

Added root visiting for runtime methods, not currently required
since the GcRoots in these methods are null.

Added missing GetInterfaceMethodIfProxy in GetMethodLine, fixes
--trace run-tests 005, 044.

Fixed an optimizing compiler bug where we used a normal stack location
instead of a double one on ARM64; this fixes the debuggable tests.

TODO: Fix JDWP tests.

Bug: 19264997

Change-Id: I7c55f69c61d1b45351fd0dc7185ffe5efad82bd3

ART: Fix casts for 64-bit pointers on 32-bit compiler.

Bug: 19264997
Change-Id: Ief45cdd4bae5a43fc8bfdfa7cf744e2c57529457

Fix JDWP tests after ArtMethod change

Fixes Throwable::GetStackDepth for exception event detection after
internal stack trace representation change.

Adds missing ArtMethod::GetInterfaceMethodIfProxy call in case of
proxy method.

Bug: 19264997
Change-Id: I363e293796848c3ec491c963813f62d868da44d2

Fix accidental IMT and root marking regression

We were always using the conflict trampoline. Also included a fix for
the regression in GC time caused by extra roots; most of the regression
was from the IMT.

Fixed bug in DumpGcPerformanceInfo where we would get SIGABRT due to
detached thread.

EvaluateAndApplyChanges:
From ~2500 -> ~1980
GC time: 8.2s -> 7.2s due to 1s less of MarkConcurrentRoots

Bug: 19264997
Change-Id: I4333e80a8268c2ed1284f87f25b9f113d4f2c7e0

Fix bogus image test assert

Previously we were comparing the size of the non-moving space to the
size of the image file.

Now we properly compare the size of the image space against the size
of the image file.

Bug: 19264997
Change-Id: I7359f1f73ae3df60c5147245935a24431c04808a

[MIPS64] Fix art_quick_invoke_stub argument offsets.

The ArtMethod reference's size got bigger, so we need to move the
other arguments and leave enough space for the ArtMethod* and the
'this' pointer.

This fixes mips64 boot.

Bug: 19264997
Change-Id: I47198d5f39a4caab30b3b77479d5eedaad5006ab
2d27c8e338af7262dbd4aaa66127bb8fa1758b86 28-Apr-2015 Roland Levillain <rpl@google.com> Refactor InvokeDexCallingConventionVisitor in Optimizing.

Change-Id: I7ede0f59d5109644887bf5d39201d4e1bf043f34
09a99965bb27649f5b1d373f76bfbec6a2500c9e 15-Apr-2015 Alexandre Rames <alexandre.rames@arm.com> Opt compiler: ARM64: Follow other archs for a few codegen stubs.

Code generation for HInstanceFieldGet, HInstanceFieldSet,
HStaticFieldGet, and HStaticFieldSet is refactored to follow the
structure used for other backends.

Change-Id: I34a3bd17effa042238c6bf199848cbc2ec26ac5d
ad4450e5c3ffaa9566216cc6fafbf5c11186c467 17-Apr-2015 Zheng Xu <zheng.xu@arm.com> Opt compiler: Implement parallel move resolver without using swap.

The algorithm of ParallelMoveResolverNoSwap() is almost the same as
ParallelMoveResolverWithSwap(), except for the way we resolve circular
dependencies. NoSwap() uses an additional scratch register to resolve
them. For example, (0->1) (1->2) (2->0) will be performed as
(2->scratch) (1->2) (0->1) (scratch->0).

On architectures without swap register support, NoSwap() can reduce
the number of moves from 3x(N-1) to (N+1) when there is a circular
dependency of N moves.

Also, the NoSwap() algorithm does not depend on architecture register
layout information, which means it can support register pairs on arm32,
and X/W and D/S registers on arm64, without additional modification.
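
A minimal C++ sketch of the scratch-register scheme described above
(illustrative only, with hypothetical names; this is not the actual
ParallelMoveResolverNoSwap code):

  #include <cstdio>

  int main() {
    // reg[i] stands for register i; the cycle is (0->1) (1->2) (2->0).
    int reg[3] = {100, 101, 102};
    int scratch = reg[2];  // (2 -> scratch): free the last destination.
    reg[2] = reg[1];       // (1 -> 2)
    reg[1] = reg[0];       // (0 -> 1)
    reg[0] = scratch;      // (scratch -> 0): close the cycle.
    // A cycle of N moves costs N+1 moves with one scratch register,
    // versus 3*(N-1) when every swap is emulated by three moves.
    std::printf("r0=%d r1=%d r2=%d\n", reg[0], reg[1], reg[2]);
    return 0;
  }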

Change-Id: Idf56bd5469bb78c0e339e43ab16387428a082318
69a503050fb8a7b3a79b2cd2cdc2d8fbc594575d 14-Apr-2015 Zheng Xu <zheng.xu@arm.com> ARM64: Remove suspend register.

It also cleans up the frame build/removal code used by the JNI
compiler and generates stp/ldp instead of str/ldr. x19 has also been
unblocked in both the Quick and Optimizing compilers.

Change-Id: Idbeac0942265f493266b2ef9b7a65bb4054f0e2d
c6b4dd8980350aaf250f0185f73e9c42ec17cd57 07-Apr-2015 David Srbecky <dsrbecky@google.com> Implement CFI for Optimizing.

CFI is necessary for stack unwinding in gdb, lldb, and libunwind.

Change-Id: I1a3480e3a4a99f48bf7e6e63c4e83a80cfee40a2
d43b3ac88cd46b8815890188c9c2b9a3f1564648 01-Apr-2015 Mingyao Yang <mingyao@google.com> Revert "Revert "Deoptimization-based bce.""

This reverts commit 0ba627337274ccfb8c9cb9bf23fffb1e1b9d1430.

Change-Id: I1ca10d15bbb49897a0cf541ab160431ec180a006
82e52ce8364e3e1c644d0d3b3b4f61364bf7089a 26-Mar-2015 Serban Constantinescu <serban.constantinescu@arm.com> ARM64: Update to VIXL 1.9.

Update VIXL's interface to VIXL 1.9.

Change-Id: Iebae947539cbad65488b7195aaf01de284b71cbb
Signed-off-by: Serban Constantinescu <serban.constantinescu@arm.com>
d75948ac93a4a317feaf136cae78823071234ba5 27-Mar-2015 Nicolas Geoffray <ngeoffray@google.com> Intrinsify String.compareTo.

Change-Id: Ia540df98755ac493fe61bd63f0bd94f6d97fbb57
0ba627337274ccfb8c9cb9bf23fffb1e1b9d1430 24-Mar-2015 Andreas Gampe <agampe@google.com> Revert "Deoptimization-based bce."

This breaks compiling the core image:

Error after BCE: art::SSAChecker: Instruction 219 in block 1 does not dominate use 221 in block 1.

This reverts commit e295e6ec5beaea31be5d7d3c996cd8cfa2053129.

Change-Id: Ieeb48797d451836ed506ccb940872f1443942e4e
e295e6ec5beaea31be5d7d3c996cd8cfa2053129 07-Mar-2015 Mingyao Yang <mingyao@google.com> Deoptimization-based bce.

A mechanism is introduced whereby a runtime method can be called from
code compiled with the optimizing compiler in order to deoptimize into
the interpreter. This can be used to establish invariants in the
managed code. If an invariant does not hold at runtime, we deoptimize
and continue execution in the interpreter. This allows us to optimize
the managed code as if the invariant had been proven at compile time,
while any exception is still thrown according to the semantics demanded
by the spec.

The invariant and optimization included in this patch are based on the
length of an array. Given a set of array accesses with constant indices
{c1, ..., cn}, we can optimize away all bounds checks iff
0 <= min(ci) and max(ci) < array-length. The first condition can be
proven statically. The second can be established with a
deoptimization-based invariant. This replaces n bounds checks with one
invariant check (plus slow-path code).
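
A hedged sketch of that invariant in plain C++ (names and the throw are
illustrative stand-ins for the runtime deoptimization call):

  #include <stdexcept>

  long SumFirstThree(const int* array, int array_length) {
    // Constant indices used below: {0, 1, 2}. min = 0 >= 0 is proven
    // statically; max = 2 < array_length is the single runtime guard.
    if (array_length <= 2) {
      // Stand-in for deoptimization: the interpreter re-executes the
      // code and throws with the exact semantics the spec demands.
      throw std::out_of_range("deoptimize");
    }
    // The three accesses below need no individual bounds checks.
    return static_cast<long>(array[0]) + array[1] + array[2];
  }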

Change-Id: I8c6e34b56c85d25b91074832d13dba1db0a81569
eeefa1276e83776f08704a3db4237423b0627e20 13-Mar-2015 Nicolas Geoffray <ngeoffray@google.com> Update locations of registers after slow paths spilling.

Change-Id: Id9aafcc13c1a085c17ce65d704c67b73f9de695d
579885a26d761f5ba9550f2a1cd7f0f598c2e1e3 22-Feb-2015 Serban Constantinescu <serban.constantinescu@arm.com> Opt Compiler: ARM64: Enable explicit memory barriers over acquire/release

Implement remaining explicit memory barrier code paths and temporarily
enable the use of explicit memory barriers for testing.

This CL also enables the use of instruction set features in the ARM64
backend. kUseAcquireRelease has been replaced with PreferAcquireRelease(),
which for now is statically set to false (prefer explicit memory barriers).

Please note that we still prefer acquire-release for the ARM64 Optimizing
Compiler, but we would like to exercise the explicit memory barrier code
path too.

Change-Id: I84e047ecd43b6fbefc5b82cf532e3f5c59076458
Signed-off-by: Serban Constantinescu <serban.constantinescu@arm.com>
dc23d8318db08cb42e20f1d16dbc416798951a8b 16-Feb-2015 Nicolas Geoffray <ngeoffray@google.com> Avoid generating jmp +0.

When a block branches to a non-following block, but the blocks in
between just branch to it and therefore emit no code, the jump would
assemble as a jmp +0, so we can avoid emitting it.
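
A simplified, hypothetical version of the check (the real helper
belongs to the code generator and works on the graph's block order):

  struct Block {
    const Block* next = nullptr;         // layout successor
    const Block* goto_target = nullptr;  // set iff the block is only a goto
  };

  // Returns true when 'target' is reached by simply falling through,
  // possibly across blocks that contain nothing but a goto to it, in
  // which case the jump would assemble as jmp +0 and can be omitted.
  bool GoesToNextBlock(const Block* current, const Block* target) {
    for (const Block* b = current->next; b != nullptr; b = b->next) {
      if (b == target) return true;
      if (b->goto_target != target) return false;  // real code in between
    }
    return false;
  }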

Change-Id: I9b343f662a4efc718cd4b58168f93162a24e1219
3d087decd1886b818adcccd4f16802e5e54dd03e 28-Jan-2015 Serban Constantinescu <serban.constantinescu@arm.com> Opt Compiler: ARM64: Enable Callee-saved register, as defined by AAPCS64.

For now we block kQuickSuspendRegister - x19, since Quick and the runtime
use this as a suspend counter register.

Change-Id: I090d386670e81e7924e4aa9a3864ef30d0580a30
Signed-off-by: Serban Constantinescu <serban.constantinescu@arm.com>
1cf95287364948689f6a1a320567acd7728e94a3 12-Dec-2014 Nicolas Geoffray <ngeoffray@google.com> Small optimization for recursive calls: avoid dex cache.

Change-Id: I044757a2f06e535cdc1480c4fc8182b89635baf6
878d58cbaf6b17a9e3dcab790754527f3ebc69e5 16-Jan-2015 Andreas Gampe <agampe@google.com> ART: Arm64 optimizing compiler intrinsics

Implement most intrinsics for the optimizing compiler for Arm64.

Change-Id: Idb459be09f0524cb9aeab7a5c7fccb1c6b65a707
d97dc40d186aec46bfd318b6a2026a98241d7e9c 22-Jan-2015 Nicolas Geoffray <ngeoffray@google.com> Support callee save floating point registers on x64.

- Share the computation of core_spill_mask and fpu_spill_mask
between backends (see the sketch below).
- Remove explicit stack overflow check support: we would need to
adjust the checks, and since they are not tested, they would easily
bitrot.
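
A minimal sketch of the shared mask computation, with hypothetical
names (a spill mask is a bitset over register numbers):

  #include <cstdint>

  // Keep exactly the callee-save registers the allocator actually used.
  uint32_t ComputeSpillMask(uint32_t allocated_registers,
                            uint32_t callee_save_mask) {
    return allocated_registers & callee_save_mask;
  }

  // core_spill_mask = ComputeSpillMask(allocated_core, core_callee_saves);
  // fpu_spill_mask  = ComputeSpillMask(allocated_fp,   fpu_callee_saves);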

Change-Id: I0b619b8de4e1bdb169ea1ae7c6ede8df0d65837a
988939683c26c0b1c8808fc206add6337319509a 21-Jan-2015 Nicolas Geoffray <ngeoffray@google.com> Enable core callee-save on x64.

Support for other architectures and for FP registers will come in
other CLs.

Change-Id: I8cef0343eedc7202d206f5217fdf0349035f0e4d
77520bca97ec44e3758510cebd0f20e3bb4584ea 12-Jan-2015 Calin Juravle <calin@google.com> Record implicit null checks at the actual invoke time.

ImplicitNullChecks are recorded only for instructions directly (see NB
below) preceded by NullChecks in the graph. This way we avoid recording
redundant safepoints and minimize the code size increase.

NB: ParallelMoves might be inserted by the register allocator between
the NullChecks and their uses. These modify the environment, and the
correct action would be to reverse their modification. This will be
addressed in a follow-up CL.
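
A toy illustration of the recording rule (stand-in types, not the real
HInstruction API):

  struct Instruction {
    Instruction* previous = nullptr;
    bool is_null_check = false;
  };

  // Record a safepoint for an implicit null check only when the
  // instruction is directly preceded by the NullCheck it folds away.
  bool ShouldRecordImplicitNullCheck(const Instruction* instr) {
    const Instruction* prev = instr->previous;
    return prev != nullptr && prev->is_null_check;
  }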

Change-Id: Ie50006e5a4bd22932dcf11348f5a655d253cd898
cd6dffedf1bd8e6dfb3fb0c933551f9a90f7de3f 08-Jan-2015 Calin Juravle <calin@google.com> Add implicit null checks for the optimizing compiler

- for backends: arm, arm64, x86, x86_64
- fixed parameter passing for CodeGenerator
- 003-omnibus-opcodes test verifies that NullPointerExceptions work as
expected

Change-Id: I1b302acd353342504716c9169a80706cf3aba2c8
f85a9ca9859ad843dc03d3a2b600afbaf2e9bbdd 13-Jan-2015 Mark Mendell <mark.p.mendell@intel.com> [optimizing compiler] Compute live spill size

The current stack frame calculation assumes that each live register to
be saved/restored has the word size of the machine. This fails for X86,
where a double in an XMM register takes up 8 bytes. Change the
calculation to keep track of the number of core registers and number of
fp registers to handle this distinction.

This is slightly pessimal, as the registers may not be active at the
same time, but the only way to handle this would be to allocate both
classes of registers simultaneously, or to remember all the active
intervals, match them up, and compute the size of each safepoint
interval.
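
A sketch of the corrected accounting, with hypothetical parameter names
(not the actual CodeGenerator interface):

  #include <cstddef>

  // Core and FP spill slots are sized independently: on x86 a core
  // register spills to 4 bytes while an XMM double needs 8.
  size_t ComputeSpillAreaSize(size_t num_core_spills,
                              size_t num_fp_spills,
                              size_t core_slot_size,  // 4 on x86
                              size_t fp_slot_size) {  // 8 for doubles
    return num_core_spills * core_slot_size +
           num_fp_spills * fp_slot_size;
  }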

Change-Id: If7860aa319b625c214775347728cdf49a56946eb
Signed-off-by: Mark Mendell <mark.p.mendell@intel.com>
840e5461a85f8908f51e7f6cd562a9129ff0e7ce 07-Jan-2015 Nicolas Geoffray <ngeoffray@google.com> Implement double and float support for arm in register allocator.

The basic approach is:
- An instruction that needs two registers gets two intervals.
- When allocating the low part, we also allocate the high part.
- When splitting a low (or high) interval, we also split the high
(or low) equivalent.
- Allocation follows the (S/D register) requirement that low
registers are always even and the high equivalent is low + 1 (see the
sketch below).
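
A minimal sketch of that pairing rule (hypothetical helper; on ARM
VFP, D(n) aliases S(2n) and S(2n+1)):

  #include <cassert>

  // Given the low S register of a pair, return its high counterpart.
  int HighRegisterOf(int low_reg) {
    assert((low_reg & 1) == 0);  // low registers are always even.
    return low_reg + 1;
  }

  // Example: a double allocated to D2 occupies S4 (low) and S5 (high).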

Change-Id: I06a5148e05a2ffc7e7555d08e871ed007b4c2797
02d81cc8d162a31f0664249535456775e397b608 05-Jan-2015 Serban Constantinescu <serban.constantinescu@arm.com> Opt Compiler: ARM64: Add support for rem-float, rem-double and volatile.

Add support for rem-float, rem-double and volatile memory accesses
using acquire-release and memory barriers.

Change-Id: I96a24dff66002c3b772c3d8e6ed792e3cb59048a
Signed-off-by: Serban Constantinescu <serban.constantinescu@arm.com>
5b4b898ed8725242ee6b7229b94467c3ea3054c8 18-Dec-2014 Nicolas Geoffray <ngeoffray@google.com> Revert "Don't block quick callee saved registers for optimizing."

X64 has one libcore test failing, and codegen_test on
arm is failing.

This reverts commit 6004796d6c630696127df2494dcd4f30d1367a34.

Change-Id: I20e00431fa18e11ce4c0cb6fffa91977fa8e9b4f
6004796d6c630696127df2494dcd4f30d1367a34 15-Dec-2014 Nicolas Geoffray <ngeoffray@google.com> Don't block quick callee saved registers for optimizing.

This change builds on:
https://android-review.googlesource.com/#/c/118983/

- Also fix x86_64 assembler bug triggered by this change.
- Fix (and improve) x86's backend byte register usage.
- Fix a bug in baseline register allocator: a fixed
out register must prevent inputs from allocating it.

Change-Id: I4883862e29b4e4b6470f1823cf7eab7e7863d8ad
3e69f16ae3fddfd24f4f0e29deb106d564ab296c 10-Dec-2014 Alexandre Rames <alexandre.rames@arm.com> Opt compiler: Add arm64 support for register allocation.

Change-Id: Idc6e84eee66170de4a9c0a5844c3da038c083aa7
02164b352a1474c616771582ca9a73a2cc514c1f 13-Nov-2014 Serban Constantinescu <serban.constantinescu@arm.com> Opt Compiler: Arm64: Add support for more IRs plus various fixes.

Add support for more IRs and update others.

Change-Id: Iae1bef01dc3c0d238a46fbd2800e71c38288b1d2
Signed-off-by: Serban Constantinescu <serban.constantinescu@arm.com>
32f5b4d2c8c9b52e9522941c159577b21752d0fa 25-Nov-2014 Serban Constantinescu <serban.constantinescu@arm.com> Vixl: Update the VIXL interface to VIXL 1.7 and enable VIXL debug.

This patch updates the interface to VIXL 1.7 and enables the debug version of
VIXL when ART is built in debug mode.

Change-Id: I443fb941bec3cffefba7038f93bb972e6b7d8db5
Signed-off-by: Serban Constantinescu <serban.constantinescu@arm.com>
86a8d7afc7f00ff0f5ea7b8aaf4d50514250a4e6 19-Nov-2014 Nicolas Geoffray <ngeoffray@google.com> Consistently use k{InstructionSet}WordSize.

These constants were defined prior to k{InstructionSet}PointerSize. So
use them consistently in optimizing as a first step. We can discuss
whether we should remove them in a second step.

Change-Id: If129de1a3bb8b65f8d9c816a8ad466815fb202e6
67555f7e9a05a9d436e034f67ae683bbf02d072d 18-Nov-2014 Alexandre Rames <alexandre.rames@arm.com> Opt compiler: Add support for more IRs on arm64.

Change-Id: I4b6425135d1af74912a206411288081d2516f8bf
f0e3937b87453234d0d7970b8712082062709b8d 12-Nov-2014 Nicolas Geoffray <ngeoffray@google.com> Do a parallel move in BoundsCheckSlowPath.

The two locations of the index and length could overlap,
so we need a parallel move. Also factorize the code for
doing a parallel move based on two locations.
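
A hypothetical illustration of the overlap (register names are made
up): if the index currently sits where the convention wants the length
and vice versa, two naive moves would clobber one of the values, so the
resolver must emit a swap (or a scratch cycle) instead:

  #include <utility>

  void MoveIndexAndLength(int& reg0 /* wants index */,
                          int& reg1 /* wants length */) {
    // Suppose the index lives in reg1 and the length in reg0: either
    // single move would destroy the other source, hence the swap.
    std::swap(reg0, reg1);
  }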

Change-Id: Iee8b3459e2eed6704d45e9a564fb2cd050741ea4
fc19de8b201475231751b9df08fce01a093e5c2b 07-Nov-2014 Alexandre Rames <alexandre.rames@arm.com> Opt compiler: Add arm64 support for a few more IRs.

Change-Id: I781ddcbc61eb2b04ae80b1c7697e1ed5694bd5b9
a89086e3be94fb262c4c4feb15241b30616c3b8f 07-Nov-2014 Alexandre Rames <alexandre.rames@arm.com> Opt compiler: Add arm64 support for floating-point.

Change-Id: I0d97ab0f5ab770fee62c819505743febbce8835e
de58ab2c03ff8112b07ab827c8fa38f670dfc656 05-Nov-2014 Nicolas Geoffray <ngeoffray@google.com> Implement try/catch/throw in optimizing.

- We currently don't run optimizations in the presence of a try/catch.
- We therefore implement Quick's mapping table.
- Also fix a missing null check on array-length.

Change-Id: I6917dfcb868e75c1cf6eff32b7cbb60b6cfbd68f
6a3c1fcb4ba42ad4d5d142c17a3712a6ddd3866f 31-Oct-2014 Ian Rogers <irogers@google.com> Remove -Wno-unused-parameter and -Wno-sign-promo from base cflags.

Fix associated errors about unused parameters and implicit sign
conversions. For sign conversions this was largely in the area of
enums, so add ostream operators for the affected enums and fix
tools/generate-operator-out.py.
Tidy arena allocation code and arena allocated data types, rather than fixing
new and delete operators.
Remove dead code.

Change-Id: I5b433e722d2f75baacfacae4d32aef4a828bfe1b
5319defdf502fc4569316473846b83180ec08035 23-Oct-2014 Alexandre Rames <alexandre.rames@arm.com> ART: optimizing compiler: initial support for ARM64.

The ARM64 port uses VIXL for code generation, to which it defers work
such as label binding and branch resolving, register type coherency
checking, and immediate value handling.

Change-Id: I0a44508c0c991f472a63e67b3469cdd878fe1a68
Signed-off-by: Serban Constantinescu <serban.constantinescu@arm.com>
Signed-off-by: Alexandre Rames <alexandre.rames@arm.com>