Cross Reference: /external/llvm/lib/Target/X86/

//===---------------------------------------------------------------------===//
// Random ideas for the X86 backend: FP stack related stuff
//===---------------------------------------------------------------------===//

//===---------------------------------------------------------------------===//

Some targets (e.g. athlons) prefer freep to fstp ST(0):
http://gcc.gnu.org/ml/gcc-patches/2004-04/msg00659.html

//===---------------------------------------------------------------------===//

This should use fiadd on chips where it is profitable:
double foo(double P, int *I) { return P+*I; }

We have fiadd patterns now but the followings have the same cost and
complexity. We need a way to specify the later is more profitable.

def FpADD32m  : FpI<(ops RFP:$dst, RFP:$src1, f32mem:$src2), OneArgFPRW,
                    [(set RFP:$dst, (fadd RFP:$src1,
                                     (extloadf64f32 addr:$src2)))]>;
                // ST(0) = ST(0) + [mem32]

def FpIADD32m : FpI<(ops RFP:$dst, RFP:$src1, i32mem:$src2), OneArgFPRW,
                    [(set RFP:$dst, (fadd RFP:$src1,
                                     (X86fild addr:$src2, i32)))]>;
                // ST(0) = ST(0) + [mem32int]

//===---------------------------------------------------------------------===//

The FP stackifier should handle simple permutates to reduce number of shuffle
instructions, e.g. turning:

fld P	->		fld Q
fld Q			fld P
fxch

or:

fxch	->		fucomi
fucomi			jl X
jg X

Ideas:
http://gcc.gnu.org/ml/gcc-patches/2004-11/msg02410.html


//===---------------------------------------------------------------------===//

Add a target specific hook to DAG combiner to handle SINT_TO_FP and
FP_TO_SINT when the source operand is already in memory.

//===---------------------------------------------------------------------===//

Open code rint,floor,ceil,trunc:
http://gcc.gnu.org/ml/gcc-patches/2004-08/msg02006.html
http://gcc.gnu.org/ml/gcc-patches/2004-08/msg02011.html

Opencode the sincos[f] libcall.

//===---------------------------------------------------------------------===//

None of the FPStack instructions are handled in
X86RegisterInfo::foldMemoryOperand, which prevents the spiller from
folding spill code into the instructions.

//===---------------------------------------------------------------------===//

Currently the x86 codegen isn't very good at mixing SSE and FPStack
code:

unsigned int foo(double x) { return x; }

foo:
	subl $20, %esp
	movsd 24(%esp), %xmm0
	movsd %xmm0, 8(%esp)
	fldl 8(%esp)
	fisttpll (%esp)
	movl (%esp), %eax
	addl $20, %esp
	ret

This just requires being smarter when custom expanding fptoui.

//===---------------------------------------------------------------------===//
Name	Date	Size
..	06-Oct-2015	4 KiB
Android.mk	06-Oct-2015	1.7 KiB
AsmParser/	06-Oct-2015	4 KiB
CMakeLists.txt	06-Oct-2015	1.7 KiB
Disassembler/	06-Oct-2015	4 KiB
InstPrinter/	06-Oct-2015	4 KiB
LLVMBuild.txt	06-Oct-2015	1 KiB
Makefile	06-Oct-2015	840
MCTargetDesc/	06-Oct-2015	4 KiB
README-FPStack.txt	06-Oct-2015	2.7 KiB
README-MMX.txt	06-Oct-2015	1.5 KiB
README-SSE.txt	06-Oct-2015	24.5 KiB
README-UNIMPLEMENTED.txt	06-Oct-2015	679
README-X86-64.txt	06-Oct-2015	6 KiB
README.txt	06-Oct-2015	47.6 KiB
TargetInfo/	06-Oct-2015	4 KiB
Utils/	06-Oct-2015	4 KiB
X86.h	06-Oct-2015	2.8 KiB
X86.td	06-Oct-2015	28.4 KiB
X86AsmPrinter.cpp	06-Oct-2015	25.9 KiB
X86AsmPrinter.h	06-Oct-2015	4.5 KiB
X86CallFrameOptimization.cpp	06-Oct-2015	16.5 KiB
X86CallingConv.h	06-Oct-2015	1.6 KiB
X86CallingConv.td	06-Oct-2015	28.7 KiB
X86CompilationCallback_Win64.asm	06-Oct-2015	1.6 KiB
X86FastISel.cpp	06-Oct-2015	121.3 KiB
X86FixupLEAs.cpp	06-Oct-2015	11.7 KiB
X86FloatingPoint.cpp	06-Oct-2015	60.9 KiB
X86FrameLowering.cpp	06-Oct-2015	75.7 KiB
X86FrameLowering.h	06-Oct-2015	4 KiB
X86Instr3DNow.td	06-Oct-2015	4.4 KiB
X86InstrArithmetic.td	06-Oct-2015	64.2 KiB
X86InstrAVX512.td	06-Oct-2015	272.9 KiB
X86InstrBuilder.h	06-Oct-2015	6.6 KiB
X86InstrCMovSetCC.td	06-Oct-2015	5.3 KiB
X86InstrCompiler.td	06-Oct-2015	78.9 KiB
X86InstrControl.td	06-Oct-2015	14.9 KiB
X86InstrExtension.td	06-Oct-2015	9.3 KiB
X86InstrFMA.td	06-Oct-2015	19.2 KiB
X86InstrFormats.td	06-Oct-2015	40.4 KiB
X86InstrFPStack.td	06-Oct-2015	34.1 KiB
X86InstrFragmentsSIMD.td	06-Oct-2015	31.6 KiB
X86InstrInfo.cpp	06-Oct-2015	266.6 KiB
X86InstrInfo.h	06-Oct-2015	21.4 KiB
X86InstrInfo.td	06-Oct-2015	135.3 KiB
X86InstrMMX.td	06-Oct-2015	30.5 KiB
X86InstrSGX.td	06-Oct-2015	907
X86InstrShiftRotate.td	06-Oct-2015	46 KiB
X86InstrSSE.td	06-Oct-2015	420.1 KiB
X86InstrSVM.td	06-Oct-2015	2.1 KiB
X86InstrSystem.td	06-Oct-2015	27.5 KiB
X86InstrTSX.td	06-Oct-2015	1.9 KiB
X86InstrVMX.td	06-Oct-2015	3.2 KiB
X86InstrXOP.td	06-Oct-2015	15.6 KiB
X86IntrinsicsInfo.h	06-Oct-2015	38.8 KiB
X86ISelDAGToDAG.cpp	06-Oct-2015	104.2 KiB
X86ISelLowering.cpp	06-Oct-2015	983.9 KiB
X86ISelLowering.h	06-Oct-2015	41.4 KiB
X86MachineFunctionInfo.cpp	06-Oct-2015	1 KiB
X86MachineFunctionInfo.h	06-Oct-2015	7.3 KiB
X86MCInstLower.cpp	06-Oct-2015	47.8 KiB
X86PadShortFunction.cpp	06-Oct-2015	6.6 KiB
X86RegisterInfo.cpp	06-Oct-2015	27.4 KiB
X86RegisterInfo.h	06-Oct-2015	5 KiB
X86RegisterInfo.td	06-Oct-2015	19.9 KiB
X86SchedHaswell.td	06-Oct-2015	55.4 KiB
X86SchedSandyBridge.td	06-Oct-2015	8.1 KiB
X86Schedule.td	06-Oct-2015	22.2 KiB
X86ScheduleAtom.td	06-Oct-2015	28.7 KiB
X86ScheduleBtVer2.td	06-Oct-2015	11.3 KiB
X86ScheduleSLM.td	06-Oct-2015	7.5 KiB
X86SelectionDAGInfo.cpp	06-Oct-2015	11 KiB
X86SelectionDAGInfo.h	06-Oct-2015	1.9 KiB
X86Subtarget.cpp	06-Oct-2015	10.8 KiB
X86Subtarget.h	06-Oct-2015	16.2 KiB
X86TargetMachine.cpp	06-Oct-2015	8 KiB
X86TargetMachine.h	06-Oct-2015	1.5 KiB
X86TargetObjectFile.cpp	06-Oct-2015	6.8 KiB
X86TargetObjectFile.h	06-Oct-2015	2.8 KiB
X86TargetTransformInfo.cpp	06-Oct-2015	42 KiB
X86TargetTransformInfo.h	06-Oct-2015	4 KiB
X86VZeroUpper.cpp	06-Oct-2015	11.5 KiB