Cross Reference: /external/llvm/lib/Target/X86/

//===---------------------------------------------------------------------===//
// Random ideas for the X86 backend: FP stack related stuff
//===---------------------------------------------------------------------===//

//===---------------------------------------------------------------------===//

Some targets (e.g. athlons) prefer freep to fstp ST(0):
http://gcc.gnu.org/ml/gcc-patches/2004-04/msg00659.html

//===---------------------------------------------------------------------===//

This should use fiadd on chips where it is profitable:
double foo(double P, int *I) { return P+*I; }

We have fiadd patterns now but the followings have the same cost and
complexity. We need a way to specify the later is more profitable.

def FpADD32m  : FpI<(ops RFP:$dst, RFP:$src1, f32mem:$src2), OneArgFPRW,
                    [(set RFP:$dst, (fadd RFP:$src1,
                                     (extloadf64f32 addr:$src2)))]>;
                // ST(0) = ST(0) + [mem32]

def FpIADD32m : FpI<(ops RFP:$dst, RFP:$src1, i32mem:$src2), OneArgFPRW,
                    [(set RFP:$dst, (fadd RFP:$src1,
                                     (X86fild addr:$src2, i32)))]>;
                // ST(0) = ST(0) + [mem32int]

//===---------------------------------------------------------------------===//

The FP stackifier should handle simple permutates to reduce number of shuffle
instructions, e.g. turning:

fld P	->		fld Q
fld Q			fld P
fxch

or:

fxch	->		fucomi
fucomi			jl X
jg X

Ideas:
http://gcc.gnu.org/ml/gcc-patches/2004-11/msg02410.html


//===---------------------------------------------------------------------===//

Add a target specific hook to DAG combiner to handle SINT_TO_FP and
FP_TO_SINT when the source operand is already in memory.

//===---------------------------------------------------------------------===//

Open code rint,floor,ceil,trunc:
http://gcc.gnu.org/ml/gcc-patches/2004-08/msg02006.html
http://gcc.gnu.org/ml/gcc-patches/2004-08/msg02011.html

Opencode the sincos[f] libcall.

//===---------------------------------------------------------------------===//

None of the FPStack instructions are handled in
X86RegisterInfo::foldMemoryOperand, which prevents the spiller from
folding spill code into the instructions.

//===---------------------------------------------------------------------===//

Currently the x86 codegen isn't very good at mixing SSE and FPStack
code:

unsigned int foo(double x) { return x; }

foo:
	subl $20, %esp
	movsd 24(%esp), %xmm0
	movsd %xmm0, 8(%esp)
	fldl 8(%esp)
	fisttpll (%esp)
	movl (%esp), %eax
	addl $20, %esp
	ret

This just requires being smarter when custom expanding fptoui.

//===---------------------------------------------------------------------===//
Name	Date	Size
..	04-Jun-2014	4 KiB
Android.mk	04-Jun-2014	1.6 KiB
AsmParser/	04-Jun-2014	4 KiB
CMakeLists.txt	04-Jun-2014	2 KiB
Disassembler/	04-Jun-2014	4 KiB
InstPrinter/	04-Jun-2014	4 KiB
LLVMBuild.txt	04-Jun-2014	1 KiB
Makefile	04-Jun-2014	840
MCTargetDesc/	04-Jun-2014	4 KiB
README-FPStack.txt	04-Jun-2014	2.7 KiB
README-MMX.txt	04-Jun-2014	1.5 KiB
README-SSE.txt	04-Jun-2014	26.5 KiB
README-UNIMPLEMENTED.txt	04-Jun-2014	679
README-X86-64.txt	04-Jun-2014	6 KiB
README.txt	04-Jun-2014	52.6 KiB
TargetInfo/	04-Jun-2014	4 KiB
Utils/	04-Jun-2014	4 KiB
X86.h	04-Jun-2014	3 KiB
X86.td	04-Jun-2014	18.2 KiB
X86AsmPrinter.cpp	04-Jun-2014	25.3 KiB
X86AsmPrinter.h	04-Jun-2014	2.7 KiB
X86CallingConv.td	04-Jun-2014	22.4 KiB
X86CodeEmitter.cpp	04-Jun-2014	51.6 KiB
X86COFFMachineModuleInfo.cpp	04-Jun-2014	614
X86COFFMachineModuleInfo.h	04-Jun-2014	1.4 KiB
X86CompilationCallback_Win64.asm	04-Jun-2014	1.6 KiB
X86FastISel.cpp	04-Jun-2014	86.2 KiB
X86FixupLEAs.cpp	04-Jun-2014	8.8 KiB
X86FloatingPoint.cpp	04-Jun-2014	65.8 KiB
X86FrameLowering.cpp	04-Jun-2014	66.3 KiB
X86FrameLowering.h	04-Jun-2014	3.6 KiB
X86Instr3DNow.td	04-Jun-2014	4.4 KiB
X86InstrArithmetic.td	04-Jun-2014	63.8 KiB
X86InstrAVX512.td	04-Jun-2014	34.8 KiB
X86InstrBuilder.h	04-Jun-2014	6.6 KiB
X86InstrCMovSetCC.td	04-Jun-2014	5.2 KiB
X86InstrCompiler.td	04-Jun-2014	81 KiB
X86InstrControl.td	04-Jun-2014	13.2 KiB
X86InstrExtension.td	04-Jun-2014	8.6 KiB
X86InstrFMA.td	04-Jun-2014	18.1 KiB
X86InstrFormats.td	04-Jun-2014	35.1 KiB
X86InstrFPStack.td	04-Jun-2014	34.3 KiB
X86InstrFragmentsSIMD.td	04-Jun-2014	21 KiB
X86InstrInfo.cpp	04-Jun-2014	211.5 KiB
X86InstrInfo.h	04-Jun-2014	19.8 KiB
X86InstrInfo.td	04-Jun-2014	104.4 KiB
X86InstrMMX.td	04-Jun-2014	28.9 KiB
X86InstrShiftRotate.td	04-Jun-2014	45.7 KiB
X86InstrSSE.td	04-Jun-2014	399.4 KiB
X86InstrSVM.td	04-Jun-2014	2.1 KiB
X86InstrSystem.td	04-Jun-2014	24.9 KiB
X86InstrTSX.td	04-Jun-2014	1.7 KiB
X86InstrVMX.td	04-Jun-2014	3.2 KiB
X86InstrXOP.td	04-Jun-2014	14.9 KiB
X86ISelDAGToDAG.cpp	04-Jun-2014	102.9 KiB
X86ISelLowering.cpp	04-Jun-2014	724.5 KiB
X86ISelLowering.h	04-Jun-2014	40.1 KiB
X86JITInfo.cpp	04-Jun-2014	19.2 KiB
X86JITInfo.h	04-Jun-2014	3 KiB
X86MachineFunctionInfo.cpp	04-Jun-2014	444
X86MachineFunctionInfo.h	04-Jun-2014	5.6 KiB
X86MCInstLower.cpp	04-Jun-2014	30 KiB
X86PadShortFunction.cpp	04-Jun-2014	6.6 KiB
X86RegisterInfo.cpp	04-Jun-2014	25 KiB
X86RegisterInfo.h	04-Jun-2014	5 KiB
X86RegisterInfo.td	04-Jun-2014	19 KiB
X86Relocations.h	04-Jun-2014	2 KiB
X86SchedHaswell.td	04-Jun-2014	5.1 KiB
X86SchedSandyBridge.td	04-Jun-2014	4.7 KiB
X86Schedule.td	04-Jun-2014	19.2 KiB
X86ScheduleAtom.td	04-Jun-2014	27.9 KiB
X86SelectionDAGInfo.cpp	04-Jun-2014	10.2 KiB
X86SelectionDAGInfo.h	04-Jun-2014	1.9 KiB
X86Subtarget.cpp	04-Jun-2014	16.7 KiB
X86Subtarget.h	04-Jun-2014	13.9 KiB
X86TargetMachine.cpp	04-Jun-2014	8.1 KiB
X86TargetMachine.h	04-Jun-2014	4.4 KiB
X86TargetObjectFile.cpp	04-Jun-2014	1.9 KiB
X86TargetObjectFile.h	04-Jun-2014	1.6 KiB
X86TargetTransformInfo.cpp	04-Jun-2014	22.6 KiB
X86VZeroUpper.cpp	04-Jun-2014	9.7 KiB