external_llvm.git - Unnamed repository; edit this file 'description' to name the repository.

	Commit message (Collapse)	Author	Age	Files	Lines
*	[SystemZ] Handle extensions in RxSBG optimizations	Richard Sandiford	2013-10-16	1	-3/+2
\| \| \| \| \| \| \| \|	The input to an RxSBG operation can be narrower as long as the upper bits are don't care. This fixes a FIXME added in r192783. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192790 91177308-0d34-0410-b5e6-96231b3b80d8
*	[SystemZ] Improve handling of SETCC	Richard Sandiford	2013-10-16	3	-16/+260
\| \| \| \| \| \| \| \| \|	We previously used the default expansion to SELECT_CC, which in turn would expand to "LHI; BRC; LHI". In most cases it's better to use an IPM-based sequence instead. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192784 91177308-0d34-0410-b5e6-96231b3b80d8
*	Handle (shl (anyext (shr ...))) in SimpilfyDemandedBits	Richard Sandiford	2013-10-16	1	-0/+67
\| \| \| \| \| \| \| \| \| \| \|	This is really an extension of the current (shl (shr ...)) -> shl optimization. The main difference is that certain upper bits must also not be demanded. The motivating examples are the first two in the testcase, which occur in llvmpipe output. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192783 91177308-0d34-0410-b5e6-96231b3b80d8
*	Revert r192758 (and r192759), "MC: Better handling of tricky symbol and ↵	NAKAMURA Takumi	2013-10-16	3	-4/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	section names" GNU AS didn't like quotes in symbol names. Error: junk at end of line, first unrecognized character is `"' .def "@feat.00"; "@feat.00" = 1 Reproduced on Cygwin's 2.23.52.20130309 and mingw32's 2.20.1.20100303. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192775 91177308-0d34-0410-b5e6-96231b3b80d8
*	Add a triple to this test.	Rafael Espindola	2013-10-16	1	-1/+1
\| \| \| \|	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192767 91177308-0d34-0410-b5e6-96231b3b80d8
*	Add support for metadata representing .ident directives.	Rafael Espindola	2013-10-16	1	-0/+9
\| \| \| \|	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192764 91177308-0d34-0410-b5e6-96231b3b80d8
*	MC: Better handling of tricky symbol and section names	Hans Wennborg	2013-10-16	3	-4/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Because of win32 mangling, we produce symbol and section names with funny characters in them, most notably @ characters. MC would choke on trying to parse its own assembly output. This patch addresses that by: - Making @ trigger quoting of symbol names - Also quote section names in the same way - Just parse section names like other identifiers (to allow for quotes) - Don't assume @ signifies a symbol variant if it is in a string. Differential Revision: http://llvm-reviews.chandlerc.com/D1945 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192758 91177308-0d34-0410-b5e6-96231b3b80d8
*	Enable MI Sched for x86.	Andrew Trick	2013-10-15	65	-278/+336
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This changes the SelectionDAG scheduling preference to source order. Soon, the SelectionDAG scheduler can be bypassed saving a nice chunk of compile time. Performance differences that result from this change are often a consequence of register coalescing. The register coalescer is far from perfect. Bugs can be filed for deficiencies. On x86 SandyBridge/Haswell, the source order schedule is often preserved, particularly for small blocks. Register pressure is generally improved over the SD scheduler's ILP mode. However, we are still able to handle large blocks that require latency hiding, unlike the SD scheduler's BURR mode. MI scheduler also attempts to discover the critical path in single-block loops and adjust heuristics accordingly. The MI scheduler relies on the new machine model. This is currently unimplemented for AVX, so we may not be generating the best code yet. Unit tests are updated so they don't depend on SD scheduling heuristics. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192750 91177308-0d34-0410-b5e6-96231b3b80d8
*	[AArch64] Add support for NEON scalar signed saturating absolute value and	Chad Rosier	2013-10-15	2	-0/+98
\| \| \| \| \| \|	scalar signed saturating negate instructions. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192733 91177308-0d34-0410-b5e6-96231b3b80d8
*	Struct byval: fix a copy-paste error for thumb2.	Manman Ren	2013-10-15	1	-4/+43
\| \| \| \| \| \| \|	PR17309 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192730 91177308-0d34-0410-b5e6-96231b3b80d8
*	Fix PR17546	Michael Liao	2013-10-15	1	-0/+10
\| \| \| \| \| \| \| \| \| \| \| \|	- Type of index used in extract_vector_elt or insert_vector_elt supposes to be TLI.getVectorIdxTy() which is pointer type on most targets. It'd better to truncate (or zero-extend in case it's changed later) it to mask element type to guarantee they are matching instead of asserting that. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192722 91177308-0d34-0410-b5e6-96231b3b80d8
*	Fix PR16807	Michael Liao	2013-10-15	1	-0/+18
\| \| \| \| \| \| \| \| \| \| \| \|	- Lower signed division by constant powers-of-2 to target-independent DAG operators instead of target-dependent ones to support them better on targets where vector types are legal but shift operators on that types are illegal. E.g., on AVX, PSRAW is only available on <8 x i16> though <16 x i16> is a legal type. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192721 91177308-0d34-0410-b5e6-96231b3b80d8
*	[mips][msa] Added support for build_vector for v4f32 and v2f64.	Daniel Sanders	2013-10-15	1	-4/+39
\| \| \| \|	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192699 91177308-0d34-0410-b5e6-96231b3b80d8
*	[SystemZ] Use A(G)SI when spilling the target of a constant addition	Richard Sandiford	2013-10-15	2	-0/+332
\| \| \| \|	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192681 91177308-0d34-0410-b5e6-96231b3b80d8
*	Fix MSP430 calling convention to match MSPGCC	Job Noorman	2013-10-15	2	-0/+179
\| \| \| \|	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192678 91177308-0d34-0410-b5e6-96231b3b80d8
*	llvm/test/CodeGen/X86/break-avx-dep.ll: Relax an expression to be matched to ↵	NAKAMURA Takumi	2013-10-15	1	-1/+1
\| \| \| \| \| \|	also r[89], not only rXX. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192675 91177308-0d34-0410-b5e6-96231b3b80d8
*	Improve on r192635, ExeDepsFix for avx, and add a test case.	Andrew Trick	2013-10-15	1	-0/+29
\| \| \| \| \| \| \| \| \|	rdar:15221834 False AVX register dependencies cause 5x slowdown on flops-5/6 and significant slowdown on several others. This was blocking the switch to MI-Sched. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192669 91177308-0d34-0410-b5e6-96231b3b80d8
*	[mips] Transfer kill flag to the newly created operand.	Akira Hatanaka	2013-10-15	2	-6/+24
\| \| \| \|	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192662 91177308-0d34-0410-b5e6-96231b3b80d8
*	[X86][FastISel] During X86 fastisel, the address of indirect call was resolved	Quentin Colombet	2013-10-14	1	-0/+132
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	through bitcast, ptrtoint, and inttoptr instructions. This is valid only if the related instructions are in that same basic block, otherwise we may reference variables that were not live accross basic blocks resulting in undefined virtual registers. The bug was exposed when both SDISel and FastISel were used within the same function, i.e., one basic block is issued with FastISel and another with SDISel, as demonstrated with the testcase. <rdar://problem/15192473> git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192636 91177308-0d34-0410-b5e6-96231b3b80d8
*	Fix a typo, in a comment, in a test.	Nick Lewycky	2013-10-14	1	-1/+1
\| \| \| \|	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192632 91177308-0d34-0410-b5e6-96231b3b80d8
*	Revert part of a fix from 2010, changes since then:	Eric Christopher	2013-10-14	1	-1/+5
\| \| \| \| \| \| \| \| \| \| \| \|	a) x86-64 TLS has been documented b) the code path should use movq for the correct relocation to be generated. I've also added a fixme for the test case that we should improve the code generated, it should look something like is documented in the tls abi document. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192631 91177308-0d34-0410-b5e6-96231b3b80d8
*	MachineSink: Fix and tweak critical-edge breaking heuristic.	Will Dietz	2013-10-14	9	-20/+39
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Per original comment, the intention of this loop is to go ahead and break the critical edge (in order to sink this instruction) if there's reason to believe doing so might "unblock" the sinking of additional instructions that define registers used by this one. The idea is that if we have a few instructions to sink "together" breaking the edge might be worthwhile. This commit makes a few small changes to help better realize this goal: First, modify the loop to ignore registers defined by this instruction. We don't sink definitions of physical registers, and sinking an SSA definition isn't going to unblock an upstream instruction. Second, ignore uses of physical registers. Instructions that define physical registers are rejected for sinking, and so moving this one won't enable moving any defining instructions. As an added bonus, while virtual register use-def chains are generally small due to SSA goodness, iteration over the uses and definitions (used by hasOneNonDBGUse) for physical registers like EFLAGS can be rather expensive in practice. (This is the original reason for looking at this) Finally, to keep things simple continue to only consider this trick for registers that have a single use (via hasOneNonDBGUse), but to avoid spuriously breaking critical edges only do so if the definition resides in the same MBB and therefore this one directly blocks it from being sunk as well. If sinking them together is meant to be, let the iterative nature of this pass sink the definition into this block first. Update tests to accomodate this change, add new testcase where sinking avoids pipeline stalls. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192608 91177308-0d34-0410-b5e6-96231b3b80d8
*	[AArch64] Add support for NEON scalar integer compare instructions.	Chad Rosier	2013-10-14	1	-0/+128
\| \| \| \|	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192596 91177308-0d34-0410-b5e6-96231b3b80d8
*	Add Cortex-A57 support	Bernard Ogden	2013-10-14	1	-0/+13
\| \| \| \|	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192591 91177308-0d34-0410-b5e6-96231b3b80d8
*	Add subtarget feature support for Cortex-A53	Bernard Ogden	2013-10-14	1	-4/+8
\| \| \| \| \| \| \|	Some previous implicit defaults have changed, for example FP and NEON are now on by default. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192590 91177308-0d34-0410-b5e6-96231b3b80d8
*	Fixed a bug in dynamic allocation memory on stack.	Elena Demikhovsky	2013-10-14	3	-3/+62
\| \| \| \| \| \| \| \| \|	The alignment of allocated space was wrong, see Bugzila 17345. Done by Zvi Rackover <zvi.rackover@intel.com>. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192573 91177308-0d34-0410-b5e6-96231b3b80d8
*	R600: improve dump of S_WAITCNT	Vincent Lejeune	2013-10-13	1	-0/+37
\| \| \| \|	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192557 91177308-0d34-0410-b5e6-96231b3b80d8
*	R600: Use masked read sel for texture instructions	Vincent Lejeune	2013-10-13	1	-8/+7
\| \| \| \|	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192554 91177308-0d34-0410-b5e6-96231b3b80d8
*	R600: fix swizzle export	Vincent Lejeune	2013-10-13	2	-1/+147
\| \| \| \|	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192553 91177308-0d34-0410-b5e6-96231b3b80d8
*	Force a CPU on test so it doesn't depend on microarchitectural scheduling ↵	Benjamin Kramer	2013-10-12	1	-2/+2
\| \| \| \| \| \|	decisions. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192532 91177308-0d34-0410-b5e6-96231b3b80d8
*	For Mips16, start to consolidate all forms of 32 bit literal loading so that	Reed Kotler	2013-10-12	1	-6/+13
\| \| \| \| \| \| \| \|	they can be better handled and optimized in the Mips16 constant island code. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192520 91177308-0d34-0410-b5e6-96231b3b80d8
*	R600: Add scalar i32 add test	Matt Arsenault	2013-10-11	1	-0/+16
\| \| \| \|	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192501 91177308-0d34-0410-b5e6-96231b3b80d8
*	Use CHECK-LABEL	Matt Arsenault	2013-10-11	1	-8/+8
\| \| \| \|	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192500 91177308-0d34-0410-b5e6-96231b3b80d8
*	Remove kill flags after if conversion if necessary	Matthias Braun	2013-10-11	1	-0/+30
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When if converting something like: true: ... = R0<kill> false: ... = R0<kill> then the instructions of the true block must not have a <kill> flag anymore, as the instruction of the false block follow and do still read the R0 value. Specifically this patch determines the set of register live-in in the false block (possibly after simulating the liveness changes of the duplicated instructions). Each of these live-in registers mustn't be killed. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192482 91177308-0d34-0410-b5e6-96231b3b80d8
*	[DAGCombiner] Load slicing test case: attempt to really fix the buildbots ↵	Quentin Colombet	2013-10-11	1	-2/+2
\| \| \| \| \| \| \| \| \|	(used sse4.2 instead of avx!). <rdar://problem/14477220> git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192480 91177308-0d34-0410-b5e6-96231b3b80d8
*	[DAGCombiner] Reapply load slicing (192471) with a test that explicitly set ↵	Quentin Colombet	2013-10-11	1	-0/+140
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	sse4.2 support. This should fix the buildbots. Original commit message: [DAGCombiner] Slice a big load in two loads when the element are next to each other in memory and the target has paired load and performs post-isel loads combining. E.g., this optimization will transform something like this: a = load i64* addr b = trunc i64 a to i32 c = lshr i64 a, 32 d = trunc i64 c to i32 into: b = load i32* addr1 d = load i32* addr2 Where addr1 = addr2 +/- sizeof(i32), if the target supports paired load and performs post-isel loads combining. One should overload TargetLowering::hasPairedLoad to provide this information. The default is false. <rdar://problem/14477220> git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192476 91177308-0d34-0410-b5e6-96231b3b80d8
*	[DAGCombiner] Revert load slicing (r192471), until I figure out why it fails ↵	Quentin Colombet	2013-10-11	1	-140/+0
\| \| \| \| \| \|	on ubuntu. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192474 91177308-0d34-0410-b5e6-96231b3b80d8
*	Revert "Tests: Be less dependent on a specific schedule/regalloc"	Matthias Braun	2013-10-11	10	-53/+55
\| \| \| \| \| \| \| \| \|	This reverts r192454 Apparently FileCheck isn't as smart as I though and does not enforce a topological order between variable defs+uses. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192472 91177308-0d34-0410-b5e6-96231b3b80d8
*	[DAGCombiner] Slice a big load in two loads when the element are next to each	Quentin Colombet	2013-10-11	1	-0/+140
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	other in memory and the target has paired load and performs post-isel loads combining. E.g., this optimization will transform something like this: a = load i64* addr b = trunc i64 a to i32 c = lshr i64 a, 32 d = trunc i64 c to i32 into: b = load i32* addr1 d = load i32* addr2 Where addr1 = addr2 +/- sizeof(i32), if the target supports paired load and performs post-isel loads combining. One should overload TargetLowering::hasPairedLoad to provide this information. The default is false. <rdar://problem/14477220> git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192471 91177308-0d34-0410-b5e6-96231b3b80d8
*	[ARM] Fix FP ABI attributes with no VFP enabled.	Amara Emerson	2013-10-11	2	-8/+3
\| \| \| \|	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192458 91177308-0d34-0410-b5e6-96231b3b80d8
*	Tests: Be less dependent on a specific schedule/regalloc	Matthias Braun	2013-10-11	10	-55/+53
\| \| \| \|	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192454 91177308-0d34-0410-b5e6-96231b3b80d8
*	[mips][msa] Improves robustness of the test by enhancing pattern matching.	Matheus Almeida	2013-10-11	1	-240/+360
\| \| \| \|	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192446 91177308-0d34-0410-b5e6-96231b3b80d8
*	[NVPTX] Switch from StrongPHIElimination to PHIElimination in ↵	Justin Holewinski	2013-10-11	1	-0/+38
\| \| \| \| \| \| \| \|	NVPTXTargetMachine, and add some missing optimization passes to addOptimizedRegAlloc Fixes PR17529 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192445 91177308-0d34-0410-b5e6-96231b3b80d8
*	Make AsmPrinter::emitImplicitDef a virtual method so targets can emit custom ↵	Justin Holewinski	2013-10-11	1	-0/+9
\| \| \| \| \| \| \| \| \| \| \| \| \|	comments for implicit defs For NVPTX, this fixes a crash where the emitImplicitDef implementation was expecting physical registers, while NVPTX uses virtual registers (with a couple of exceptions). Now, the implicit def comment will be emitted as a true PTX register name. Other targets can use this to customize the output of implicit def comments. Fixes PR17519 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192444 91177308-0d34-0410-b5e6-96231b3b80d8
*	[ARM] Add a test case for disabled neon/fpu features.	Amara Emerson	2013-10-11	1	-0/+33
\| \| \| \|	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192440 91177308-0d34-0410-b5e6-96231b3b80d8
*	[mips][msa] Added support for matching maddv.[bhwd], and msubv.[bhwd] from ↵	Daniel Sanders	2013-10-11	1	-0/+160
\| \| \| \| \| \|	normal IR (i.e. not intrinsics) git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192438 91177308-0d34-0410-b5e6-96231b3b80d8
*	[mips][msa] Added support for matching fmsub.[wd] from normal IR (i.e. not ↵	Daniel Sanders	2013-10-11	1	-0/+40
\| \| \| \| \| \|	intrinsics) git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192435 91177308-0d34-0410-b5e6-96231b3b80d8
*	XCore target fix bug in emitArrayBound() causing segmentation fault	Robert Lytton	2013-10-11	2	-1/+8
\| \| \| \|	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192434 91177308-0d34-0410-b5e6-96231b3b80d8
*	XCore target does not emit '.hidden' or '.protected' attributes	Robert Lytton	2013-10-11	1	-0/+10
\| \| \| \|	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192433 91177308-0d34-0410-b5e6-96231b3b80d8
*	XCore target: fix bug in XCoreLowerThreadLocal.cpp	Robert Lytton	2013-10-11	1	-0/+39
\| \| \| \| \| \| \| \| \|	When a ConstantExpr which uses a thread local is part of a PHI node instruction, the insruction that replaces the ConstantExpr must be inserted in the predecessor block, in front of the terminator instruction. If the predecessor block has multiple successors, the edge is first split. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@192432 91177308-0d34-0410-b5e6-96231b3b80d8