external_llvm.git - Unnamed repository; edit this file 'description' to name the repository.

	Commit message (Collapse)	Author	Age	Files	Lines
*	Optimized integer vector multiplication operation by replacing it with ↵	Elena Demikhovsky	2013-06-26	5	-3/+85
\| \| \| \| \| \|	shift/xor/sub when it is possible. Fixed a bug in SDIV, where the const operand is not a splat constant vector. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184931 91177308-0d34-0410-b5e6-96231b3b80d8
*	R600: Use new getNamedOperandIdx function generated by TableGen	Tom Stellard	2013-06-25	1	-0/+59
\| \| \| \|	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184880 91177308-0d34-0410-b5e6-96231b3b80d8
*	R600: Add v2i32 test for vselect	Aaron Watry	2013-06-25	1	-6/+20
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Note: Only adding test for evergreen, not SI yet. When I attempted to expand vselect for SI, I got the following: llc: /home/awatry/src/llvm/lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp:522: llvm::SDValue llvm::DAGTypeLegalizer::PromoteIntRes_SETCC(llvm::SDNode*): Assertion `SVT.isVector() == N->getOperand(0).getValueType().isVector() && "Vector compare must return a vector result!"' failed. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184847 91177308-0d34-0410-b5e6-96231b3b80d8
*	R600/SI: Expand xor v2i32/v4i32	Aaron Watry	2013-06-25	1	-7/+33
\| \| \| \| \| \| \| \|	Add test cases for both vector sizes on SI and also add v2i32 test for EG. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184846 91177308-0d34-0410-b5e6-96231b3b80d8
*	R600: Add v2i32 test for setcc on evergreen	Aaron Watry	2013-06-25	1	-3/+22
\| \| \| \| \| \| \| \| \| \| \|	No test/expansion for SI has been added yet. Attempts to expand this operation for SI resulted in a stacktrace in (IIRC) LegalizeIntegerTypes which was complaining about vector comparisons being required to return a vector type. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184845 91177308-0d34-0410-b5e6-96231b3b80d8
*	R600/SI: Expand urem of v2i32/v4i32 for SI	Aaron Watry	2013-06-25	1	-4/+23
\| \| \| \| \| \| \| \| \| \| \| \|	Also add lit test for both cases on SI, and v2i32 for evergreen. Note: I followed the guidance of the v4i32 EG check... UREM produces really complex code, so let's just check that the instruction was lowered successfully. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184844 91177308-0d34-0410-b5e6-96231b3b80d8
*	R600/SI: Expand udiv v[24]i32 for SI and v2i32 for EG	Aaron Watry	2013-06-25	1	-3/+22
\| \| \| \| \| \| \| \| \| \| \| \|	Also add lit test for both cases on SI, and v2i32 for evergreen. Note: I followed the guidance of the v4i32 EG check... UDIV produces really complex code, so let's just check that the instruction was lowered successfully. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184843 91177308-0d34-0410-b5e6-96231b3b80d8
*	R600/SI: Expand ashr of v2i32/v4i32 for SI	Aaron Watry	2013-06-25	1	-7/+34
\| \| \| \| \| \| \| \|	Also add lit test for both cases on SI, and v2i32 for evergreen. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184842 91177308-0d34-0410-b5e6-96231b3b80d8
*	R600/SI: Expand srl of v2i32/v4i32 for SI	Aaron Watry	2013-06-25	1	-7/+35
\| \| \| \| \| \| \| \|	Also add lit test for both cases on SI, and v2i32 for evergreen. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184841 91177308-0d34-0410-b5e6-96231b3b80d8
*	R600/SI: Expand shl of v2i32/v4i32 for SI	Aaron Watry	2013-06-25	1	-7/+34
\| \| \| \| \| \| \| \|	Also add lit test for both cases on SI, and v2i32 for evergreen. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184840 91177308-0d34-0410-b5e6-96231b3b80d8
*	R600/SI: Expand or of v2i32/v4i32 for SI	Aaron Watry	2013-06-25	1	-7/+34
\| \| \| \| \| \| \| \|	Also add lit test for both cases on SI, and v2i32 for evergreen. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184839 91177308-0d34-0410-b5e6-96231b3b80d8
*	R600/SI: Expand mul of v2i32/v4i32 for SI	Aaron Watry	2013-06-25	1	-6/+32
\| \| \| \| \| \| \| \|	Also add lit test for both cases on SI, and v2i32 for evergreen. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184838 91177308-0d34-0410-b5e6-96231b3b80d8
*	R600/SI: Expand and of v2i32/v4i32 for SI	Aaron Watry	2013-06-25	1	-6/+31
\| \| \| \| \| \| \| \|	Also add lit test for both cases on SI, and v2i32 for evergreen. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184837 91177308-0d34-0410-b5e6-96231b3b80d8
*	Revert "Temporarily enable MI-Sched on X86."	Andrew Trick	2013-06-25	63	-273/+257
\| \| \| \| \| \|	This reverts commit 98a9b72e8c56dc13a2617de84503a3d78352789c. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184823 91177308-0d34-0410-b5e6-96231b3b80d8
*	R600/SI: Report unaligned memory accesses as legal for > 32-bit types	Tom Stellard	2013-06-25	1	-0/+32
\| \| \| \| \| \| \| \| \| \| \|	In reality, some unaligned memory accesses are legal for 32-bit types and smaller too, but it all depends on the address space. Allowing unaligned loads/stores for > 32-bit types is mainly to prevent the legalizer from splitting one load into multiple loads of smaller types. https://bugs.freedesktop.org/show_bug.cgi?id=65873 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184822 91177308-0d34-0410-b5e6-96231b3b80d8
*	R600: Add support for i32 loads from the constant address space on Cayman	Tom Stellard	2013-06-25	1	-0/+1
\| \| \| \| \| \|	Tested-By: Aaron Watry <awatry@gmail.com> git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184821 91177308-0d34-0410-b5e6-96231b3b80d8
*	R600/SI: Add support for v4i32 and v4f32 kernel args	Tom Stellard	2013-06-25	1	-6/+10
\| \| \| \| \| \|	Tested-By: Aaron Watry <awatry@gmail.com> git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184820 91177308-0d34-0410-b5e6-96231b3b80d8
*	R600: Fix typo in R600Schedule.td	Tom Stellard	2013-06-25	1	-0/+34
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This should only make a difference in programs that use a lot of the vector ALU instructions like BFI_INT and BIT_ALIGN. There is a slight improvement in the phatk bitcoin mining kernel with this patch on Evergreen (vector size == 1): Before: 1173 Instruction Groups / 9520 dwords After: 1167 Instruction Groups / 9510 dwords Reviewed-by: Reviewed-by: Vincent Lejeune<vljn at ovi.com> git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184819 91177308-0d34-0410-b5e6-96231b3b80d8
*	llvm/test/CodeGen/X86: Add explicit -mtriple=x86_64-unknown-unknown.	NAKAMURA Takumi	2013-06-24	2	-2/+2
\| \| \| \|	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184731 91177308-0d34-0410-b5e6-96231b3b80d8
*	llvm/test/CodeGen/X86/legalize-shift-64.ll: Add explicit ↵	NAKAMURA Takumi	2013-06-24	1	-1/+1
\| \| \| \| \| \|	-mtriple=i686-unknown-unknown. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184730 91177308-0d34-0410-b5e6-96231b3b80d8
*	Add -mcpu to some unit tests that only fail on certain hosts.	Andrew Trick	2013-06-24	7	-8/+8
\| \| \| \|	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184709 91177308-0d34-0410-b5e6-96231b3b80d8
*	Temporarily enable MI-Sched on X86.	Andrew Trick	2013-06-24	63	-257/+273
\| \| \| \| \| \| \|	Sorry for the unit test churn. I'll try to make the change permanently next time. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184705 91177308-0d34-0410-b5e6-96231b3b80d8
*	Fix tail merging to assign the (more) correct BasicBlock when splitting.	Andrew Trick	2013-06-24	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \|	This makes it possible to write unit tests that are less susceptible to minor code motion, particularly copy placement. block-placement.ll covers this case with -pre-RA-sched=source which will soon be default. One incorrectly named block is already fixed, but without this fix, enabling new coalescing and scheduling would cause more failures. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184680 91177308-0d34-0410-b5e6-96231b3b80d8
*	Add MI-Sched support for x86 macro fusion.	Andrew Trick	2013-06-23	1	-0/+108
\| \| \| \| \| \| \| \|	This is an awful implementation of the target hook. But we don't have abstractions yet for common machine ops, and I don't see any quick way to make it table-driven. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184664 91177308-0d34-0410-b5e6-96231b3b80d8
*	Replace with a shorter test case produced by Doug Gillmore.	Reed Kotler	2013-06-22	1	-6392/+28
\| \| \| \|	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184645 91177308-0d34-0410-b5e6-96231b3b80d8
*	DebugInfo: Don't lose unreferenced non-trivial by-value parameters	David Blaikie	2013-06-21	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \|	A FastISel optimization was causing us to emit no information for such parameters & when they go missing we end up emitting a different function type. By avoiding that shortcut we not only get types correct (very important) but also location information (handy) - even if it's only live at the start of a function & may be clobbered later. Reviewed/discussion by Evan Cheng & Dan Gohman. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184604 91177308-0d34-0410-b5e6-96231b3b80d8
*	Add '-mcpu=' to prevent breaking on ATOM due to different code schedule	Michael Liao	2013-06-21	1	-1/+1
\| \| \| \|	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184591 91177308-0d34-0410-b5e6-96231b3b80d8
*	[NVPTX] Add support for selecting CUDA vs OCL mode based on triple	Justin Holewinski	2013-06-21	5	-7/+11
\| \| \| \| \| \|	IR for CUDA should use "nvptx[64]-nvidia-cuda", and IR for NV OpenCL should use "nvptx[64]-nvidia-nvcl" git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184579 91177308-0d34-0410-b5e6-96231b3b80d8
*	Add missing REQUIRES: asserts in crash.ll.	Andrew Trick	2013-06-21	1	-0/+1
\| \| \| \|	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184576 91177308-0d34-0410-b5e6-96231b3b80d8
*	Fix PR16360	Michael Liao	2013-06-21	1	-0/+16
\| \| \| \| \| \| \| \| \|	When (srl (anyextend x), c) is folded into (anyextend (srl x, c)), the high bits are not cleared. Add 'and' to clear off them. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184575 91177308-0d34-0410-b5e6-96231b3b80d8
*	Update physreg live intervals during remat.	Andrew Trick	2013-06-21	1	-4/+4
\| \| \| \|	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184574 91177308-0d34-0410-b5e6-96231b3b80d8
*	ARM: Remove a (false) dependency on the memoryoperand's value as we do not use	Quentin Colombet	2013-06-20	2	-2/+44
\| \| \| \| \| \| \| \| \| \| \|	it at the moment. This allows to form more paired loads even when stack coloring pass destroys the memoryoperand's value. <rdar://problem/13978317> git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184492 91177308-0d34-0410-b5e6-96231b3b80d8
*	R600/SI: Expand sub for v2i32 and v4i32 for SI	Tom Stellard	2013-06-20	1	-6/+31
\| \| \| \| \| \| \| \| \| \| \|	Also add a v2i32 test to the existing v4i32 test. Patch by: Aaron Watry Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Signed-off-by: Aaron Watry<awatry@gmail.com> git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184482 91177308-0d34-0410-b5e6-96231b3b80d8
*	R600/SI: Expand add for v2i32 and v4i32	Tom Stellard	2013-06-20	1	-6/+31
\| \| \| \| \| \| \| \| \| \| \| \|	Also add SI tests to existing file and a v2i32 test for both R600 and SI. Patch by: Aaron Watry Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Signed-off-by: Aaron Watry <awatry@gmail.com> git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184481 91177308-0d34-0410-b5e6-96231b3b80d8
*	R600: Expand v2i32 load/store instead of custom lowering	Tom Stellard	2013-06-20	1	-0/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	The custom lowering causes llc to crash with a segfault. Ideally, the custom lowering can be fixed, but this allows programs which load/store v2i32 to work without crashing. Patch by: Aaron Watry Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Signed-off-by: Aaron Watry<awatry@gmail.com> git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184480 91177308-0d34-0410-b5e6-96231b3b80d8
*	DebugInfo: don't use location lists when the location covers the whole ↵	David Blaikie	2013-06-20	1	-12/+6
\| \| \| \| \| \| \| \| \| \| \| \| \|	function anyway Fix up three tests - one that was relying on abbreviation number, another relying on a location list in this case (& testing raw asm, changed that to use dwarfdump on the debug_info now that that's where the location is), and another which was added in r184368 - exposing a bug in that fix that is exposed when we emit the location inline rather than through a location list. Fix that bug while I'm here. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184387 91177308-0d34-0410-b5e6-96231b3b80d8
*	AArch64: remove accidental test output file.	Tim Northover	2013-06-18	1	-208/+0
\| \| \| \|	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184236 91177308-0d34-0410-b5e6-96231b3b80d8
*	During SelectionDAG building explicitly set a node to constant zero when the	Quentin Colombet	2013-06-18	4	-5/+41
\| \| \| \| \| \| \| \| \| \| \| \| \|	value is zero. This allows optmizations to kick in more easily. Fix some test cases so that they remain meaningful (i.e., not completely dead coded) when optimizations apply. <rdar://problem/14096009> superfluous multiply by high part of zero-extended value. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184222 91177308-0d34-0410-b5e6-96231b3b80d8
*	Reenable, improve, and add MI-Sched unit tests.	Andrew Trick	2013-06-17	3	-15/+60
\| \| \| \|	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184134 91177308-0d34-0410-b5e6-96231b3b80d8
*	R600: PV stores Reg id, not index	Vincent Lejeune	2013-06-17	1	-0/+50
\| \| \| \|	git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184117 91177308-0d34-0410-b5e6-96231b3b80d8
*	R600: Properly set COUNT_3 bit in TEX clause initiating inst for pre EG gen.	Vincent Lejeune	2013-06-17	1	-0/+44
\| \| \| \| \| \| \|	Fixes rv7x0 bug in Heaven reported here: https://bugs.freedesktop.org/show_bug.cgi?id=64257 git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184116 91177308-0d34-0410-b5e6-96231b3b80d8
*	Switch spill weights from a basic loop depth estimation to BlockFrequencyInfo.	Benjamin Kramer	2013-06-17	4	-87/+11
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The main advantages here are way better heuristics, taking into account not just loop depth but also __builtin_expect and other static heuristics and will eventually learn how to use profile info. Most of the work in this patch is pushing the MachineBlockFrequencyInfo analysis into the right places. This is good for a 5% speedup on zlib's deflate (x86_64), there were some very unfortunate spilling decisions in its hottest loop in longest_match(). Other benchmarks I tried were mostly neutral. This changes register allocation in subtle ways, update the tests for it. 2012-02-20-MachineCPBug.ll was deleted as it's very fragile and the instruction it looked for was gone already (but the FileCheck pattern picked up unrelated stuff). git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184105 91177308-0d34-0410-b5e6-96231b3b80d8
*	Debug Info: Simplify Frame Index handling in DBG_VALUE Machine Instructions	David Blaikie	2013-06-16	2	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Rather than using the full power of target-specific addressing modes in DBG_VALUEs with Frame Indicies, simply use Frame Index + Offset. This reduces the complexity of debug info handling down to two representations of values (reg+offset and frame index+offset) rather than three or four. Ideally we could ensure that frame indicies had been eliminated by the time we reached an assembly or dwarf generation, but I haven't spent the time to figure out where the FIs are leaking through into that & whether there's a good place to convert them. Some FI+offset=>reg+offset conversion is done (see PrologEpilogInserter, for example) which is necessary for some SelectionDAG assumptions about registers, I believe, but it might be possible to make this a more thorough conversion & ensure there are no remaining FIs no matter how instruction selection is performed. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184066 91177308-0d34-0410-b5e6-96231b3b80d8
*	DebugInfo: follow up to 184045 to constrain the tests further to ensure they ↵	David Blaikie	2013-06-15	4	-5/+5
\| \| \| \| \| \|	don't contain +0 offsets git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184046 91177308-0d34-0410-b5e6-96231b3b80d8
*	DebugInfo: print DBG_VALUE MachineInstrs with [] for deref and drop the ↵	David Blaikie	2013-06-15	5	-6/+6
\| \| \| \| \| \|	offset when it's zero git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184045 91177308-0d34-0410-b5e6-96231b3b80d8
*	Machine Model: Add MicroOpBufferSize and resource BufferSize.	Andrew Trick	2013-06-15	3	-12/+14
\| \| \| \| \| \| \| \| \| \| \| \| \|	Replace the ill-defined MinLatency and ILPWindow properties with with straightforward buffer sizes: MCSchedMode::MicroOpBufferSize MCProcResourceDesc::BufferSize These can be used to more precisely model instruction execution if desired. Disabled some misched tests temporarily. They'll be reenabled in a few commits. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184032 91177308-0d34-0410-b5e6-96231b3b80d8
*	Debug Info: Don't print the display name and colon prefix for DEBUG_VALUE ↵	David Blaikie	2013-06-15	1	-1/+1
\| \| \| \| \| \|	comments if the display name is empty git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184026 91177308-0d34-0410-b5e6-96231b3b80d8
*	R600: Add SI load support for v[24]i32 and store for v2i32	Tom Stellard	2013-06-15	1	-0/+19
\| \| \| \| \| \| \| \| \| \| \| \|	Also add a seperate vector lit test file, since r600 doesn't seem to handle v2i32 load/store yet, but we can test both for SI. Patch by: Aaron Watry Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Signed-off-by: Aaron Watry <awatry@gmail.com> git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184021 91177308-0d34-0410-b5e6-96231b3b80d8
*	R600: Use correct encoding for Vertex Fetch instructions on Cayman	Tom Stellard	2013-06-14	1	-0/+25
\| \| \| \| \| \|	Reviewed-by: Vincent Lejeune<vljn at ovi.com> git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184016 91177308-0d34-0410-b5e6-96231b3b80d8
*	R600: Use EXPORT_RAT_INST_STORE_DWORD for stores on Cayman	Tom Stellard	2013-06-14	1	-0/+3
\| \| \| \| \| \| \| \| \| \|	We were using RAT_INST_STORE_RAW, which seemed to work, but the docs say this instruction doesn't exist for Cayman, so it's probably safer to use a documented instruction instead. Reviewed-by: Vincent Lejeune<vljn at ovi.com> git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@184015 91177308-0d34-0410-b5e6-96231b3b80d8