Refactor isInTailCallPosition handling

This change came about primarily because of two issues in the existing code. Niether of: define i64 @test1(i64 %val) { %in = trunc i64 %val to i32 tail call i32 @ret32(i32 returned %in) ret i64 %val } define i64 @test2(i64 %val) { tail call i32 @ret32(i32 returned undef) ret i32 42 } should be tail calls, and the function sameNoopInput is responsible. The main problem is that it is completely symmetric in the "tail call" and "ret" value, but in reality different things are allowed on each side. For these cases: 1. Any truncation should lead to a larger value being generated by "tail call" than needed by "ret". 2. Undef should only be allowed as a source for ret, not as a result of the call. Along the way I noticed that a mismatch between what this function treats as a valid truncation and what the backends see can lead to invalid calls as well (see x86-32 test case). This patch refactors the code so that instead of being based primarily on values which it recurses into when necessary, it starts by inspecting the type and considers each fundamental slot that the backend will see in turn. For example, given a pathological function that returned {{}, {{}, i32, {}}, i32} we would consider each "real" i32 in turn, and ask if it passes through unchanged. This is much closer to what the backend sees as a result of ComputeValueVTs. Aside from the bug fixes, this eliminates the recursion that's going on and, I believe, makes the bulk of the code significantly easier to understand. The trade-off is the nasty iterators needed to find the real types inside a returned value. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@187787 91177308-0d34-0410-b5e6-96231b3b80d8
author: Tim Northover <tnorthover@apple.com> 2013-08-06 09:12:35 +0000
committer: Tim Northover <tnorthover@apple.com> 2013-08-06 09:12:35 +0000
commit: d113448c1dd5f40522c3c02db96e87a9eb59eaf4 (patch)
tree: e16c53d8abd4e812b53a411d8291cb147021cb5c /include/llvm/Target/TargetLowering.h
parent: 900cbf5545bbdf2c24f645da8cfd071c792f753d (diff)
download: external_llvm-d113448c1dd5f40522c3c02db96e87a9eb59eaf4.zip
external_llvm-d113448c1dd5f40522c3c02db96e87a9eb59eaf4.tar.gz
external_llvm-d113448c1dd5f40522c3c02db96e87a9eb59eaf4.tar.bz2
1 files changed, 9 insertions, 0 deletions
diff --git a/include/llvm/Target/TargetLowering.h b/include/llvm/Target/TargetLowering.h
index 69bfe70..c3fa3cc 100644
--- a/include/llvm/Target/TargetLowering.h
+++ b/include/llvm/Target/TargetLowering.h
@@ -1152,6 +1152,15 @@ public:
     return false;
   }
 
+  /// Return true if a truncation from Ty1 to Ty2 is permitted when deciding
+  /// whether a call is in tail position. Typically this means that both results
+  /// would be assigned to the same register or stack slot, but it could mean
+  /// the target performs adequate checks of its own before proceeding with the
+  /// tail call.
+  virtual bool allowTruncateForTailCall(Type * /*Ty1*/, Type * /*Ty2*/) const {
+    return false;
+  }
+
   virtual bool isTruncateFree(EVT /*VT1*/, EVT /*VT2*/) const {
     return false;
   }
author	Tim Northover <tnorthover@apple.com>	2013-08-06 09:12:35 +0000
committer	Tim Northover <tnorthover@apple.com>	2013-08-06 09:12:35 +0000
commit	d113448c1dd5f40522c3c02db96e87a9eb59eaf4 (patch)
tree	e16c53d8abd4e812b53a411d8291cb147021cb5c /include/llvm/Target/TargetLowering.h
parent	900cbf5545bbdf2c24f645da8cfd071c792f753d (diff)
download	external_llvm-d113448c1dd5f40522c3c02db96e87a9eb59eaf4.zip external_llvm-d113448c1dd5f40522c3c02db96e87a9eb59eaf4.tar.gz external_llvm-d113448c1dd5f40522c3c02db96e87a9eb59eaf4.tar.bz2