summaryrefslogtreecommitdiffstats
path: root/src/glsl/nir/nir.h
diff options
context:
space:
mode:
authorMatt Turner <mattst88@gmail.com>2016-01-13 11:09:11 -0800
committerMatt Turner <mattst88@gmail.com>2016-01-14 09:28:01 -0800
commitb82e26a6a4d6baf121f44c61c862bfa79ba0d172 (patch)
treed8c97faf241b7ea0bd07ceccde1f59f5e7058986 /src/glsl/nir/nir.h
parent15640ee77ae601cba33cbbc72256e55e03a363e5 (diff)
downloadexternal_mesa3d-b82e26a6a4d6baf121f44c61c862bfa79ba0d172.zip
external_mesa3d-b82e26a6a4d6baf121f44c61c862bfa79ba0d172.tar.gz
external_mesa3d-b82e26a6a4d6baf121f44c61c862bfa79ba0d172.tar.bz2
nir: Lower bitfield_extract.
The OpenGL specifications for bitfieldExtract() says: The result will be undefined if <offset> or <bits> is negative, or if the sum of <offset> and <bits> is greater than the number of bits used to store the operand. Therefore passing bits=32, offset=0 is legal and defined in GLSL. But the earlier SM5 ubfe/ibfe opcodes are specified to accept a bitfield width ranging from 0-31. As such, Intel and AMD instructions read only the low 5 bits of the width operand, making them not able to implement the GLSL-specified behavior directly. This commit adds ubfe/ibfe operations from SM5 and a lowering pass for bitfield_extract to to handle the trivial case of <bits> = 32 as bitfieldExtract: bits > 31 ? value : bfe(value, offset, bits) Fixes: ES31-CTS.shader_bitfield_operation.bitfieldExtract.uvec3_0 Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=92595 Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Tested-by: Marta Lofstedt <marta.lofstedt@intel.com>
Diffstat (limited to 'src/glsl/nir/nir.h')
-rw-r--r--src/glsl/nir/nir.h1
1 files changed, 1 insertions, 0 deletions
diff --git a/src/glsl/nir/nir.h b/src/glsl/nir/nir.h
index 23aec69..11add65 100644
--- a/src/glsl/nir/nir.h
+++ b/src/glsl/nir/nir.h
@@ -1447,6 +1447,7 @@ typedef struct nir_shader_compiler_options {
bool lower_fsat;
bool lower_fsqrt;
bool lower_fmod;
+ bool lower_bitfield_extract;
bool lower_bitfield_insert;
bool lower_uadd_carry;
bool lower_usub_borrow;