blob: 9ae15174c5822e6515c8e40ac81da20584cf0340 (
plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
|
***
add gcc builtins for alpha instructions
***
custom expand byteswap into nifty
extract/insert/mask byte/word/longword/quadword low/high
sequences
***
see if any of the extract/insert/mask operations can be added
***
match more interesting things for cmovlbc cmovlbs (move if low bit clear/set)
***
lower srem and urem
remq(i,j): i - (j * divq(i,j)) if j != 0
remqu(i,j): i - (j * divqu(i,j)) if j != 0
reml(i,j): i - (j * divl(i,j)) if j != 0
remlu(i,j): i - (j * divlu(i,j)) if j != 0
***
add crazy vector instructions (MVI):
(MIN|MAX)(U|S)(B8|W4) min and max, signed and unsigned, byte and word
PKWB, UNPKBW pack/unpack word to byte
PKLB UNPKBL pack/unpack long to byte
PERR pixel error (sum accross bytes of bytewise abs(i8v8 a - i8v8 b))
cmpbytes bytewise cmpeq of i8v8 a and i8v8 b (not part of MVI extentions)
this has some good examples for other operations that can be synthesised well
from these rather meager vector ops (such as saturating add).
http://www.alphalinux.org/docs/MVI-full.html
|