source: trunk/third/gcc/rtl.texi @ 11288

Revision 11288, 114.5 KB checked in by ghudson, 26 years ago (diff)
This commit was generated by cvs2svn to compensate for changes in r11287, which included commits to RCS files with non-trunk default branches.
Line 
1@c Copyright (C) 1988, 89, 92, 94, 97, 1998 Free Software Foundation, Inc.
2@c This is part of the GCC manual.
3@c For copying conditions, see the file gcc.texi.
4
5@node RTL
6@chapter RTL Representation
7@cindex RTL representation
8@cindex representation of RTL
9@cindex Register Transfer Language (RTL)
10
11Most of the work of the compiler is done on an intermediate representation
12called register transfer language.  In this language, the instructions to be
13output are described, pretty much one by one, in an algebraic form that
14describes what the instruction does.
15
16RTL is inspired by Lisp lists.  It has both an internal form, made up of
17structures that point at other structures, and a textual form that is used
18in the machine description and in printed debugging dumps.  The textual
19form uses nested parentheses to indicate the pointers in the internal form.
20
21@menu
22* RTL Objects::       Expressions vs vectors vs strings vs integers.
23* Accessors::         Macros to access expression operands or vector elts.
24* Flags::             Other flags in an RTL expression.
25* Machine Modes::     Describing the size and format of a datum.
26* Constants::         Expressions with constant values.
27* Regs and Memory::   Expressions representing register contents or memory.
28* Arithmetic::        Expressions representing arithmetic on other expressions.
29* Comparisons::       Expressions representing comparison of expressions.
30* Bit Fields::        Expressions representing bitfields in memory or reg.
31* Conversions::       Extending, truncating, floating or fixing.
32* RTL Declarations::  Declaring volatility, constancy, etc.
33* Side Effects::      Expressions for storing in registers, etc.
34* Incdec::            Embedded side-effects for autoincrement addressing.
35* Assembler::         Representing @code{asm} with operands.
36* Insns::             Expression types for entire insns.
37* Calls::             RTL representation of function call insns.
38* Sharing::           Some expressions are unique; others *must* be copied.
39* Reading RTL::       Reading textual RTL from a file.
40@end menu
41
42@node RTL Objects, Accessors, RTL, RTL
43@section RTL Object Types
44@cindex RTL object types
45
46@cindex RTL integers
47@cindex RTL strings
48@cindex RTL vectors
49@cindex RTL expression
50@cindex RTX (See RTL)
51RTL uses five kinds of objects: expressions, integers, wide integers,
52strings and vectors.  Expressions are the most important ones.  An RTL
53expression (``RTX'', for short) is a C structure, but it is usually
54referred to with a pointer; a type that is given the typedef name
55@code{rtx}.
56
57An integer is simply an @code{int}; their written form uses decimal digits.
58A wide integer is an integral object whose type is @code{HOST_WIDE_INT}
59(@pxref{Config}); their written form uses decimal digits.
60
61A string is a sequence of characters.  In core it is represented as a
62@code{char *} in usual C fashion, and it is written in C syntax as well.
63However, strings in RTL may never be null.  If you write an empty string in
64a machine description, it is represented in core as a null pointer rather
65than as a pointer to a null character.  In certain contexts, these null
66pointers instead of strings are valid.  Within RTL code, strings are most
67commonly found inside @code{symbol_ref} expressions, but they appear in
68other contexts in the RTL expressions that make up machine descriptions. 
69
70A vector contains an arbitrary number of pointers to expressions.  The
71number of elements in the vector is explicitly present in the vector.
72The written form of a vector consists of square brackets
73(@samp{[@dots{}]}) surrounding the elements, in sequence and with
74whitespace separating them.  Vectors of length zero are not created;
75null pointers are used instead.
76
77@cindex expression codes
78@cindex codes, RTL expression
79@findex GET_CODE
80@findex PUT_CODE
81Expressions are classified by @dfn{expression codes} (also called RTX
82codes).  The expression code is a name defined in @file{rtl.def}, which is
83also (in upper case) a C enumeration constant.  The possible expression
84codes and their meanings are machine-independent.  The code of an RTX can
85be extracted with the macro @code{GET_CODE (@var{x})} and altered with
86@code{PUT_CODE (@var{x}, @var{newcode})}.
87
88The expression code determines how many operands the expression contains,
89and what kinds of objects they are.  In RTL, unlike Lisp, you cannot tell
90by looking at an operand what kind of object it is.  Instead, you must know
91from its context---from the expression code of the containing expression.
92For example, in an expression of code @code{subreg}, the first operand is
93to be regarded as an expression and the second operand as an integer.  In
94an expression of code @code{plus}, there are two operands, both of which
95are to be regarded as expressions.  In a @code{symbol_ref} expression,
96there is one operand, which is to be regarded as a string.
97
98Expressions are written as parentheses containing the name of the
99expression type, its flags and machine mode if any, and then the operands
100of the expression (separated by spaces).
101
102Expression code names in the @samp{md} file are written in lower case,
103but when they appear in C code they are written in upper case.  In this
104manual, they are shown as follows: @code{const_int}.
105
106@cindex (nil)
107@cindex nil
108In a few contexts a null pointer is valid where an expression is normally
109wanted.  The written form of this is @code{(nil)}.
110
111@node Accessors, Flags, RTL Objects, RTL
112@section Access to Operands
113@cindex accessors
114@cindex access to operands
115@cindex operand access
116
117@cindex RTL format
118For each expression type @file{rtl.def} specifies the number of
119contained objects and their kinds, with four possibilities: @samp{e} for
120expression (actually a pointer to an expression), @samp{i} for integer,
121@samp{w} for wide integer, @samp{s} for string, and @samp{E} for vector
122of expressions.  The sequence of letters for an expression code is
123called its @dfn{format}.  Thus, the format of @code{subreg} is
124@samp{ei}.@refill
125
126@cindex RTL format characters
127A few other format characters are used occasionally:
128
129@table @code
130@item u
131@samp{u} is equivalent to @samp{e} except that it is printed differently
132in debugging dumps.  It is used for pointers to insns.
133
134@item n
135@samp{n} is equivalent to @samp{i} except that it is printed differently
136in debugging dumps.  It is used for the line number or code number of a
137@code{note} insn.
138
139@item S
140@samp{S} indicates a string which is optional.  In the RTL objects in
141core, @samp{S} is equivalent to @samp{s}, but when the object is read,
142from an @samp{md} file, the string value of this operand may be omitted.
143An omitted string is taken to be the null string.
144
145@item V
146@samp{V} indicates a vector which is optional.  In the RTL objects in
147core, @samp{V} is equivalent to @samp{E}, but when the object is read
148from an @samp{md} file, the vector value of this operand may be omitted.
149An omitted vector is effectively the same as a vector of no elements.
150
151@item 0
152@samp{0} means a slot whose contents do not fit any normal category.
153@samp{0} slots are not printed at all in dumps, and are often used in
154special ways by small parts of the compiler.
155@end table
156
157There are macros to get the number of operands, the format, and the
158class of an expression code:
159
160@table @code
161@findex GET_RTX_LENGTH
162@item GET_RTX_LENGTH (@var{code})
163Number of operands of an RTX of code @var{code}.
164
165@findex GET_RTX_FORMAT
166@item GET_RTX_FORMAT (@var{code})
167The format of an RTX of code @var{code}, as a C string.
168
169@findex GET_RTX_CLASS
170@cindex classes of RTX codes
171@item GET_RTX_CLASS (@var{code})
172A single character representing the type of RTX operation that code
173@var{code} performs.
174
175The following classes are defined:
176
177@table @code
178@item o
179An RTX code that represents an actual object, such as @code{reg} or
180@code{mem}.  @code{subreg} is not in this class.
181
182@item <
183An RTX code for a comparison.  The codes in this class are
184@code{NE}, @code{EQ}, @code{LE}, @code{LT}, @code{GE}, @code{GT},
185@code{LEU}, @code{LTU}, @code{GEU}, @code{GTU}.@refill
186
187@item 1
188An RTX code for a unary arithmetic operation, such as @code{neg}.
189
190@item c
191An RTX code for a commutative binary operation, other than @code{NE}
192and @code{EQ} (which have class @samp{<}).
193
194@item 2
195An RTX code for a noncommutative binary operation, such as @code{MINUS}.
196
197@item b
198An RTX code for a bitfield operation, either @code{ZERO_EXTRACT} or
199@code{SIGN_EXTRACT}.
200
201@item 3
202An RTX code for other three input operations, such as @code{IF_THEN_ELSE}.
203
204@item i
205An RTX code for a machine insn (@code{INSN}, @code{JUMP_INSN}, and
206@code{CALL_INSN}).@refill
207
208@item m
209An RTX code for something that matches in insns, such as @code{MATCH_DUP}.
210
211@item x
212All other RTX codes.
213@end table
214@end table
215
216@findex XEXP
217@findex XINT
218@findex XWINT
219@findex XSTR
220Operands of expressions are accessed using the macros @code{XEXP},
221@code{XINT}, @code{XWINT} and @code{XSTR}.  Each of these macros takes
222two arguments: an expression-pointer (RTX) and an operand number
223(counting from zero).  Thus,@refill
224
225@example
226XEXP (@var{x}, 2)
227@end example
228
229@noindent
230accesses operand 2 of expression @var{x}, as an expression.
231
232@example
233XINT (@var{x}, 2)
234@end example
235
236@noindent
237accesses the same operand as an integer.  @code{XSTR}, used in the same
238fashion, would access it as a string.
239
240Any operand can be accessed as an integer, as an expression or as a string.
241You must choose the correct method of access for the kind of value actually
242stored in the operand.  You would do this based on the expression code of
243the containing expression.  That is also how you would know how many
244operands there are.
245
246For example, if @var{x} is a @code{subreg} expression, you know that it has
247two operands which can be correctly accessed as @code{XEXP (@var{x}, 0)}
248and @code{XINT (@var{x}, 1)}.  If you did @code{XINT (@var{x}, 0)}, you
249would get the address of the expression operand but cast as an integer;
250that might occasionally be useful, but it would be cleaner to write
251@code{(int) XEXP (@var{x}, 0)}.  @code{XEXP (@var{x}, 1)} would also
252compile without error, and would return the second, integer operand cast as
253an expression pointer, which would probably result in a crash when
254accessed.  Nothing stops you from writing @code{XEXP (@var{x}, 28)} either,
255but this will access memory past the end of the expression with
256unpredictable results.@refill
257
258Access to operands which are vectors is more complicated.  You can use the
259macro @code{XVEC} to get the vector-pointer itself, or the macros
260@code{XVECEXP} and @code{XVECLEN} to access the elements and length of a
261vector.
262
263@table @code
264@findex XVEC
265@item XVEC (@var{exp}, @var{idx})
266Access the vector-pointer which is operand number @var{idx} in @var{exp}.
267
268@findex XVECLEN
269@item XVECLEN (@var{exp}, @var{idx})
270Access the length (number of elements) in the vector which is
271in operand number @var{idx} in @var{exp}.  This value is an @code{int}.
272
273@findex XVECEXP
274@item XVECEXP (@var{exp}, @var{idx}, @var{eltnum})
275Access element number @var{eltnum} in the vector which is
276in operand number @var{idx} in @var{exp}.  This value is an RTX.
277
278It is up to you to make sure that @var{eltnum} is not negative
279and is less than @code{XVECLEN (@var{exp}, @var{idx})}.
280@end table
281
282All the macros defined in this section expand into lvalues and therefore
283can be used to assign the operands, lengths and vector elements as well as
284to access them.
285
286@node Flags, Machine Modes, Accessors, RTL
287@section Flags in an RTL Expression
288@cindex flags in RTL expression
289
290RTL expressions contain several flags (one-bit bitfields) that are used
291in certain types of expression.  Most often they are accessed with the
292following macros:
293
294@table @code
295@findex MEM_VOLATILE_P
296@cindex @code{mem} and @samp{/v}
297@cindex @code{volatil}, in @code{mem}
298@cindex @samp{/v} in RTL dump
299@item MEM_VOLATILE_P (@var{x})
300In @code{mem} expressions, nonzero for volatile memory references.
301Stored in the @code{volatil} field and printed as @samp{/v}.
302
303@findex MEM_IN_STRUCT_P
304@cindex @code{mem} and @samp{/s}
305@cindex @code{in_struct}, in @code{mem}
306@cindex @samp{/s} in RTL dump
307@item MEM_IN_STRUCT_P (@var{x})
308In @code{mem} expressions, nonzero for reference to an entire
309structure, union or array, or to a component of one.  Zero for
310references to a scalar variable or through a pointer to a scalar.
311Stored in the @code{in_struct} field and printed as @samp{/s}.
312
313@findex REG_LOOP_TEST_P
314@cindex @code{reg} and @samp{/s}
315@cindex @code{in_struct}, in @code{reg}
316@item REG_LOOP_TEST_P
317In @code{reg} expressions, nonzero if this register's entire life is
318contained in the exit test code for some loop.  Stored in the
319@code{in_struct} field and printed as @samp{/s}.
320
321@findex REG_USERVAR_P
322@cindex @code{reg} and @samp{/v}
323@cindex @code{volatil}, in @code{reg}
324@item REG_USERVAR_P (@var{x})
325In a @code{reg}, nonzero if it corresponds to a variable present in
326the user's source code.  Zero for temporaries generated internally by
327the compiler.  Stored in the @code{volatil} field and printed as
328@samp{/v}.
329
330@cindex @samp{/i} in RTL dump
331@findex REG_FUNCTION_VALUE_P
332@cindex @code{reg} and @samp{/i}
333@cindex @code{integrated}, in @code{reg}
334@item REG_FUNCTION_VALUE_P (@var{x})
335Nonzero in a @code{reg} if it is the place in which this function's
336value is going to be returned.  (This happens only in a hard
337register.)  Stored in the @code{integrated} field and printed as
338@samp{/i}.
339
340The same hard register may be used also for collecting the values of
341functions called by this one, but @code{REG_FUNCTION_VALUE_P} is zero
342in this kind of use.
343
344@findex SUBREG_PROMOTED_VAR_P
345@cindex @code{subreg} and @samp{/s}
346@cindex @code{in_struct}, in @code{subreg}
347@item SUBREG_PROMOTED_VAR_P
348Nonzero in a @code{subreg} if it was made when accessing an object that
349was promoted to a wider mode in accord with the @code{PROMOTED_MODE} machine
350description macro (@pxref{Storage Layout}).  In this case, the mode of
351the @code{subreg} is the declared mode of the object and the mode of
352@code{SUBREG_REG} is the mode of the register that holds the object.
353Promoted variables are always either sign- or zero-extended to the wider
354mode on every assignment.  Stored in the @code{in_struct} field and
355printed as @samp{/s}.
356
357@findex SUBREG_PROMOTED_UNSIGNED_P
358@cindex @code{subreg} and @samp{/u}
359@cindex @code{unchanging}, in @code{subreg}
360@item SUBREG_PROMOTED_UNSIGNED_P
361Nonzero in a @code{subreg} that has @code{SUBREG_PROMOTED_VAR_P} nonzero
362if the object being referenced is kept zero-extended and zero if it
363is kept sign-extended.  Stored in the @code{unchanging} field and
364printed as @samp{/u}.
365
366@findex RTX_UNCHANGING_P
367@cindex @code{reg} and @samp{/u}
368@cindex @code{mem} and @samp{/u}
369@cindex @code{unchanging}, in @code{reg} and @code{mem}
370@cindex @samp{/u} in RTL dump
371@item RTX_UNCHANGING_P (@var{x})
372Nonzero in a @code{reg} or @code{mem} if the value is not changed.
373(This flag is not set for memory references via pointers to constants.
374Such pointers only guarantee that the object will not be changed
375explicitly by the current function.  The object might be changed by
376other functions or by aliasing.)  Stored in the
377@code{unchanging} field and printed as @samp{/u}.
378
379@findex RTX_INTEGRATED_P
380@cindex @code{integrated}, in @code{insn}
381@item RTX_INTEGRATED_P (@var{insn})
382Nonzero in an insn if it resulted from an in-line function call.
383Stored in the @code{integrated} field and printed as @samp{/i}.  This
384may be deleted; nothing currently depends on it.
385
386@findex SYMBOL_REF_USED
387@cindex @code{used}, in @code{symbol_ref}
388@item SYMBOL_REF_USED (@var{x})
389In a @code{symbol_ref}, indicates that @var{x} has been used.  This is
390normally only used to ensure that @var{x} is only declared external
391once.  Stored in the @code{used} field.
392
393@findex SYMBOL_REF_FLAG
394@cindex @code{symbol_ref} and @samp{/v}
395@cindex @code{volatil}, in @code{symbol_ref}
396@item SYMBOL_REF_FLAG (@var{x})
397In a @code{symbol_ref}, this is used as a flag for machine-specific purposes.
398Stored in the @code{volatil} field and printed as @samp{/v}.
399
400@findex LABEL_OUTSIDE_LOOP_P
401@cindex @code{label_ref} and @samp{/s}
402@cindex @code{in_struct}, in @code{label_ref}
403@item LABEL_OUTSIDE_LOOP_P
404In @code{label_ref} expressions, nonzero if this is a reference to a
405label that is outside the innermost loop containing the reference to the
406label.  Stored in the @code{in_struct} field and printed as @samp{/s}.
407
408@findex INSN_DELETED_P
409@cindex @code{volatil}, in @code{insn}
410@item INSN_DELETED_P (@var{insn})
411In an insn, nonzero if the insn has been deleted.  Stored in the
412@code{volatil} field and printed as @samp{/v}.
413
414@findex INSN_ANNULLED_BRANCH_P
415@cindex @code{insn} and @samp{/u}
416@cindex @code{unchanging}, in @code{insn}
417@item INSN_ANNULLED_BRANCH_P (@var{insn})
418In an @code{insn} in the delay slot of a branch insn, indicates that an
419annulling branch should be used.  See the discussion under
420@code{sequence} below.  Stored in the @code{unchanging} field and printed
421as @samp{/u}.
422
423@findex INSN_FROM_TARGET_P
424@cindex @code{insn} and @samp{/s}
425@cindex @code{in_struct}, in @code{insn}
426@cindex @samp{/s} in RTL dump
427@item INSN_FROM_TARGET_P (@var{insn})
428In an @code{insn} in a delay slot of a branch, indicates that the insn
429is from the target of the branch.  If the branch insn has
430@code{INSN_ANNULLED_BRANCH_P} set, this insn will only be executed if
431the branch is taken.  For annulled branches with
432@code{INSN_FROM_TARGET_P} clear, the insn will be executed only if the
433branch is not taken.  When @code{INSN_ANNULLED_BRANCH_P} is not set,
434this insn will always be executed.  Stored in the @code{in_struct}
435field and printed as @samp{/s}.
436
437@findex CONSTANT_POOL_ADDRESS_P
438@cindex @code{symbol_ref} and @samp{/u}
439@cindex @code{unchanging}, in @code{symbol_ref}
440@item CONSTANT_POOL_ADDRESS_P (@var{x})
441Nonzero in a @code{symbol_ref} if it refers to part of the current
442function's ``constants pool''.  These are addresses close to the
443beginning of the function, and GNU CC assumes they can be addressed
444directly (perhaps with the help of base registers).  Stored in the
445@code{unchanging} field and printed as @samp{/u}.
446
447@findex CONST_CALL_P
448@cindex @code{call_insn} and @samp{/u}
449@cindex @code{unchanging}, in @code{call_insn}
450@item CONST_CALL_P (@var{x})
451In a @code{call_insn}, indicates that the insn represents a call to a const
452function.  Stored in the @code{unchanging} field and printed as @samp{/u}.
453
454@findex LABEL_PRESERVE_P
455@cindex @code{code_label} and @samp{/i}
456@cindex @code{in_struct}, in @code{code_label}
457@item LABEL_PRESERVE_P (@var{x})
458In a @code{code_label}, indicates that the label can never be deleted.
459Labels referenced by a non-local goto will have this bit set.  Stored
460in the @code{in_struct} field and printed as @samp{/s}.
461
462@findex SCHED_GROUP_P
463@cindex @code{insn} and @samp{/i}
464@cindex @code{in_struct}, in @code{insn}
465@item SCHED_GROUP_P (@var{insn})
466During instruction scheduling, in an insn, indicates that the previous insn
467must be scheduled together with this insn.  This is used to ensure that
468certain groups of instructions will not be split up by the instruction
469scheduling pass, for example, @code{use} insns before a @code{call_insn} may
470not be separated from the @code{call_insn}.  Stored in the @code{in_struct}
471field and printed as @samp{/s}.
472@end table
473
474These are the fields which the above macros refer to:
475
476@table @code
477@findex used
478@item used
479Normally, this flag is used only momentarily, at the end of RTL
480generation for a function, to count the number of times an expression
481appears in insns.  Expressions that appear more than once are copied,
482according to the rules for shared structure (@pxref{Sharing}).
483
484In a @code{symbol_ref}, it indicates that an external declaration for
485the symbol has already been written.
486
487In a @code{reg}, it is used by the leaf register renumbering code to ensure
488that each register is only renumbered once.
489
490@findex volatil
491@item volatil
492This flag is used in @code{mem}, @code{symbol_ref} and @code{reg}
493expressions and in insns.  In RTL dump files, it is printed as
494@samp{/v}.
495
496@cindex volatile memory references
497In a @code{mem} expression, it is 1 if the memory reference is volatile.
498Volatile memory references may not be deleted, reordered or combined.
499
500In a @code{symbol_ref} expression, it is used for machine-specific
501purposes.
502
503In a @code{reg} expression, it is 1 if the value is a user-level variable.
5040 indicates an internal compiler temporary.
505
506In an insn, 1 means the insn has been deleted.
507
508@findex in_struct
509@item in_struct
510In @code{mem} expressions, it is 1 if the memory datum referred to is
511all or part of a structure or array; 0 if it is (or might be) a scalar
512variable.  A reference through a C pointer has 0 because the pointer
513might point to a scalar variable.  This information allows the compiler
514to determine something about possible cases of aliasing.
515
516In an insn in the delay slot of a branch, 1 means that this insn is from
517the target of the branch.
518
519During instruction scheduling, in an insn, 1 means that this insn must be
520scheduled as part of a group together with the previous insn.
521
522In @code{reg} expressions, it is 1 if the register has its entire life
523contained within the test expression of some loop.
524
525In @code{subreg} expressions, 1 means that the @code{subreg} is accessing
526an object that has had its mode promoted from a wider mode.
527
528In @code{label_ref} expressions, 1 means that the referenced label is
529outside the innermost loop containing the insn in which the @code{label_ref}
530was found.
531
532In @code{code_label} expressions, it is 1 if the label may never be deleted.
533This is used for labels which are the target of non-local gotos.
534
535In an RTL dump, this flag is represented as @samp{/s}.
536
537@findex unchanging
538@item unchanging
539In @code{reg} and @code{mem} expressions, 1 means
540that the value of the expression never changes.
541
542In @code{subreg} expressions, it is 1 if the @code{subreg} references an
543unsigned object whose mode has been promoted to a wider mode.
544
545In an insn, 1 means that this is an annulling branch.
546
547In a @code{symbol_ref} expression, 1 means that this symbol addresses
548something in the per-function constants pool.
549
550In a @code{call_insn}, 1 means that this instruction is a call to a
551const function.
552
553In an RTL dump, this flag is represented as @samp{/u}.
554
555@findex integrated
556@item integrated
557In some kinds of expressions, including insns, this flag means the
558rtl was produced by procedure integration.
559
560In a @code{reg} expression, this flag indicates the register
561containing the value to be returned by the current function.  On
562machines that pass parameters in registers, the same register number
563may be used for parameters as well, but this flag is not set on such
564uses.
565@end table
566
567@node Machine Modes, Constants, Flags, RTL
568@section Machine Modes
569@cindex machine modes
570
571@findex enum machine_mode
572A machine mode describes a size of data object and the representation used
573for it.  In the C code, machine modes are represented by an enumeration
574type, @code{enum machine_mode}, defined in @file{machmode.def}.  Each RTL
575expression has room for a machine mode and so do certain kinds of tree
576expressions (declarations and types, to be precise).
577
578In debugging dumps and machine descriptions, the machine mode of an RTL
579expression is written after the expression code with a colon to separate
580them.  The letters @samp{mode} which appear at the end of each machine mode
581name are omitted.  For example, @code{(reg:SI 38)} is a @code{reg}
582expression with machine mode @code{SImode}.  If the mode is
583@code{VOIDmode}, it is not written at all.
584
585Here is a table of machine modes.  The term ``byte'' below refers to an
586object of @code{BITS_PER_UNIT} bits (@pxref{Storage Layout}).
587
588@table @code
589@findex QImode
590@item QImode
591``Quarter-Integer'' mode represents a single byte treated as an integer.
592
593@findex HImode
594@item HImode
595``Half-Integer'' mode represents a two-byte integer.
596
597@findex PSImode
598@item PSImode
599``Partial Single Integer'' mode represents an integer which occupies
600four bytes but which doesn't really use all four.  On some machines,
601this is the right mode to use for pointers.
602
603@findex SImode
604@item SImode
605``Single Integer'' mode represents a four-byte integer.
606
607@findex PDImode
608@item PDImode
609``Partial Double Integer'' mode represents an integer which occupies
610eight bytes but which doesn't really use all eight.  On some machines,
611this is the right mode to use for certain pointers.
612
613@findex DImode
614@item DImode
615``Double Integer'' mode represents an eight-byte integer.
616
617@findex TImode
618@item TImode
619``Tetra Integer'' (?) mode represents a sixteen-byte integer.
620
621@findex SFmode
622@item SFmode
623``Single Floating'' mode represents a single-precision (four byte) floating
624point number.
625
626@findex DFmode
627@item DFmode
628``Double Floating'' mode represents a double-precision (eight byte) floating
629point number.
630
631@findex XFmode
632@item XFmode
633``Extended Floating'' mode represents a triple-precision (twelve byte)
634floating point number.  This mode is used for IEEE extended floating
635point.  On some systems not all bits within these bytes will actually
636be used.
637
638@findex TFmode
639@item TFmode
640``Tetra Floating'' mode represents a quadruple-precision (sixteen byte)
641floating point number.
642
643@findex CCmode
644@item CCmode
645``Condition Code'' mode represents the value of a condition code, which
646is a machine-specific set of bits used to represent the result of a
647comparison operation.  Other machine-specific modes may also be used for
648the condition code.  These modes are not used on machines that use
649@code{cc0} (see @pxref{Condition Code}).
650
651@findex BLKmode
652@item BLKmode
653``Block'' mode represents values that are aggregates to which none of
654the other modes apply.  In RTL, only memory references can have this mode,
655and only if they appear in string-move or vector instructions.  On machines
656which have no such instructions, @code{BLKmode} will not appear in RTL.
657
658@findex VOIDmode
659@item VOIDmode
660Void mode means the absence of a mode or an unspecified mode.
661For example, RTL expressions of code @code{const_int} have mode
662@code{VOIDmode} because they can be taken to have whatever mode the context
663requires.  In debugging dumps of RTL, @code{VOIDmode} is expressed by
664the absence of any mode.
665
666@findex SCmode
667@findex DCmode
668@findex XCmode
669@findex TCmode
670@item SCmode, DCmode, XCmode, TCmode
671These modes stand for a complex number represented as a pair of floating
672point values.  The floating point values are in @code{SFmode},
673@code{DFmode}, @code{XFmode}, and @code{TFmode}, respectively.
674
675@findex CQImode
676@findex CHImode
677@findex CSImode
678@findex CDImode
679@findex CTImode
680@findex COImode
681@item CQImode, CHImode, CSImode, CDImode, CTImode, COImode
682These modes stand for a complex number represented as a pair of integer
683values.  The integer values are in @code{QImode}, @code{HImode},
684@code{SImode}, @code{DImode}, @code{TImode}, and @code{OImode},
685respectively.
686@end table
687
688The machine description defines @code{Pmode} as a C macro which expands
689into the machine mode used for addresses.  Normally this is the mode
690whose size is @code{BITS_PER_WORD}, @code{SImode} on 32-bit machines.
691
692The only modes which a machine description @i{must} support are
693@code{QImode}, and the modes corresponding to @code{BITS_PER_WORD},
694@code{FLOAT_TYPE_SIZE} and @code{DOUBLE_TYPE_SIZE}.
695The compiler will attempt to use @code{DImode} for 8-byte structures and
696unions, but this can be prevented by overriding the definition of
697@code{MAX_FIXED_MODE_SIZE}.  Alternatively, you can have the compiler
698use @code{TImode} for 16-byte structures and unions.  Likewise, you can
699arrange for the C type @code{short int} to avoid using @code{HImode}.
700
701@cindex mode classes
702Very few explicit references to machine modes remain in the compiler and
703these few references will soon be removed.  Instead, the machine modes
704are divided into mode classes.  These are represented by the enumeration
705type @code{enum mode_class} defined in @file{machmode.h}.  The possible
706mode classes are:
707
708@table @code
709@findex MODE_INT
710@item MODE_INT
711Integer modes.  By default these are @code{QImode}, @code{HImode},
712@code{SImode}, @code{DImode}, and @code{TImode}.
713
714@findex MODE_PARTIAL_INT
715@item MODE_PARTIAL_INT
716The ``partial integer'' modes, @code{PSImode} and @code{PDImode}.
717
718@findex MODE_FLOAT
719@item MODE_FLOAT
720floating point modes.  By default these are @code{SFmode}, @code{DFmode},
721@code{XFmode} and @code{TFmode}.
722
723@findex MODE_COMPLEX_INT
724@item MODE_COMPLEX_INT
725Complex integer modes.  (These are not currently implemented).
726
727@findex MODE_COMPLEX_FLOAT
728@item MODE_COMPLEX_FLOAT
729Complex floating point modes.  By default these are @code{SCmode},
730@code{DCmode}, @code{XCmode}, and @code{TCmode}.
731
732@findex MODE_FUNCTION
733@item MODE_FUNCTION
734Algol or Pascal function variables including a static chain.
735(These are not currently implemented).
736
737@findex MODE_CC
738@item MODE_CC
739Modes representing condition code values.  These are @code{CCmode} plus
740any modes listed in the @code{EXTRA_CC_MODES} macro.  @xref{Jump Patterns},
741also see @ref{Condition Code}.
742
743@findex MODE_RANDOM
744@item MODE_RANDOM
745This is a catchall mode class for modes which don't fit into the above
746classes.  Currently @code{VOIDmode} and @code{BLKmode} are in
747@code{MODE_RANDOM}.
748@end table
749
750Here are some C macros that relate to machine modes:
751
752@table @code
753@findex GET_MODE
754@item GET_MODE (@var{x})
755Returns the machine mode of the RTX @var{x}.
756
757@findex PUT_MODE
758@item PUT_MODE (@var{x}, @var{newmode})
759Alters the machine mode of the RTX @var{x} to be @var{newmode}.
760
761@findex NUM_MACHINE_MODES
762@item NUM_MACHINE_MODES
763Stands for the number of machine modes available on the target
764machine.  This is one greater than the largest numeric value of any
765machine mode.
766
767@findex GET_MODE_NAME
768@item GET_MODE_NAME (@var{m})
769Returns the name of mode @var{m} as a string.
770
771@findex GET_MODE_CLASS
772@item GET_MODE_CLASS (@var{m})
773Returns the mode class of mode @var{m}.
774
775@findex GET_MODE_WIDER_MODE
776@item GET_MODE_WIDER_MODE (@var{m})
777Returns the next wider natural mode.  For example, the expression
778@code{GET_MODE_WIDER_MODE (QImode)} returns @code{HImode}.
779
780@findex GET_MODE_SIZE
781@item GET_MODE_SIZE (@var{m})
782Returns the size in bytes of a datum of mode @var{m}.
783
784@findex GET_MODE_BITSIZE
785@item GET_MODE_BITSIZE (@var{m})
786Returns the size in bits of a datum of mode @var{m}.
787
788@findex GET_MODE_MASK
789@item GET_MODE_MASK (@var{m})
790Returns a bitmask containing 1 for all bits in a word that fit within
791mode @var{m}.  This macro can only be used for modes whose bitsize is
792less than or equal to @code{HOST_BITS_PER_INT}.
793
794@findex GET_MODE_ALIGNMENT
795@item GET_MODE_ALIGNMENT (@var{m)})
796Return the required alignment, in bits, for an object of mode @var{m}.
797
798@findex GET_MODE_UNIT_SIZE
799@item GET_MODE_UNIT_SIZE (@var{m})
800Returns the size in bytes of the subunits of a datum of mode @var{m}.
801This is the same as @code{GET_MODE_SIZE} except in the case of complex
802modes.  For them, the unit size is the size of the real or imaginary
803part.
804
805@findex GET_MODE_NUNITS
806@item GET_MODE_NUNITS (@var{m})
807Returns the number of units contained in a mode, i.e.,
808@code{GET_MODE_SIZE} divided by @code{GET_MODE_UNIT_SIZE}.
809
810@findex GET_CLASS_NARROWEST_MODE
811@item GET_CLASS_NARROWEST_MODE (@var{c})
812Returns the narrowest mode in mode class @var{c}.
813@end table
814
815@findex byte_mode
816@findex word_mode
817The global variables @code{byte_mode} and @code{word_mode} contain modes
818whose classes are @code{MODE_INT} and whose bitsizes are either
819@code{BITS_PER_UNIT} or @code{BITS_PER_WORD}, respectively.  On 32-bit
820machines, these are @code{QImode} and @code{SImode}, respectively.
821
822@node Constants, Regs and Memory, Machine Modes, RTL
823@section Constant Expression Types
824@cindex RTL constants
825@cindex RTL constant expression types
826
827The simplest RTL expressions are those that represent constant values.
828
829@table @code
830@findex const_int
831@item (const_int @var{i})
832This type of expression represents the integer value @var{i}.  @var{i}
833is customarily accessed with the macro @code{INTVAL} as in
834@code{INTVAL (@var{exp})}, which is equivalent to @code{XWINT (@var{exp}, 0)}.
835
836@findex const0_rtx
837@findex const1_rtx
838@findex const2_rtx
839@findex constm1_rtx
840There is only one expression object for the integer value zero; it is
841the value of the variable @code{const0_rtx}.  Likewise, the only
842expression for integer value one is found in @code{const1_rtx}, the only
843expression for integer value two is found in @code{const2_rtx}, and the
844only expression for integer value negative one is found in
845@code{constm1_rtx}.  Any attempt to create an expression of code
846@code{const_int} and value zero, one, two or negative one will return
847@code{const0_rtx}, @code{const1_rtx}, @code{const2_rtx} or
848@code{constm1_rtx} as appropriate.@refill
849
850@findex const_true_rtx
851Similarly, there is only one object for the integer whose value is
852@code{STORE_FLAG_VALUE}.  It is found in @code{const_true_rtx}.  If
853@code{STORE_FLAG_VALUE} is one, @code{const_true_rtx} and
854@code{const1_rtx} will point to the same object.  If
855@code{STORE_FLAG_VALUE} is -1, @code{const_true_rtx} and
856@code{constm1_rtx} will point to the same object.@refill
857
858@findex const_double
859@item (const_double:@var{m} @var{addr} @var{i0} @var{i1} @dots{})
860Represents either a floating-point constant of mode @var{m} or an
861integer constant too large to fit into @code{HOST_BITS_PER_WIDE_INT}
862bits but small enough to fit within twice that number of bits (GNU CC
863does not provide a mechanism to represent even larger constants).  In
864the latter case, @var{m} will be @code{VOIDmode}.
865
866@findex CONST_DOUBLE_MEM
867@findex CONST_DOUBLE_CHAIN
868@var{addr} is used to contain the @code{mem} expression that corresponds
869to the location in memory that at which the constant can be found.  If
870it has not been allocated a memory location, but is on the chain of all
871@code{const_double} expressions in this compilation (maintained using an
872undisplayed field), @var{addr} contains @code{const0_rtx}.  If it is not
873on the chain, @var{addr} contains @code{cc0_rtx}.  @var{addr} is
874customarily accessed with the macro @code{CONST_DOUBLE_MEM} and the
875chain field via @code{CONST_DOUBLE_CHAIN}.@refill
876
877@findex CONST_DOUBLE_LOW
878If @var{m} is @code{VOIDmode}, the bits of the value are stored in
879@var{i0} and @var{i1}.  @var{i0} is customarily accessed with the macro
880@code{CONST_DOUBLE_LOW} and @var{i1} with @code{CONST_DOUBLE_HIGH}.
881
882If the constant is floating point (regardless of its precision), then
883the number of integers used to store the value depends on the size of
884@code{REAL_VALUE_TYPE} (@pxref{Cross-compilation}).  The integers
885represent a floating point number, but not precisely in the target
886machine's or host machine's floating point format.  To convert them to
887the precise bit pattern used by the target machine, use the macro
888@code{REAL_VALUE_TO_TARGET_DOUBLE} and friends (@pxref{Data Output}).
889
890@findex CONST0_RTX
891@findex CONST1_RTX
892@findex CONST2_RTX
893The macro @code{CONST0_RTX (@var{mode})} refers to an expression with
894value 0 in mode @var{mode}.  If mode @var{mode} is of mode class
895@code{MODE_INT}, it returns @code{const0_rtx}.  Otherwise, it returns a
896@code{CONST_DOUBLE} expression in mode @var{mode}.  Similarly, the macro
897@code{CONST1_RTX (@var{mode})} refers to an expression with value 1 in
898mode @var{mode} and similarly for @code{CONST2_RTX}.
899
900@findex const_string
901@item (const_string @var{str})
902Represents a constant string with value @var{str}.  Currently this is
903used only for insn attributes (@pxref{Insn Attributes}) since constant
904strings in C are placed in memory.
905
906@findex symbol_ref
907@item (symbol_ref:@var{mode} @var{symbol})
908Represents the value of an assembler label for data.  @var{symbol} is
909a string that describes the name of the assembler label.  If it starts
910with a @samp{*}, the label is the rest of @var{symbol} not including
911the @samp{*}.  Otherwise, the label is @var{symbol}, usually prefixed
912with @samp{_}.
913
914The @code{symbol_ref} contains a mode, which is usually @code{Pmode}.
915Usually that is the only mode for which a symbol is directly valid.
916
917@findex label_ref
918@item (label_ref @var{label})
919Represents the value of an assembler label for code.  It contains one
920operand, an expression, which must be a @code{code_label} that appears
921in the instruction sequence to identify the place where the label
922should go.
923
924The reason for using a distinct expression type for code label
925references is so that jump optimization can distinguish them.
926
927@item (const:@var{m} @var{exp})
928Represents a constant that is the result of an assembly-time
929arithmetic computation.  The operand, @var{exp}, is an expression that
930contains only constants (@code{const_int}, @code{symbol_ref} and
931@code{label_ref} expressions) combined with @code{plus} and
932@code{minus}.  However, not all combinations are valid, since the
933assembler cannot do arbitrary arithmetic on relocatable symbols.
934
935@var{m} should be @code{Pmode}.
936
937@findex high
938@item (high:@var{m} @var{exp})
939Represents the high-order bits of @var{exp}, usually a
940@code{symbol_ref}.  The number of bits is machine-dependent and is
941normally the number of bits specified in an instruction that initializes
942the high order bits of a register.  It is used with @code{lo_sum} to
943represent the typical two-instruction sequence used in RISC machines to
944reference a global memory location.
945
946@var{m} should be @code{Pmode}.
947@end table
948
949@node Regs and Memory, Arithmetic, Constants, RTL
950@section Registers and Memory
951@cindex RTL register expressions
952@cindex RTL memory expressions
953
954Here are the RTL expression types for describing access to machine
955registers and to main memory.
956
957@table @code
958@findex reg
959@cindex hard registers
960@cindex pseudo registers
961@item (reg:@var{m} @var{n})
962For small values of the integer @var{n} (those that are less than
963@code{FIRST_PSEUDO_REGISTER}), this stands for a reference to machine
964register number @var{n}: a @dfn{hard register}.  For larger values of
965@var{n}, it stands for a temporary value or @dfn{pseudo register}.
966The compiler's strategy is to generate code assuming an unlimited
967number of such pseudo registers, and later convert them into hard
968registers or into memory references.
969
970@var{m} is the machine mode of the reference.  It is necessary because
971machines can generally refer to each register in more than one mode.
972For example, a register may contain a full word but there may be
973instructions to refer to it as a half word or as a single byte, as
974well as instructions to refer to it as a floating point number of
975various precisions.
976
977Even for a register that the machine can access in only one mode,
978the mode must always be specified.
979
980The symbol @code{FIRST_PSEUDO_REGISTER} is defined by the machine
981description, since the number of hard registers on the machine is an
982invariant characteristic of the machine.  Note, however, that not
983all of the machine registers must be general registers.  All the
984machine registers that can be used for storage of data are given
985hard register numbers, even those that can be used only in certain
986instructions or can hold only certain types of data.
987
988A hard register may be accessed in various modes throughout one
989function, but each pseudo register is given a natural mode
990and is accessed only in that mode.  When it is necessary to describe
991an access to a pseudo register using a nonnatural mode, a @code{subreg}
992expression is used.
993
994A @code{reg} expression with a machine mode that specifies more than
995one word of data may actually stand for several consecutive registers.
996If in addition the register number specifies a hardware register, then
997it actually represents several consecutive hardware registers starting
998with the specified one.
999
1000Each pseudo register number used in a function's RTL code is
1001represented by a unique @code{reg} expression.
1002
1003@findex FIRST_VIRTUAL_REGISTER
1004@findex LAST_VIRTUAL_REGISTER
1005Some pseudo register numbers, those within the range of
1006@code{FIRST_VIRTUAL_REGISTER} to @code{LAST_VIRTUAL_REGISTER} only
1007appear during the RTL generation phase and are eliminated before the
1008optimization phases.  These represent locations in the stack frame that
1009cannot be determined until RTL generation for the function has been
1010completed.  The following virtual register numbers are defined:
1011
1012@table @code
1013@findex VIRTUAL_INCOMING_ARGS_REGNUM
1014@item VIRTUAL_INCOMING_ARGS_REGNUM
1015This points to the first word of the incoming arguments passed on the
1016stack.  Normally these arguments are placed there by the caller, but the
1017callee may have pushed some arguments that were previously passed in
1018registers.
1019
1020@cindex @code{FIRST_PARM_OFFSET} and virtual registers
1021@cindex @code{ARG_POINTER_REGNUM} and virtual registers
1022When RTL generation is complete, this virtual register is replaced
1023by the sum of the register given by @code{ARG_POINTER_REGNUM} and the
1024value of @code{FIRST_PARM_OFFSET}.
1025
1026@findex VIRTUAL_STACK_VARS_REGNUM
1027@cindex @code{FRAME_GROWS_DOWNWARD} and virtual registers
1028@item VIRTUAL_STACK_VARS_REGNUM
1029If @code{FRAME_GROWS_DOWNWARD} is defined, this points to immediately
1030above the first variable on the stack.  Otherwise, it points to the
1031first variable on the stack.
1032
1033@cindex @code{STARTING_FRAME_OFFSET} and virtual registers
1034@cindex @code{FRAME_POINTER_REGNUM} and virtual registers
1035@code{VIRTUAL_STACK_VARS_REGNUM} is replaced with the sum of the
1036register given by @code{FRAME_POINTER_REGNUM} and the value
1037@code{STARTING_FRAME_OFFSET}.
1038
1039@findex VIRTUAL_STACK_DYNAMIC_REGNUM
1040@item VIRTUAL_STACK_DYNAMIC_REGNUM
1041This points to the location of dynamically allocated memory on the stack
1042immediately after the stack pointer has been adjusted by the amount of
1043memory desired.
1044
1045@cindex @code{STACK_DYNAMIC_OFFSET} and virtual registers
1046@cindex @code{STACK_POINTER_REGNUM} and virtual registers
1047This virtual register is replaced by the sum of the register given by
1048@code{STACK_POINTER_REGNUM} and the value @code{STACK_DYNAMIC_OFFSET}.
1049
1050@findex VIRTUAL_OUTGOING_ARGS_REGNUM
1051@item VIRTUAL_OUTGOING_ARGS_REGNUM
1052This points to the location in the stack at which outgoing arguments
1053should be written when the stack is pre-pushed (arguments pushed using
1054push insns should always use @code{STACK_POINTER_REGNUM}).
1055
1056@cindex @code{STACK_POINTER_OFFSET} and virtual registers
1057This virtual register is replaced by the sum of the register given by
1058@code{STACK_POINTER_REGNUM} and the value @code{STACK_POINTER_OFFSET}.
1059@end table
1060
1061@findex subreg
1062@item (subreg:@var{m} @var{reg} @var{wordnum})
1063@code{subreg} expressions are used to refer to a register in a machine
1064mode other than its natural one, or to refer to one register of
1065a multi-word @code{reg} that actually refers to several registers.
1066
1067Each pseudo-register has a natural mode.  If it is necessary to
1068operate on it in a different mode---for example, to perform a fullword
1069move instruction on a pseudo-register that contains a single
1070byte---the pseudo-register must be enclosed in a @code{subreg}.  In
1071such a case, @var{wordnum} is zero.
1072
1073Usually @var{m} is at least as narrow as the mode of @var{reg}, in which
1074case it is restricting consideration to only the bits of @var{reg} that
1075are in @var{m}.
1076
1077Sometimes @var{m} is wider than the mode of @var{reg}.  These
1078@code{subreg} expressions are often called @dfn{paradoxical}.  They are
1079used in cases where we want to refer to an object in a wider mode but do
1080not care what value the additional bits have.  The reload pass ensures
1081that paradoxical references are only made to hard registers.
1082
1083The other use of @code{subreg} is to extract the individual registers of
1084a multi-register value.  Machine modes such as @code{DImode} and
1085@code{TImode} can indicate values longer than a word, values which
1086usually require two or more consecutive registers.  To access one of the
1087registers, use a @code{subreg} with mode @code{SImode} and a
1088@var{wordnum} that says which register.
1089
1090Storing in a non-paradoxical @code{subreg} has undefined results for
1091bits belonging to the same word as the @code{subreg}.  This laxity makes
1092it easier to generate efficient code for such instructions.  To
1093represent an instruction that preserves all the bits outside of those in
1094the @code{subreg}, use @code{strict_low_part} around the @code{subreg}.
1095
1096@cindex @code{WORDS_BIG_ENDIAN}, effect on @code{subreg}
1097The compilation parameter @code{WORDS_BIG_ENDIAN}, if set to 1, says
1098that word number zero is the most significant part; otherwise, it is
1099the least significant part.
1100
1101@cindex combiner pass
1102@cindex reload pass
1103@cindex @code{subreg}, special reload handling
1104Between the combiner pass and the reload pass, it is possible to have a
1105paradoxical @code{subreg} which contains a @code{mem} instead of a
1106@code{reg} as its first operand.  After the reload pass, it is also
1107possible to have a non-paradoxical @code{subreg} which contains a
1108@code{mem}; this usually occurs when the @code{mem} is a stack slot
1109which replaced a pseudo register.
1110
1111Note that it is not valid to access a @code{DFmode} value in @code{SFmode}
1112using a @code{subreg}.  On some machines the most significant part of a
1113@code{DFmode} value does not have the same format as a single-precision
1114floating value.
1115
1116It is also not valid to access a single word of a multi-word value in a
1117hard register when less registers can hold the value than would be
1118expected from its size.  For example, some 32-bit machines have
1119floating-point registers that can hold an entire @code{DFmode} value.
1120If register 10 were such a register @code{(subreg:SI (reg:DF 10) 1)}
1121would be invalid because there is no way to convert that reference to
1122a single machine register.  The reload pass prevents @code{subreg}
1123expressions such as these from being formed.
1124
1125@findex SUBREG_REG
1126@findex SUBREG_WORD
1127The first operand of a @code{subreg} expression is customarily accessed
1128with the @code{SUBREG_REG} macro and the second operand is customarily
1129accessed with the @code{SUBREG_WORD} macro.
1130
1131@findex scratch
1132@cindex scratch operands
1133@item (scratch:@var{m})
1134This represents a scratch register that will be required for the
1135execution of a single instruction and not used subsequently.  It is
1136converted into a @code{reg} by either the local register allocator or
1137the reload pass.
1138
1139@code{scratch} is usually present inside a @code{clobber} operation
1140(@pxref{Side Effects}).
1141
1142@findex cc0
1143@cindex condition code register
1144@item (cc0)
1145This refers to the machine's condition code register.  It has no
1146operands and may not have a machine mode.  There are two ways to use it:
1147
1148@itemize @bullet
1149@item
1150To stand for a complete set of condition code flags.  This is best on
1151most machines, where each comparison sets the entire series of flags.
1152
1153With this technique, @code{(cc0)} may be validly used in only two
1154contexts: as the destination of an assignment (in test and compare
1155instructions) and in comparison operators comparing against zero
1156(@code{const_int} with value zero; that is to say, @code{const0_rtx}).
1157
1158@item
1159To stand for a single flag that is the result of a single condition.
1160This is useful on machines that have only a single flag bit, and in
1161which comparison instructions must specify the condition to test.
1162
1163With this technique, @code{(cc0)} may be validly used in only two
1164contexts: as the destination of an assignment (in test and compare
1165instructions) where the source is a comparison operator, and as the
1166first operand of @code{if_then_else} (in a conditional branch).
1167@end itemize
1168
1169@findex cc0_rtx
1170There is only one expression object of code @code{cc0}; it is the
1171value of the variable @code{cc0_rtx}.  Any attempt to create an
1172expression of code @code{cc0} will return @code{cc0_rtx}.
1173
1174Instructions can set the condition code implicitly.  On many machines,
1175nearly all instructions set the condition code based on the value that
1176they compute or store.  It is not necessary to record these actions
1177explicitly in the RTL because the machine description includes a
1178prescription for recognizing the instructions that do so (by means of
1179the macro @code{NOTICE_UPDATE_CC}).  @xref{Condition Code}.  Only
1180instructions whose sole purpose is to set the condition code, and
1181instructions that use the condition code, need mention @code{(cc0)}.
1182
1183On some machines, the condition code register is given a register number
1184and a @code{reg} is used instead of @code{(cc0)}.  This is usually the
1185preferable approach if only a small subset of instructions modify the
1186condition code.  Other machines store condition codes in general
1187registers; in such cases a pseudo register should be used.
1188
1189Some machines, such as the Sparc and RS/6000, have two sets of
1190arithmetic instructions, one that sets and one that does not set the
1191condition code.  This is best handled by normally generating the
1192instruction that does not set the condition code, and making a pattern
1193that both performs the arithmetic and sets the condition code register
1194(which would not be @code{(cc0)} in this case).  For examples, search
1195for @samp{addcc} and @samp{andcc} in @file{sparc.md}.
1196
1197@findex pc
1198@item (pc)
1199@cindex program counter
1200This represents the machine's program counter.  It has no operands and
1201may not have a machine mode.  @code{(pc)} may be validly used only in
1202certain specific contexts in jump instructions.
1203
1204@findex pc_rtx
1205There is only one expression object of code @code{pc}; it is the value
1206of the variable @code{pc_rtx}.  Any attempt to create an expression of
1207code @code{pc} will return @code{pc_rtx}.
1208
1209All instructions that do not jump alter the program counter implicitly
1210by incrementing it, but there is no need to mention this in the RTL.
1211
1212@findex mem
1213@item (mem:@var{m} @var{addr})
1214This RTX represents a reference to main memory at an address
1215represented by the expression @var{addr}.  @var{m} specifies how large
1216a unit of memory is accessed.
1217
1218@findex addressof
1219@item (addressof:@var{m} @var{reg})
1220This RTX represents a request for the address of register @var{reg}.  Its mode
1221is always @code{Pmode}.  If there are any @code{addressof}
1222expressions left in the function after CSE, @var{reg} is forced into the
1223stack and the @code{addressof} expression is replaced with a @code{plus}
1224expression for the address of its stack slot.
1225@end table
1226
1227@node Arithmetic, Comparisons, Regs and Memory, RTL
1228@section RTL Expressions for Arithmetic
1229@cindex arithmetic, in RTL
1230@cindex math, in RTL
1231@cindex RTL expressions for arithmetic
1232
1233Unless otherwise specified, all the operands of arithmetic expressions
1234must be valid for mode @var{m}.  An operand is valid for mode @var{m}
1235if it has mode @var{m}, or if it is a @code{const_int} or
1236@code{const_double} and @var{m} is a mode of class @code{MODE_INT}.
1237
1238For commutative binary operations, constants should be placed in the
1239second operand.
1240
1241@table @code
1242@findex plus
1243@cindex RTL addition
1244@cindex RTL sum
1245@item (plus:@var{m} @var{x} @var{y})
1246Represents the sum of the values represented by @var{x} and @var{y}
1247carried out in machine mode @var{m}.
1248
1249@findex lo_sum
1250@item (lo_sum:@var{m} @var{x} @var{y})
1251Like @code{plus}, except that it represents that sum of @var{x} and the
1252low-order bits of @var{y}.  The number of low order bits is
1253machine-dependent but is normally the number of bits in a @code{Pmode}
1254item minus the number of bits set by the @code{high} code
1255(@pxref{Constants}).
1256
1257@var{m} should be @code{Pmode}.
1258
1259@findex minus
1260@cindex RTL subtraction
1261@cindex RTL difference
1262@item (minus:@var{m} @var{x} @var{y})
1263Like @code{plus} but represents subtraction.
1264
1265@findex compare
1266@cindex RTL comparison
1267@item (compare:@var{m} @var{x} @var{y})
1268Represents the result of subtracting @var{y} from @var{x} for purposes
1269of comparison.  The result is computed without overflow, as if with
1270infinite precision.
1271
1272Of course, machines can't really subtract with infinite precision.
1273However, they can pretend to do so when only the sign of the
1274result will be used, which is the case when the result is stored
1275in the condition code.   And that is the only way this kind of expression
1276may validly be used: as a value to be stored in the condition codes.
1277
1278The mode @var{m} is not related to the modes of @var{x} and @var{y},
1279but instead is the mode of the condition code value.  If @code{(cc0)}
1280is used, it is @code{VOIDmode}.  Otherwise it is some mode in class
1281@code{MODE_CC}, often @code{CCmode}.  @xref{Condition Code}.
1282
1283Normally, @var{x} and @var{y} must have the same mode.  Otherwise,
1284@code{compare} is valid only if the mode of @var{x} is in class
1285@code{MODE_INT} and @var{y} is a @code{const_int} or
1286@code{const_double} with mode @code{VOIDmode}.  The mode of @var{x}
1287determines what mode the comparison is to be done in; thus it must not
1288be @code{VOIDmode}.
1289
1290If one of the operands is a constant, it should be placed in the
1291second operand and the comparison code adjusted as appropriate. 
1292
1293A @code{compare} specifying two @code{VOIDmode} constants is not valid
1294since there is no way to know in what mode the comparison is to be
1295performed; the comparison must either be folded during the compilation
1296or the first operand must be loaded into a register while its mode is
1297still known.
1298
1299@findex neg
1300@item (neg:@var{m} @var{x})
1301Represents the negation (subtraction from zero) of the value represented
1302by @var{x}, carried out in mode @var{m}.
1303
1304@findex mult
1305@cindex multiplication
1306@cindex product
1307@item (mult:@var{m} @var{x} @var{y})
1308Represents the signed product of the values represented by @var{x} and
1309@var{y} carried out in machine mode @var{m}.
1310
1311Some machines support a multiplication that generates a product wider
1312than the operands.  Write the pattern for this as
1313
1314@example
1315(mult:@var{m} (sign_extend:@var{m} @var{x}) (sign_extend:@var{m} @var{y}))
1316@end example
1317
1318where @var{m} is wider than the modes of @var{x} and @var{y}, which need
1319not be the same.
1320
1321Write patterns for unsigned widening multiplication similarly using
1322@code{zero_extend}.
1323
1324@findex div
1325@cindex division
1326@cindex signed division
1327@cindex quotient
1328@item (div:@var{m} @var{x} @var{y})
1329Represents the quotient in signed division of @var{x} by @var{y},
1330carried out in machine mode @var{m}.  If @var{m} is a floating point
1331mode, it represents the exact quotient; otherwise, the integerized
1332quotient.
1333
1334Some machines have division instructions in which the operands and
1335quotient widths are not all the same; you should represent
1336such instructions using @code{truncate} and @code{sign_extend} as in,
1337
1338@example
1339(truncate:@var{m1} (div:@var{m2} @var{x} (sign_extend:@var{m2} @var{y})))
1340@end example
1341
1342@findex udiv
1343@cindex unsigned division
1344@cindex division
1345@item (udiv:@var{m} @var{x} @var{y})
1346Like @code{div} but represents unsigned division.
1347
1348@findex mod
1349@findex umod
1350@cindex remainder
1351@cindex division
1352@item (mod:@var{m} @var{x} @var{y})
1353@itemx (umod:@var{m} @var{x} @var{y})
1354Like @code{div} and @code{udiv} but represent the remainder instead of
1355the quotient.
1356
1357@findex smin
1358@findex smax
1359@cindex signed minimum
1360@cindex signed maximum
1361@item (smin:@var{m} @var{x} @var{y})
1362@itemx (smax:@var{m} @var{x} @var{y})
1363Represents the smaller (for @code{smin}) or larger (for @code{smax}) of
1364@var{x} and @var{y}, interpreted as signed integers in mode @var{m}.
1365
1366@findex umin
1367@findex umax
1368@cindex unsigned minimum and maximum
1369@item (umin:@var{m} @var{x} @var{y})
1370@itemx (umax:@var{m} @var{x} @var{y})
1371Like @code{smin} and @code{smax}, but the values are interpreted as unsigned
1372integers.
1373
1374@findex not
1375@cindex complement, bitwise
1376@cindex bitwise complement
1377@item (not:@var{m} @var{x})
1378Represents the bitwise complement of the value represented by @var{x},
1379carried out in mode @var{m}, which must be a fixed-point machine mode.
1380
1381@findex and
1382@cindex logical-and, bitwise
1383@cindex bitwise logical-and
1384@item (and:@var{m} @var{x} @var{y})
1385Represents the bitwise logical-and of the values represented by
1386@var{x} and @var{y}, carried out in machine mode @var{m}, which must be
1387a fixed-point machine mode.
1388
1389@findex ior
1390@cindex inclusive-or, bitwise
1391@cindex bitwise inclusive-or
1392@item (ior:@var{m} @var{x} @var{y})
1393Represents the bitwise inclusive-or of the values represented by @var{x}
1394and @var{y}, carried out in machine mode @var{m}, which must be a
1395fixed-point mode.
1396
1397@findex xor
1398@cindex exclusive-or, bitwise
1399@cindex bitwise exclusive-or
1400@item (xor:@var{m} @var{x} @var{y})
1401Represents the bitwise exclusive-or of the values represented by @var{x}
1402and @var{y}, carried out in machine mode @var{m}, which must be a
1403fixed-point mode.
1404
1405@findex ashift
1406@cindex left shift
1407@cindex shift
1408@cindex arithmetic shift
1409@item (ashift:@var{m} @var{x} @var{c})
1410Represents the result of arithmetically shifting @var{x} left by @var{c}
1411places.  @var{x} have mode @var{m}, a fixed-point machine mode.  @var{c}
1412be a fixed-point mode or be a constant with mode @code{VOIDmode}; which
1413mode is determined by the mode called for in the machine description
1414entry for the left-shift instruction.  For example, on the Vax, the mode
1415of @var{c} is @code{QImode} regardless of @var{m}.
1416
1417@findex lshiftrt
1418@cindex right shift
1419@findex ashiftrt
1420@item (lshiftrt:@var{m} @var{x} @var{c})
1421@itemx (ashiftrt:@var{m} @var{x} @var{c})
1422Like @code{ashift} but for right shift.  Unlike the case for left shift,
1423these two operations are distinct.
1424
1425@findex rotate
1426@cindex rotate
1427@cindex left rotate
1428@findex rotatert
1429@cindex right rotate
1430@item (rotate:@var{m} @var{x} @var{c})
1431@itemx (rotatert:@var{m} @var{x} @var{c})
1432Similar but represent left and right rotate.  If @var{c} is a constant,
1433use @code{rotate}.
1434
1435@findex abs
1436@cindex absolute value
1437@item (abs:@var{m} @var{x})
1438Represents the absolute value of @var{x}, computed in mode @var{m}.
1439
1440@findex sqrt
1441@cindex square root
1442@item (sqrt:@var{m} @var{x})
1443Represents the square root of @var{x}, computed in mode @var{m}.
1444Most often @var{m} will be a floating point mode.
1445
1446@findex ffs
1447@item (ffs:@var{m} @var{x})
1448Represents one plus the index of the least significant 1-bit in
1449@var{x}, represented as an integer of mode @var{m}.  (The value is
1450zero if @var{x} is zero.)  The mode of @var{x} need not be @var{m};
1451depending on the target machine, various mode combinations may be
1452valid.
1453@end table
1454
1455@node Comparisons, Bit Fields, Arithmetic, RTL
1456@section Comparison Operations
1457@cindex RTL comparison operations
1458
1459Comparison operators test a relation on two operands and are considered
1460to represent a machine-dependent nonzero value described by, but not
1461necessarily equal to, @code{STORE_FLAG_VALUE} (@pxref{Misc})
1462if the relation holds, or zero if it does not.  The mode of the
1463comparison operation is independent of the mode of the data being
1464compared.  If the comparison operation is being tested (e.g., the first
1465operand of an @code{if_then_else}), the mode must be @code{VOIDmode}.
1466If the comparison operation is producing data to be stored in some
1467variable, the mode must be in class @code{MODE_INT}.  All comparison
1468operations producing data must use the same mode, which is
1469machine-specific.
1470
1471@cindex condition codes
1472There are two ways that comparison operations may be used.  The
1473comparison operators may be used to compare the condition codes
1474@code{(cc0)} against zero, as in @code{(eq (cc0) (const_int 0))}.  Such
1475a construct actually refers to the result of the preceding instruction
1476in which the condition codes were set.  The instructing setting the
1477condition code must be adjacent to the instruction using the condition
1478code; only @code{note} insns may separate them.
1479
1480Alternatively, a comparison operation may directly compare two data
1481objects.  The mode of the comparison is determined by the operands; they
1482must both be valid for a common machine mode.  A comparison with both
1483operands constant would be invalid as the machine mode could not be
1484deduced from it, but such a comparison should never exist in RTL due to
1485constant folding.
1486
1487In the example above, if @code{(cc0)} were last set to
1488@code{(compare @var{x} @var{y})}, the comparison operation is
1489identical to @code{(eq @var{x} @var{y})}.  Usually only one style
1490of comparisons is supported on a particular machine, but the combine
1491pass will try to merge the operations to produce the @code{eq} shown
1492in case it exists in the context of the particular insn involved.
1493
1494Inequality comparisons come in two flavors, signed and unsigned.  Thus,
1495there are distinct expression codes @code{gt} and @code{gtu} for signed and
1496unsigned greater-than.  These can produce different results for the same
1497pair of integer values: for example, 1 is signed greater-than -1 but not
1498unsigned greater-than, because -1 when regarded as unsigned is actually
1499@code{0xffffffff} which is greater than 1.
1500
1501The signed comparisons are also used for floating point values.  Floating
1502point comparisons are distinguished by the machine modes of the operands.
1503
1504@table @code
1505@findex eq
1506@cindex equal
1507@item (eq:@var{m} @var{x} @var{y})
15081 if the values represented by @var{x} and @var{y} are equal,
1509otherwise 0.
1510
1511@findex ne
1512@cindex not equal
1513@item (ne:@var{m} @var{x} @var{y})
15141 if the values represented by @var{x} and @var{y} are not equal,
1515otherwise 0.
1516
1517@findex gt
1518@cindex greater than
1519@item (gt:@var{m} @var{x} @var{y})
15201 if the @var{x} is greater than @var{y}.  If they are fixed-point,
1521the comparison is done in a signed sense.
1522
1523@findex gtu
1524@cindex greater than
1525@cindex unsigned greater than
1526@item (gtu:@var{m} @var{x} @var{y})
1527Like @code{gt} but does unsigned comparison, on fixed-point numbers only.
1528
1529@findex lt
1530@cindex less than
1531@findex ltu
1532@cindex unsigned less than
1533@item (lt:@var{m} @var{x} @var{y})
1534@itemx (ltu:@var{m} @var{x} @var{y})
1535Like @code{gt} and @code{gtu} but test for ``less than''.
1536
1537@findex ge
1538@cindex greater than
1539@findex geu
1540@cindex unsigned greater than
1541@item (ge:@var{m} @var{x} @var{y})
1542@itemx (geu:@var{m} @var{x} @var{y})
1543Like @code{gt} and @code{gtu} but test for ``greater than or equal''.
1544
1545@findex le
1546@cindex less than or equal
1547@findex leu
1548@cindex unsigned less than
1549@item (le:@var{m} @var{x} @var{y})
1550@itemx (leu:@var{m} @var{x} @var{y})
1551Like @code{gt} and @code{gtu} but test for ``less than or equal''.
1552
1553@findex if_then_else
1554@item (if_then_else @var{cond} @var{then} @var{else})
1555This is not a comparison operation but is listed here because it is
1556always used in conjunction with a comparison operation.  To be
1557precise, @var{cond} is a comparison expression.  This expression
1558represents a choice, according to @var{cond}, between the value
1559represented by @var{then} and the one represented by @var{else}.
1560
1561On most machines, @code{if_then_else} expressions are valid only
1562to express conditional jumps.
1563
1564@findex cond
1565@item (cond [@var{test1} @var{value1} @var{test2} @var{value2} @dots{}] @var{default})
1566Similar to @code{if_then_else}, but more general.  Each of @var{test1},
1567@var{test2}, @dots{} is performed in turn.  The result of this expression is
1568the @var{value} corresponding to the first non-zero test, or @var{default} if
1569none of the tests are non-zero expressions.
1570
1571This is currently not valid for instruction patterns and is supported only
1572for insn attributes.  @xref{Insn Attributes}.
1573@end table
1574
1575@node Bit Fields, Conversions, Comparisons, RTL
1576@section Bit Fields
1577@cindex bit fields
1578
1579Special expression codes exist to represent bitfield instructions.
1580These types of expressions are lvalues in RTL; they may appear
1581on the left side of an assignment, indicating insertion of a value
1582into the specified bit field.
1583
1584@table @code
1585@findex sign_extract
1586@cindex @code{BITS_BIG_ENDIAN}, effect on @code{sign_extract}
1587@item (sign_extract:@var{m} @var{loc} @var{size} @var{pos})
1588This represents a reference to a sign-extended bit field contained or
1589starting in @var{loc} (a memory or register reference).  The bit field
1590is @var{size} bits wide and starts at bit @var{pos}.  The compilation
1591option @code{BITS_BIG_ENDIAN} says which end of the memory unit
1592@var{pos} counts from.
1593
1594If @var{loc} is in memory, its mode must be a single-byte integer mode.
1595If @var{loc} is in a register, the mode to use is specified by the
1596operand of the @code{insv} or @code{extv} pattern
1597(@pxref{Standard Names}) and is usually a full-word integer mode,
1598which is the default if none is specified.
1599
1600The mode of @var{pos} is machine-specific and is also specified
1601in the @code{insv} or @code{extv} pattern.
1602
1603The mode @var{m} is the same as the mode that would be used for
1604@var{loc} if it were a register.
1605
1606@findex zero_extract
1607@item (zero_extract:@var{m} @var{loc} @var{size} @var{pos})
1608Like @code{sign_extract} but refers to an unsigned or zero-extended
1609bit field.  The same sequence of bits are extracted, but they
1610are filled to an entire word with zeros instead of by sign-extension.
1611@end table
1612
1613@node Conversions, RTL Declarations, Bit Fields, RTL
1614@section Conversions
1615@cindex conversions
1616@cindex machine mode conversions
1617
1618All conversions between machine modes must be represented by
1619explicit conversion operations.  For example, an expression
1620which is the sum of a byte and a full word cannot be written as
1621@code{(plus:SI (reg:QI 34) (reg:SI 80))} because the @code{plus}
1622operation requires two operands of the same machine mode.
1623Therefore, the byte-sized operand is enclosed in a conversion
1624operation, as in
1625
1626@example
1627(plus:SI (sign_extend:SI (reg:QI 34)) (reg:SI 80))
1628@end example
1629
1630The conversion operation is not a mere placeholder, because there
1631may be more than one way of converting from a given starting mode
1632to the desired final mode.  The conversion operation code says how
1633to do it.
1634
1635For all conversion operations, @var{x} must not be @code{VOIDmode}
1636because the mode in which to do the conversion would not be known.
1637The conversion must either be done at compile-time or @var{x}
1638must be placed into a register.
1639
1640@table @code
1641@findex sign_extend
1642@item (sign_extend:@var{m} @var{x})
1643Represents the result of sign-extending the value @var{x}
1644to machine mode @var{m}.  @var{m} must be a fixed-point mode
1645and @var{x} a fixed-point value of a mode narrower than @var{m}.
1646
1647@findex zero_extend
1648@item (zero_extend:@var{m} @var{x})
1649Represents the result of zero-extending the value @var{x}
1650to machine mode @var{m}.  @var{m} must be a fixed-point mode
1651and @var{x} a fixed-point value of a mode narrower than @var{m}.
1652
1653@findex float_extend
1654@item (float_extend:@var{m} @var{x})
1655Represents the result of extending the value @var{x}
1656to machine mode @var{m}.  @var{m} must be a floating point mode
1657and @var{x} a floating point value of a mode narrower than @var{m}.
1658
1659@findex truncate
1660@item (truncate:@var{m} @var{x})
1661Represents the result of truncating the value @var{x}
1662to machine mode @var{m}.  @var{m} must be a fixed-point mode
1663and @var{x} a fixed-point value of a mode wider than @var{m}.
1664
1665@findex float_truncate
1666@item (float_truncate:@var{m} @var{x})
1667Represents the result of truncating the value @var{x}
1668to machine mode @var{m}.  @var{m} must be a floating point mode
1669and @var{x} a floating point value of a mode wider than @var{m}.
1670
1671@findex float
1672@item (float:@var{m} @var{x})
1673Represents the result of converting fixed point value @var{x},
1674regarded as signed, to floating point mode @var{m}.
1675
1676@findex unsigned_float
1677@item (unsigned_float:@var{m} @var{x})
1678Represents the result of converting fixed point value @var{x},
1679regarded as unsigned, to floating point mode @var{m}.
1680
1681@findex fix
1682@item (fix:@var{m} @var{x})
1683When @var{m} is a fixed point mode, represents the result of
1684converting floating point value @var{x} to mode @var{m}, regarded as
1685signed.  How rounding is done is not specified, so this operation may
1686be used validly in compiling C code only for integer-valued operands.
1687
1688@findex unsigned_fix
1689@item (unsigned_fix:@var{m} @var{x})
1690Represents the result of converting floating point value @var{x} to
1691fixed point mode @var{m}, regarded as unsigned.  How rounding is done
1692is not specified.
1693
1694@findex fix
1695@item (fix:@var{m} @var{x})
1696When @var{m} is a floating point mode, represents the result of
1697converting floating point value @var{x} (valid for mode @var{m}) to an
1698integer, still represented in floating point mode @var{m}, by rounding
1699towards zero.
1700@end table
1701
1702@node RTL Declarations, Side Effects, Conversions, RTL
1703@section Declarations
1704@cindex RTL declarations
1705@cindex declarations, RTL
1706
1707Declaration expression codes do not represent arithmetic operations
1708but rather state assertions about their operands.
1709
1710@table @code
1711@findex strict_low_part
1712@cindex @code{subreg}, in @code{strict_low_part}
1713@item (strict_low_part (subreg:@var{m} (reg:@var{n} @var{r}) 0))
1714This expression code is used in only one context: as the destination operand of a
1715@code{set} expression.  In addition, the operand of this expression
1716must be a non-paradoxical @code{subreg} expression.
1717
1718The presence of @code{strict_low_part} says that the part of the
1719register which is meaningful in mode @var{n}, but is not part of
1720mode @var{m}, is not to be altered.  Normally, an assignment to such
1721a subreg is allowed to have undefined effects on the rest of the
1722register when @var{m} is less than a word.
1723@end table
1724
1725@node Side Effects, Incdec, RTL Declarations, RTL
1726@section Side Effect Expressions
1727@cindex RTL side effect expressions
1728
1729The expression codes described so far represent values, not actions.
1730But machine instructions never produce values; they are meaningful
1731only for their side effects on the state of the machine.  Special
1732expression codes are used to represent side effects.
1733
1734The body of an instruction is always one of these side effect codes;
1735the codes described above, which represent values, appear only as
1736the operands of these.
1737
1738@table @code
1739@findex set
1740@item (set @var{lval} @var{x})
1741Represents the action of storing the value of @var{x} into the place
1742represented by @var{lval}.  @var{lval} must be an expression
1743representing a place that can be stored in: @code{reg} (or
1744@code{subreg} or @code{strict_low_part}), @code{mem}, @code{pc} or
1745@code{cc0}.@refill
1746
1747If @var{lval} is a @code{reg}, @code{subreg} or @code{mem}, it has a
1748machine mode; then @var{x} must be valid for that mode.@refill
1749
1750If @var{lval} is a @code{reg} whose machine mode is less than the full
1751width of the register, then it means that the part of the register
1752specified by the machine mode is given the specified value and the
1753rest of the register receives an undefined value.  Likewise, if
1754@var{lval} is a @code{subreg} whose machine mode is narrower than
1755the mode of the register, the rest of the register can be changed in
1756an undefined way.
1757
1758If @var{lval} is a @code{strict_low_part} of a @code{subreg}, then the
1759part of the register specified by the machine mode of the
1760@code{subreg} is given the value @var{x} and the rest of the register
1761is not changed.@refill
1762
1763If @var{lval} is @code{(cc0)}, it has no machine mode, and @var{x} may
1764be either a @code{compare} expression or a value that may have any mode.
1765The latter case represents a ``test'' instruction.  The expression
1766@code{(set (cc0) (reg:@var{m} @var{n}))} is equivalent to
1767@code{(set (cc0) (compare (reg:@var{m} @var{n}) (const_int 0)))}.
1768Use the former expression to save space during the compilation.
1769
1770@cindex jump instructions and @code{set}
1771@cindex @code{if_then_else} usage
1772If @var{lval} is @code{(pc)}, we have a jump instruction, and the
1773possibilities for @var{x} are very limited.  It may be a
1774@code{label_ref} expression (unconditional jump).  It may be an
1775@code{if_then_else} (conditional jump), in which case either the
1776second or the third operand must be @code{(pc)} (for the case which
1777does not jump) and the other of the two must be a @code{label_ref}
1778(for the case which does jump).  @var{x} may also be a @code{mem} or
1779@code{(plus:SI (pc) @var{y})}, where @var{y} may be a @code{reg} or a
1780@code{mem}; these unusual patterns are used to represent jumps through
1781branch tables.@refill
1782
1783If @var{lval} is neither @code{(cc0)} nor @code{(pc)}, the mode of
1784@var{lval} must not be @code{VOIDmode} and the mode of @var{x} must be
1785valid for the mode of @var{lval}.
1786
1787@findex SET_DEST
1788@findex SET_SRC
1789@var{lval} is customarily accessed with the @code{SET_DEST} macro and
1790@var{x} with the @code{SET_SRC} macro.
1791
1792@findex return
1793@item (return)
1794As the sole expression in a pattern, represents a return from the
1795current function, on machines where this can be done with one
1796instruction, such as Vaxes.  On machines where a multi-instruction
1797``epilogue'' must be executed in order to return from the function,
1798returning is done by jumping to a label which precedes the epilogue, and
1799the @code{return} expression code is never used.
1800
1801Inside an @code{if_then_else} expression, represents the value to be
1802placed in @code{pc} to return to the caller.
1803
1804Note that an insn pattern of @code{(return)} is logically equivalent to
1805@code{(set (pc) (return))}, but the latter form is never used.
1806
1807@findex call
1808@item (call @var{function} @var{nargs})
1809Represents a function call.  @var{function} is a @code{mem} expression
1810whose address is the address of the function to be called.
1811@var{nargs} is an expression which can be used for two purposes: on
1812some machines it represents the number of bytes of stack argument; on
1813others, it represents the number of argument registers.
1814
1815Each machine has a standard machine mode which @var{function} must
1816have.  The machine description defines macro @code{FUNCTION_MODE} to
1817expand into the requisite mode name.  The purpose of this mode is to
1818specify what kind of addressing is allowed, on machines where the
1819allowed kinds of addressing depend on the machine mode being
1820addressed.
1821
1822@findex clobber
1823@item (clobber @var{x})
1824Represents the storing or possible storing of an unpredictable,
1825undescribed value into @var{x}, which must be a @code{reg},
1826@code{scratch} or @code{mem} expression.
1827
1828One place this is used is in string instructions that store standard
1829values into particular hard registers.  It may not be worth the
1830trouble to describe the values that are stored, but it is essential to
1831inform the compiler that the registers will be altered, lest it
1832attempt to keep data in them across the string instruction.
1833
1834If @var{x} is @code{(mem:BLK (const_int 0))}, it means that all memory
1835locations must be presumed clobbered.
1836
1837Note that the machine description classifies certain hard registers as
1838``call-clobbered''.  All function call instructions are assumed by
1839default to clobber these registers, so there is no need to use
1840@code{clobber} expressions to indicate this fact.  Also, each function
1841call is assumed to have the potential to alter any memory location,
1842unless the function is declared @code{const}.
1843
1844If the last group of expressions in a @code{parallel} are each a
1845@code{clobber} expression whose arguments are @code{reg} or
1846@code{match_scratch} (@pxref{RTL Template}) expressions, the combiner
1847phase can add the appropriate @code{clobber} expressions to an insn it
1848has constructed when doing so will cause a pattern to be matched.
1849
1850This feature can be used, for example, on a machine that whose multiply
1851and add instructions don't use an MQ register but which has an
1852add-accumulate instruction that does clobber the MQ register.  Similarly,
1853a combined instruction might require a temporary register while the
1854constituent instructions might not.
1855
1856When a @code{clobber} expression for a register appears inside a
1857@code{parallel} with other side effects, the register allocator
1858guarantees that the register is unoccupied both before and after that
1859insn.  However, the reload phase may allocate a register used for one of
1860the inputs unless the @samp{&} constraint is specified for the selected
1861alternative (@pxref{Modifiers}).  You can clobber either a specific hard
1862register, a pseudo register, or a @code{scratch} expression; in the
1863latter two cases, GNU CC will allocate a hard register that is available
1864there for use as a temporary.
1865
1866For instructions that require a temporary register, you should use
1867@code{scratch} instead of a pseudo-register because this will allow the
1868combiner phase to add the @code{clobber} when required.  You do this by
1869coding (@code{clobber} (@code{match_scratch} @dots{})).  If you do
1870clobber a pseudo register, use one which appears nowhere else---generate
1871a new one each time.  Otherwise, you may confuse CSE.
1872
1873There is one other known use for clobbering a pseudo register in a
1874@code{parallel}: when one of the input operands of the insn is also
1875clobbered by the insn.  In this case, using the same pseudo register in
1876the clobber and elsewhere in the insn produces the expected results.
1877
1878@findex use
1879@item (use @var{x})
1880Represents the use of the value of @var{x}.  It indicates that the
1881value in @var{x} at this point in the program is needed, even though
1882it may not be apparent why this is so.  Therefore, the compiler will
1883not attempt to delete previous instructions whose only effect is to
1884store a value in @var{x}.  @var{x} must be a @code{reg} expression.
1885
1886During the delayed branch scheduling phase, @var{x} may be an insn.
1887This indicates that @var{x} previously was located at this place in the
1888code and its data dependencies need to be taken into account.  These
1889@code{use} insns will be deleted before the delayed branch scheduling
1890phase exits.
1891
1892@findex parallel
1893@item (parallel [@var{x0} @var{x1} @dots{}])
1894Represents several side effects performed in parallel.  The square
1895brackets stand for a vector; the operand of @code{parallel} is a
1896vector of expressions.  @var{x0}, @var{x1} and so on are individual
1897side effect expressions---expressions of code @code{set}, @code{call},
1898@code{return}, @code{clobber} or @code{use}.@refill
1899
1900``In parallel'' means that first all the values used in the individual
1901side-effects are computed, and second all the actual side-effects are
1902performed.  For example,
1903
1904@example
1905(parallel [(set (reg:SI 1) (mem:SI (reg:SI 1)))
1906           (set (mem:SI (reg:SI 1)) (reg:SI 1))])
1907@end example
1908
1909@noindent
1910says unambiguously that the values of hard register 1 and the memory
1911location addressed by it are interchanged.  In both places where
1912@code{(reg:SI 1)} appears as a memory address it refers to the value
1913in register 1 @emph{before} the execution of the insn.
1914
1915It follows that it is @emph{incorrect} to use @code{parallel} and
1916expect the result of one @code{set} to be available for the next one.
1917For example, people sometimes attempt to represent a jump-if-zero
1918instruction this way:
1919
1920@example
1921(parallel [(set (cc0) (reg:SI 34))
1922           (set (pc) (if_then_else
1923                        (eq (cc0) (const_int 0))
1924                        (label_ref @dots{})
1925                        (pc)))])
1926@end example
1927
1928@noindent
1929But this is incorrect, because it says that the jump condition depends
1930on the condition code value @emph{before} this instruction, not on the
1931new value that is set by this instruction.
1932
1933@cindex peephole optimization, RTL representation
1934Peephole optimization, which takes place together with final assembly
1935code output, can produce insns whose patterns consist of a @code{parallel}
1936whose elements are the operands needed to output the resulting
1937assembler code---often @code{reg}, @code{mem} or constant expressions.
1938This would not be well-formed RTL at any other stage in compilation,
1939but it is ok then because no further optimization remains to be done.
1940However, the definition of the macro @code{NOTICE_UPDATE_CC}, if
1941any, must deal with such insns if you define any peephole optimizations.
1942
1943@findex sequence
1944@item (sequence [@var{insns} @dots{}])
1945Represents a sequence of insns.  Each of the @var{insns} that appears
1946in the vector is suitable for appearing in the chain of insns, so it
1947must be an @code{insn}, @code{jump_insn}, @code{call_insn},
1948@code{code_label}, @code{barrier} or @code{note}.
1949
1950A @code{sequence} RTX is never placed in an actual insn during RTL
1951generation.  It represents the sequence of insns that result from a
1952@code{define_expand} @emph{before} those insns are passed to
1953@code{emit_insn} to insert them in the chain of insns.  When actually
1954inserted, the individual sub-insns are separated out and the
1955@code{sequence} is forgotten.
1956
1957After delay-slot scheduling is completed, an insn and all the insns that
1958reside in its delay slots are grouped together into a @code{sequence}.
1959The insn requiring the delay slot is the first insn in the vector;
1960subsequent insns are to be placed in the delay slot.
1961
1962@code{INSN_ANNULLED_BRANCH_P} is set on an insn in a delay slot to
1963indicate that a branch insn should be used that will conditionally annul
1964the effect of the insns in the delay slots.  In such a case,
1965@code{INSN_FROM_TARGET_P} indicates that the insn is from the target of
1966the branch and should be executed only if the branch is taken; otherwise
1967the insn should be executed only if the branch is not taken.
1968@xref{Delay Slots}.
1969@end table
1970
1971These expression codes appear in place of a side effect, as the body of
1972an insn, though strictly speaking they do not always describe side
1973effects as such:
1974
1975@table @code
1976@findex asm_input
1977@item (asm_input @var{s})
1978Represents literal assembler code as described by the string @var{s}.
1979
1980@findex unspec
1981@findex unspec_volatile
1982@item (unspec [@var{operands} @dots{}] @var{index})
1983@itemx (unspec_volatile [@var{operands} @dots{}] @var{index})
1984Represents a machine-specific operation on @var{operands}.  @var{index}
1985selects between multiple machine-specific operations.
1986@code{unspec_volatile} is used for volatile operations and operations
1987that may trap; @code{unspec} is used for other operations.
1988
1989These codes may appear inside a @code{pattern} of an
1990insn, inside a @code{parallel}, or inside an expression.
1991
1992@findex addr_vec
1993@item (addr_vec:@var{m} [@var{lr0} @var{lr1} @dots{}])
1994Represents a table of jump addresses.  The vector elements @var{lr0},
1995etc., are @code{label_ref} expressions.  The mode @var{m} specifies
1996how much space is given to each address; normally @var{m} would be
1997@code{Pmode}.
1998
1999@findex addr_diff_vec
2000@item (addr_diff_vec:@var{m} @var{base} [@var{lr0} @var{lr1} @dots{}])
2001Represents a table of jump addresses expressed as offsets from
2002@var{base}.  The vector elements @var{lr0}, etc., are @code{label_ref}
2003expressions and so is @var{base}.  The mode @var{m} specifies how much
2004space is given to each address-difference.@refill
2005@end table
2006
2007@node Incdec, Assembler, Side Effects, RTL
2008@section Embedded Side-Effects on Addresses
2009@cindex RTL preincrement
2010@cindex RTL postincrement
2011@cindex RTL predecrement
2012@cindex RTL postdecrement
2013
2014Four special side-effect expression codes appear as memory addresses.
2015
2016@table @code
2017@findex pre_dec
2018@item (pre_dec:@var{m} @var{x})
2019Represents the side effect of decrementing @var{x} by a standard
2020amount and represents also the value that @var{x} has after being
2021decremented.  @var{x} must be a @code{reg} or @code{mem}, but most
2022machines allow only a @code{reg}.  @var{m} must be the machine mode
2023for pointers on the machine in use.  The amount @var{x} is decremented
2024by is the length in bytes of the machine mode of the containing memory
2025reference of which this expression serves as the address.  Here is an
2026example of its use:@refill
2027
2028@example
2029(mem:DF (pre_dec:SI (reg:SI 39)))
2030@end example
2031
2032@noindent
2033This says to decrement pseudo register 39 by the length of a @code{DFmode}
2034value and use the result to address a @code{DFmode} value.
2035
2036@findex pre_inc
2037@item (pre_inc:@var{m} @var{x})
2038Similar, but specifies incrementing @var{x} instead of decrementing it.
2039
2040@findex post_dec
2041@item (post_dec:@var{m} @var{x})
2042Represents the same side effect as @code{pre_dec} but a different
2043value.  The value represented here is the value @var{x} has @i{before}
2044being decremented.
2045
2046@findex post_inc
2047@item (post_inc:@var{m} @var{x})
2048Similar, but specifies incrementing @var{x} instead of decrementing it.
2049@end table
2050
2051These embedded side effect expressions must be used with care.  Instruction
2052patterns may not use them.  Until the @samp{flow} pass of the compiler,
2053they may occur only to represent pushes onto the stack.  The @samp{flow}
2054pass finds cases where registers are incremented or decremented in one
2055instruction and used as an address shortly before or after; these cases are
2056then transformed to use pre- or post-increment or -decrement.
2057
2058If a register used as the operand of these expressions is used in
2059another address in an insn, the original value of the register is used.
2060Uses of the register outside of an address are not permitted within the
2061same insn as a use in an embedded side effect expression because such
2062insns behave differently on different machines and hence must be treated
2063as ambiguous and disallowed.
2064
2065An instruction that can be represented with an embedded side effect
2066could also be represented using @code{parallel} containing an additional
2067@code{set} to describe how the address register is altered.  This is not
2068done because machines that allow these operations at all typically
2069allow them wherever a memory address is called for.  Describing them as
2070additional parallel stores would require doubling the number of entries
2071in the machine description.
2072
2073@node Assembler, Insns, Incdec, RTL
2074@section Assembler Instructions as Expressions
2075@cindex assembler instructions in RTL
2076
2077@cindex @code{asm_operands}, usage
2078The RTX code @code{asm_operands} represents a value produced by a
2079user-specified assembler instruction.  It is used to represent
2080an @code{asm} statement with arguments.  An @code{asm} statement with
2081a single output operand, like this:
2082
2083@smallexample
2084asm ("foo %1,%2,%0" : "=a" (outputvar) : "g" (x + y), "di" (*z));
2085@end smallexample
2086
2087@noindent
2088is represented using a single @code{asm_operands} RTX which represents
2089the value that is stored in @code{outputvar}:
2090
2091@smallexample
2092(set @var{rtx-for-outputvar}
2093     (asm_operands "foo %1,%2,%0" "a" 0
2094                   [@var{rtx-for-addition-result} @var{rtx-for-*z}]
2095                   [(asm_input:@var{m1} "g")
2096                    (asm_input:@var{m2} "di")]))
2097@end smallexample
2098
2099@noindent
2100Here the operands of the @code{asm_operands} RTX are the assembler
2101template string, the output-operand's constraint, the index-number of the
2102output operand among the output operands specified, a vector of input
2103operand RTX's, and a vector of input-operand modes and constraints.  The
2104mode @var{m1} is the mode of the sum @code{x+y}; @var{m2} is that of
2105@code{*z}.
2106
2107When an @code{asm} statement has multiple output values, its insn has
2108several such @code{set} RTX's inside of a @code{parallel}.  Each @code{set}
2109contains a @code{asm_operands}; all of these share the same assembler
2110template and vectors, but each contains the constraint for the respective
2111output operand.  They are also distinguished by the output-operand index
2112number, which is 0, 1, @dots{} for successive output operands.
2113
2114@node Insns, Calls, Assembler, RTL
2115@section Insns
2116@cindex insns
2117
2118The RTL representation of the code for a function is a doubly-linked
2119chain of objects called @dfn{insns}.  Insns are expressions with
2120special codes that are used for no other purpose.  Some insns are
2121actual instructions; others represent dispatch tables for @code{switch}
2122statements; others represent labels to jump to or various sorts of
2123declarative information.
2124
2125In addition to its own specific data, each insn must have a unique
2126id-number that distinguishes it from all other insns in the current
2127function (after delayed branch scheduling, copies of an insn with the
2128same id-number may be present in multiple places in a function, but
2129these copies will always be identical and will only appear inside a
2130@code{sequence}), and chain pointers to the preceding and following
2131insns.  These three fields occupy the same position in every insn,
2132independent of the expression code of the insn.  They could be accessed
2133with @code{XEXP} and @code{XINT}, but instead three special macros are
2134always used:
2135
2136@table @code
2137@findex INSN_UID
2138@item INSN_UID (@var{i})
2139Accesses the unique id of insn @var{i}.
2140
2141@findex PREV_INSN
2142@item PREV_INSN (@var{i})
2143Accesses the chain pointer to the insn preceding @var{i}.
2144If @var{i} is the first insn, this is a null pointer.
2145
2146@findex NEXT_INSN
2147@item NEXT_INSN (@var{i})
2148Accesses the chain pointer to the insn following @var{i}.
2149If @var{i} is the last insn, this is a null pointer.
2150@end table
2151
2152@findex get_insns
2153@findex get_last_insn
2154The first insn in the chain is obtained by calling @code{get_insns}; the
2155last insn is the result of calling @code{get_last_insn}.  Within the
2156chain delimited by these insns, the @code{NEXT_INSN} and
2157@code{PREV_INSN} pointers must always correspond: if @var{insn} is not
2158the first insn,
2159
2160@example
2161NEXT_INSN (PREV_INSN (@var{insn})) == @var{insn}
2162@end example
2163
2164@noindent
2165is always true and if @var{insn} is not the last insn,
2166
2167@example
2168PREV_INSN (NEXT_INSN (@var{insn})) == @var{insn}
2169@end example
2170
2171@noindent
2172is always true.
2173
2174After delay slot scheduling, some of the insns in the chain might be
2175@code{sequence} expressions, which contain a vector of insns.  The value
2176of @code{NEXT_INSN} in all but the last of these insns is the next insn
2177in the vector; the value of @code{NEXT_INSN} of the last insn in the vector
2178is the same as the value of @code{NEXT_INSN} for the @code{sequence} in
2179which it is contained.  Similar rules apply for @code{PREV_INSN}.
2180
2181This means that the above invariants are not necessarily true for insns
2182inside @code{sequence} expressions.  Specifically, if @var{insn} is the
2183first insn in a @code{sequence}, @code{NEXT_INSN (PREV_INSN (@var{insn}))}
2184is the insn containing the @code{sequence} expression, as is the value
2185of @code{PREV_INSN (NEXT_INSN (@var{insn}))} is @var{insn} is the last
2186insn in the @code{sequence} expression.  You can use these expressions
2187to find the containing @code{sequence} expression.@refill
2188
2189Every insn has one of the following six expression codes:
2190
2191@table @code
2192@findex insn
2193@item insn
2194The expression code @code{insn} is used for instructions that do not jump
2195and do not do function calls.  @code{sequence} expressions are always
2196contained in insns with code @code{insn} even if one of those insns
2197should jump or do function calls.
2198
2199Insns with code @code{insn} have four additional fields beyond the three
2200mandatory ones listed above.  These four are described in a table below.
2201
2202@findex jump_insn
2203@item jump_insn
2204The expression code @code{jump_insn} is used for instructions that may
2205jump (or, more generally, may contain @code{label_ref} expressions).  If
2206there is an instruction to return from the current function, it is
2207recorded as a @code{jump_insn}.
2208
2209@findex JUMP_LABEL
2210@code{jump_insn} insns have the same extra fields as @code{insn} insns,
2211accessed in the same way and in addition contain a field
2212@code{JUMP_LABEL} which is defined once jump optimization has completed.
2213
2214For simple conditional and unconditional jumps, this field contains the
2215@code{code_label} to which this insn will (possibly conditionally)
2216branch.  In a more complex jump, @code{JUMP_LABEL} records one of the
2217labels that the insn refers to; the only way to find the others
2218is to scan the entire body of the insn.
2219
2220Return insns count as jumps, but since they do not refer to any labels,
2221they have zero in the @code{JUMP_LABEL} field.
2222
2223@findex call_insn
2224@item call_insn
2225The expression code @code{call_insn} is used for instructions that may do
2226function calls.  It is important to distinguish these instructions because
2227they imply that certain registers and memory locations may be altered
2228unpredictably.
2229
2230@findex CALL_INSN_FUNCTION_USAGE
2231@code{call_insn} insns have the same extra fields as @code{insn} insns,
2232accessed in the same way and in addition contain a field
2233@code{CALL_INSN_FUNCTION_USAGE}, which contains a list (chain of
2234@code{expr_list} expressions) containing @code{use} and @code{clobber}
2235expressions that denote hard registers used or clobbered by the called
2236function.  A register specified in a @code{clobber} in this list is
2237modified @emph{after} the execution of the @code{call_insn}, while a
2238register in a @code{clobber} in the body of the @code{call_insn} is
2239clobbered before the insn completes execution.  @code{clobber}
2240expressions in this list augment registers specified in
2241@code{CALL_USED_REGISTERS} (@pxref{Register Basics}).
2242
2243@findex code_label
2244@findex CODE_LABEL_NUMBER
2245@item code_label
2246A @code{code_label} insn represents a label that a jump insn can jump
2247to.  It contains two special fields of data in addition to the three
2248standard ones.  @code{CODE_LABEL_NUMBER} is used to hold the @dfn{label
2249number}, a number that identifies this label uniquely among all the
2250labels in the compilation (not just in the current function).
2251Ultimately, the label is represented in the assembler output as an
2252assembler label, usually of the form @samp{L@var{n}} where @var{n} is
2253the label number.
2254
2255When a @code{code_label} appears in an RTL expression, it normally
2256appears within a @code{label_ref} which represents the address of
2257the label, as a number.
2258
2259@findex LABEL_NUSES
2260The field @code{LABEL_NUSES} is only defined once the jump optimization
2261phase is completed and contains the number of times this label is
2262referenced in the current function.
2263
2264@findex barrier
2265@item barrier
2266Barriers are placed in the instruction stream when control cannot flow
2267past them.  They are placed after unconditional jump instructions to
2268indicate that the jumps are unconditional and after calls to
2269@code{volatile} functions, which do not return (e.g., @code{exit}).
2270They contain no information beyond the three standard fields.
2271
2272@findex note
2273@findex NOTE_LINE_NUMBER
2274@findex NOTE_SOURCE_FILE
2275@item note
2276@code{note} insns are used to represent additional debugging and
2277declarative information.  They contain two nonstandard fields, an
2278integer which is accessed with the macro @code{NOTE_LINE_NUMBER} and a
2279string accessed with @code{NOTE_SOURCE_FILE}.
2280
2281If @code{NOTE_LINE_NUMBER} is positive, the note represents the
2282position of a source line and @code{NOTE_SOURCE_FILE} is the source file name
2283that the line came from.  These notes control generation of line
2284number data in the assembler output.
2285
2286Otherwise, @code{NOTE_LINE_NUMBER} is not really a line number but a
2287code with one of the following values (and @code{NOTE_SOURCE_FILE}
2288must contain a null pointer):
2289
2290@table @code
2291@findex NOTE_INSN_DELETED
2292@item NOTE_INSN_DELETED
2293Such a note is completely ignorable.  Some passes of the compiler
2294delete insns by altering them into notes of this kind.
2295
2296@findex NOTE_INSN_BLOCK_BEG
2297@findex NOTE_INSN_BLOCK_END
2298@item NOTE_INSN_BLOCK_BEG
2299@itemx NOTE_INSN_BLOCK_END
2300These types of notes indicate the position of the beginning and end
2301of a level of scoping of variable names.  They control the output
2302of debugging information.
2303
2304@findex NOTE_INSN_EH_REGION_BEG
2305@findex NOTE_INSN_EH_REGION_END
2306@item NOTE_INSN_EH_REGION_BEG
2307@itemx NOTE_INSN_EH_REGION_END
2308These types of notes indicate the position of the beginning and end of a
2309level of scoping for exception handling.  @code{NOTE_BLOCK_NUMBER}
2310identifies which @code{CODE_LABEL} is associated with the given region.
2311
2312@findex NOTE_INSN_LOOP_BEG
2313@findex NOTE_INSN_LOOP_END
2314@item NOTE_INSN_LOOP_BEG
2315@itemx NOTE_INSN_LOOP_END
2316These types of notes indicate the position of the beginning and end
2317of a @code{while} or @code{for} loop.  They enable the loop optimizer
2318to find loops quickly.
2319
2320@findex NOTE_INSN_LOOP_CONT
2321@item NOTE_INSN_LOOP_CONT
2322Appears at the place in a loop that @code{continue} statements jump to.
2323
2324@findex NOTE_INSN_LOOP_VTOP
2325@item NOTE_INSN_LOOP_VTOP
2326This note indicates the place in a loop where the exit test begins for
2327those loops in which the exit test has been duplicated.  This position
2328becomes another virtual start of the loop when considering loop
2329invariants.
2330
2331@findex NOTE_INSN_FUNCTION_END
2332@item NOTE_INSN_FUNCTION_END
2333Appears near the end of the function body, just before the label that
2334@code{return} statements jump to (on machine where a single instruction
2335does not suffice for returning).  This note may be deleted by jump
2336optimization.
2337
2338@findex NOTE_INSN_SETJMP
2339@item NOTE_INSN_SETJMP
2340Appears following each call to @code{setjmp} or a related function.
2341@end table
2342
2343These codes are printed symbolically when they appear in debugging dumps.
2344@end table
2345
2346@cindex @code{HImode}, in @code{insn}
2347@cindex @code{QImode}, in @code{insn}
2348The machine mode of an insn is normally @code{VOIDmode}, but some
2349phases use the mode for various purposes; for example, the reload pass
2350sets it to @code{HImode} if the insn needs reloading but not register
2351elimination and @code{QImode} if both are required.  The common
2352subexpression elimination pass sets the mode of an insn to @code{QImode}
2353when it is the first insn in a block that has already been processed.
2354
2355Here is a table of the extra fields of @code{insn}, @code{jump_insn}
2356and @code{call_insn} insns:
2357
2358@table @code
2359@findex PATTERN
2360@item PATTERN (@var{i})
2361An expression for the side effect performed by this insn.  This must be
2362one of the following codes: @code{set}, @code{call}, @code{use},
2363@code{clobber}, @code{return}, @code{asm_input}, @code{asm_output},
2364@code{addr_vec}, @code{addr_diff_vec}, @code{trap_if}, @code{unspec},
2365@code{unspec_volatile}, @code{parallel}, or @code{sequence}.  If it is a @code{parallel},
2366each element of the @code{parallel} must be one these codes, except that
2367@code{parallel} expressions cannot be nested and @code{addr_vec} and
2368@code{addr_diff_vec} are not permitted inside a @code{parallel} expression.
2369
2370@findex INSN_CODE
2371@item INSN_CODE (@var{i})
2372An integer that says which pattern in the machine description matches
2373this insn, or -1 if the matching has not yet been attempted.
2374
2375Such matching is never attempted and this field remains -1 on an insn
2376whose pattern consists of a single @code{use}, @code{clobber},
2377@code{asm_input}, @code{addr_vec} or @code{addr_diff_vec} expression.
2378
2379@findex asm_noperands
2380Matching is also never attempted on insns that result from an @code{asm}
2381statement.  These contain at least one @code{asm_operands} expression.
2382The function @code{asm_noperands} returns a non-negative value for
2383such insns.
2384
2385In the debugging output, this field is printed as a number followed by
2386a symbolic representation that locates the pattern in the @file{md}
2387file as some small positive or negative offset from a named pattern.
2388
2389@findex LOG_LINKS
2390@item LOG_LINKS (@var{i})
2391A list (chain of @code{insn_list} expressions) giving information about
2392dependencies between instructions within a basic block.  Neither a jump
2393nor a label may come between the related insns.
2394
2395@findex REG_NOTES
2396@item REG_NOTES (@var{i})
2397A list (chain of @code{expr_list} and @code{insn_list} expressions)
2398giving miscellaneous information about the insn.  It is often
2399information pertaining to the registers used in this insn.
2400@end table
2401
2402The @code{LOG_LINKS} field of an insn is a chain of @code{insn_list}
2403expressions.  Each of these has two operands: the first is an insn,
2404and the second is another @code{insn_list} expression (the next one in
2405the chain).  The last @code{insn_list} in the chain has a null pointer
2406as second operand.  The significant thing about the chain is which
2407insns appear in it (as first operands of @code{insn_list}
2408expressions).  Their order is not significant.
2409
2410This list is originally set up by the flow analysis pass; it is a null
2411pointer until then.  Flow only adds links for those data dependencies
2412which can be used for instruction combination.  For each insn, the flow
2413analysis pass adds a link to insns which store into registers values
2414that are used for the first time in this insn.  The instruction
2415scheduling pass adds extra links so that every dependence will be
2416represented.  Links represent data dependencies, antidependencies and
2417output dependencies; the machine mode of the link distinguishes these
2418three types: antidependencies have mode @code{REG_DEP_ANTI}, output
2419dependencies have mode @code{REG_DEP_OUTPUT}, and data dependencies have
2420mode @code{VOIDmode}.
2421
2422The @code{REG_NOTES} field of an insn is a chain similar to the
2423@code{LOG_LINKS} field but it includes @code{expr_list} expressions in
2424addition to @code{insn_list} expressions.  There are several kinds of
2425register notes, which are distinguished by the machine mode, which in a
2426register note is really understood as being an @code{enum reg_note}.
2427The first operand @var{op} of the note is data whose meaning depends on
2428the kind of note.
2429
2430@findex REG_NOTE_KIND
2431@findex PUT_REG_NOTE_KIND
2432The macro @code{REG_NOTE_KIND (@var{x})} returns the kind of
2433register note.  Its counterpart, the macro @code{PUT_REG_NOTE_KIND
2434(@var{x}, @var{newkind})} sets the register note type of @var{x} to be
2435@var{newkind}.
2436
2437Register notes are of three classes: They may say something about an
2438input to an insn, they may say something about an output of an insn, or
2439they may create a linkage between two insns.  There are also a set
2440of values that are only used in @code{LOG_LINKS}.
2441
2442These register notes annotate inputs to an insn:
2443
2444@table @code
2445@findex REG_DEAD
2446@item REG_DEAD
2447The value in @var{op} dies in this insn; that is to say, altering the
2448value immediately after this insn would not affect the future behavior
2449of the program. 
2450
2451This does not necessarily mean that the register @var{op} has no useful
2452value after this insn since it may also be an output of the insn.  In
2453such a case, however, a @code{REG_DEAD} note would be redundant and is
2454usually not present until after the reload pass, but no code relies on
2455this fact.
2456
2457@findex REG_INC
2458@item REG_INC
2459The register @var{op} is incremented (or decremented; at this level
2460there is no distinction) by an embedded side effect inside this insn.
2461This means it appears in a @code{post_inc}, @code{pre_inc},
2462@code{post_dec} or @code{pre_dec} expression.
2463
2464@findex REG_NONNEG
2465@item REG_NONNEG
2466The register @var{op} is known to have a nonnegative value when this
2467insn is reached.  This is used so that decrement and branch until zero
2468instructions, such as the m68k dbra, can be matched.
2469
2470The @code{REG_NONNEG} note is added to insns only if the machine
2471description has a @samp{decrement_and_branch_until_zero} pattern.
2472
2473@findex REG_NO_CONFLICT
2474@item REG_NO_CONFLICT
2475This insn does not cause a conflict between @var{op} and the item
2476being set by this insn even though it might appear that it does.
2477In other words, if the destination register and @var{op} could
2478otherwise be assigned the same register, this insn does not
2479prevent that assignment.
2480
2481Insns with this note are usually part of a block that begins with a
2482@code{clobber} insn specifying a multi-word pseudo register (which will
2483be the output of the block), a group of insns that each set one word of
2484the value and have the @code{REG_NO_CONFLICT} note attached, and a final
2485insn that copies the output to itself with an attached @code{REG_EQUAL}
2486note giving the expression being computed.  This block is encapsulated
2487with @code{REG_LIBCALL} and @code{REG_RETVAL} notes on the first and
2488last insns, respectively.
2489
2490@findex REG_LABEL
2491@item REG_LABEL
2492This insn uses @var{op}, a @code{code_label}, but is not a
2493@code{jump_insn}.  The presence of this note allows jump optimization to
2494be aware that @var{op} is, in fact, being used.
2495@end table
2496
2497The following notes describe attributes of outputs of an insn:
2498
2499@table @code
2500@findex REG_EQUIV
2501@findex REG_EQUAL
2502@item REG_EQUIV
2503@itemx REG_EQUAL
2504This note is only valid on an insn that sets only one register and
2505indicates that that register will be equal to @var{op} at run time; the
2506scope of this equivalence differs between the two types of notes.  The
2507value which the insn explicitly copies into the register may look
2508different from @var{op}, but they will be equal at run time.  If the
2509output of the single @code{set} is a @code{strict_low_part} expression,
2510the note refers to the register that is contained in @code{SUBREG_REG}
2511of the @code{subreg} expression.
2512 
2513For @code{REG_EQUIV}, the register is equivalent to @var{op} throughout
2514the entire function, and could validly be replaced in all its
2515occurrences by @var{op}.  (``Validly'' here refers to the data flow of
2516the program; simple replacement may make some insns invalid.)  For
2517example, when a constant is loaded into a register that is never
2518assigned any other value, this kind of note is used.
2519
2520When a parameter is copied into a pseudo-register at entry to a function,
2521a note of this kind records that the register is equivalent to the stack
2522slot where the parameter was passed.  Although in this case the register
2523may be set by other insns, it is still valid to replace the register
2524by the stack slot throughout the function.
2525
2526A @code{REG_EQUIV} note is also used on an instruction which copies a
2527register parameter into a pseudo-register at entry to a function, if
2528there is a stack slot where that parameter could be stored.  Although
2529other insns may set the pseudo-register, it is valid for the compiler to
2530replace the pseudo-register by stack slot throughout the function,
2531provided the compiler ensures that the stack slot is properly
2532initialized by making the replacement in the initial copy instruction as
2533well.  This is used on machines for which the calling convention
2534allocates stack space for register parameters.  See
2535@code{REG_PARM_STACK_SPACE} in @ref{Stack Arguments}.
2536
2537In the case of @code{REG_EQUAL}, the register that is set by this insn
2538will be equal to @var{op} at run time at the end of this insn but not
2539necessarily elsewhere in the function.  In this case, @var{op}
2540is typically an arithmetic expression.  For example, when a sequence of
2541insns such as a library call is used to perform an arithmetic operation,
2542this kind of note is attached to the insn that produces or copies the
2543final value.
2544
2545These two notes are used in different ways by the compiler passes.
2546@code{REG_EQUAL} is used by passes prior to register allocation (such as
2547common subexpression elimination and loop optimization) to tell them how
2548to think of that value.  @code{REG_EQUIV} notes are used by register
2549allocation to indicate that there is an available substitute expression
2550(either a constant or a @code{mem} expression for the location of a
2551parameter on the stack) that may be used in place of a register if
2552insufficient registers are available.
2553
2554Except for stack homes for parameters, which are indicated by a
2555@code{REG_EQUIV} note and are not useful to the early optimization
2556passes and pseudo registers that are equivalent to a memory location
2557throughout there entire life, which is not detected until later in
2558the compilation, all equivalences are initially indicated by an attached
2559@code{REG_EQUAL} note.  In the early stages of register allocation, a
2560@code{REG_EQUAL} note is changed into a @code{REG_EQUIV} note if
2561@var{op} is a constant and the insn represents the only set of its
2562destination register.
2563
2564Thus, compiler passes prior to register allocation need only check for
2565@code{REG_EQUAL} notes and passes subsequent to register allocation
2566need only check for @code{REG_EQUIV} notes.
2567
2568@findex REG_UNUSED
2569@item REG_UNUSED
2570The register @var{op} being set by this insn will not be used in a
2571subsequent insn.  This differs from a @code{REG_DEAD} note, which
2572indicates that the value in an input will not be used subsequently.
2573These two notes are independent; both may be present for the same
2574register.
2575
2576@findex REG_WAS_0
2577@item REG_WAS_0
2578The single output of this insn contained zero before this insn.
2579@var{op} is the insn that set it to zero.  You can rely on this note if
2580it is present and @var{op} has not been deleted or turned into a @code{note};
2581its absence implies nothing.
2582@end table
2583
2584These notes describe linkages between insns.  They occur in pairs: one
2585insn has one of a pair of notes that points to a second insn, which has
2586the inverse note pointing back to the first insn.
2587
2588@table @code
2589@findex REG_RETVAL
2590@item REG_RETVAL
2591This insn copies the value of a multi-insn sequence (for example, a
2592library call), and @var{op} is the first insn of the sequence (for a
2593library call, the first insn that was generated to set up the arguments
2594for the library call).
2595
2596Loop optimization uses this note to treat such a sequence as a single
2597operation for code motion purposes and flow analysis uses this note to
2598delete such sequences whose results are dead.
2599
2600A @code{REG_EQUAL} note will also usually be attached to this insn to
2601provide the expression being computed by the sequence.
2602
2603@findex REG_LIBCALL
2604@item REG_LIBCALL
2605This is the inverse of @code{REG_RETVAL}: it is placed on the first
2606insn of a multi-insn sequence, and it points to the last one.
2607
2608@findex REG_CC_SETTER
2609@findex REG_CC_USER
2610@item REG_CC_SETTER
2611@itemx REG_CC_USER
2612On machines that use @code{cc0}, the insns which set and use @code{cc0}
2613set and use @code{cc0} are adjacent.  However, when branch delay slot
2614filling is done, this may no longer be true.  In this case a
2615@code{REG_CC_USER} note will be placed on the insn setting @code{cc0} to
2616point to the insn using @code{cc0} and a @code{REG_CC_SETTER} note will
2617be placed on the insn using @code{cc0} to point to the insn setting
2618@code{cc0}.@refill
2619@end table
2620
2621These values are only used in the @code{LOG_LINKS} field, and indicate
2622the type of dependency that each link represents.  Links which indicate
2623a data dependence (a read after write dependence) do not use any code,
2624they simply have mode @code{VOIDmode}, and are printed without any
2625descriptive text.
2626
2627@table @code
2628@findex REG_DEP_ANTI
2629@item REG_DEP_ANTI
2630This indicates an anti dependence (a write after read dependence).
2631
2632@findex REG_DEP_OUTPUT
2633@item REG_DEP_OUTPUT
2634This indicates an output dependence (a write after write dependence).
2635@end table
2636
2637These notes describe information gathered from gcov profile data.  They
2638are stored in the @code{REG_NOTES} field of an insn as an
2639@code{expr_list}.
2640
2641@table @code
2642@findex REG_EXEC_COUNT
2643@item REG_EXEC_COUNT
2644This is used to indicate the number of times a basic block was executed
2645according to the profile data.  The note is attached to the first insn in
2646the basic block.
2647
2648@findex REG_BR_PROB
2649@item REG_BR_PROB
2650This is used to specify the ratio of branches to non-branches of a
2651branch insn according to the profile data.  The value is stored as a
2652value between 0 and REG_BR_PROB_BASE; larger values indicate a higher
2653probability that the branch will be taken.
2654@end table
2655
2656For convenience, the machine mode in an @code{insn_list} or
2657@code{expr_list} is printed using these symbolic codes in debugging dumps.
2658
2659@findex insn_list
2660@findex expr_list
2661The only difference between the expression codes @code{insn_list} and
2662@code{expr_list} is that the first operand of an @code{insn_list} is
2663assumed to be an insn and is printed in debugging dumps as the insn's
2664unique id; the first operand of an @code{expr_list} is printed in the
2665ordinary way as an expression.
2666
2667@node Calls, Sharing, Insns, RTL
2668@section RTL Representation of Function-Call Insns
2669@cindex calling functions in RTL
2670@cindex RTL function-call insns
2671@cindex function-call insns
2672
2673Insns that call subroutines have the RTL expression code @code{call_insn}.
2674These insns must satisfy special rules, and their bodies must use a special
2675RTL expression code, @code{call}.
2676
2677@cindex @code{call} usage
2678A @code{call} expression has two operands, as follows:
2679
2680@example
2681(call (mem:@var{fm} @var{addr}) @var{nbytes})
2682@end example
2683
2684@noindent
2685Here @var{nbytes} is an operand that represents the number of bytes of
2686argument data being passed to the subroutine, @var{fm} is a machine mode
2687(which must equal as the definition of the @code{FUNCTION_MODE} macro in
2688the machine description) and @var{addr} represents the address of the
2689subroutine.
2690
2691For a subroutine that returns no value, the @code{call} expression as
2692shown above is the entire body of the insn, except that the insn might
2693also contain @code{use} or @code{clobber} expressions.
2694
2695@cindex @code{BLKmode}, and function return values
2696For a subroutine that returns a value whose mode is not @code{BLKmode},
2697the value is returned in a hard register.  If this register's number is
2698@var{r}, then the body of the call insn looks like this:
2699
2700@example
2701(set (reg:@var{m} @var{r})
2702     (call (mem:@var{fm} @var{addr}) @var{nbytes}))
2703@end example
2704
2705@noindent
2706This RTL expression makes it clear (to the optimizer passes) that the
2707appropriate register receives a useful value in this insn.
2708
2709When a subroutine returns a @code{BLKmode} value, it is handled by
2710passing to the subroutine the address of a place to store the value.
2711So the call insn itself does not ``return'' any value, and it has the
2712same RTL form as a call that returns nothing.
2713
2714On some machines, the call instruction itself clobbers some register,
2715for example to contain the return address.  @code{call_insn} insns
2716on these machines should have a body which is a @code{parallel}
2717that contains both the @code{call} expression and @code{clobber}
2718expressions that indicate which registers are destroyed.  Similarly,
2719if the call instruction requires some register other than the stack
2720pointer that is not explicitly mentioned it its RTL, a @code{use}
2721subexpression should mention that register.
2722
2723Functions that are called are assumed to modify all registers listed in
2724the configuration macro @code{CALL_USED_REGISTERS} (@pxref{Register
2725Basics}) and, with the exception of @code{const} functions and library
2726calls, to modify all of memory.
2727
2728Insns containing just @code{use} expressions directly precede the
2729@code{call_insn} insn to indicate which registers contain inputs to the
2730function.  Similarly, if registers other than those in
2731@code{CALL_USED_REGISTERS} are clobbered by the called function, insns
2732containing a single @code{clobber} follow immediately after the call to
2733indicate which registers.
2734
2735@node Sharing
2736@section Structure Sharing Assumptions
2737@cindex sharing of RTL components
2738@cindex RTL structure sharing assumptions
2739
2740The compiler assumes that certain kinds of RTL expressions are unique;
2741there do not exist two distinct objects representing the same value.
2742In other cases, it makes an opposite assumption: that no RTL expression
2743object of a certain kind appears in more than one place in the
2744containing structure.
2745
2746These assumptions refer to a single function; except for the RTL
2747objects that describe global variables and external functions,
2748and a few standard objects such as small integer constants,
2749no RTL objects are common to two functions.
2750
2751@itemize @bullet
2752@cindex @code{reg}, RTL sharing
2753@item
2754Each pseudo-register has only a single @code{reg} object to represent it,
2755and therefore only a single machine mode.
2756
2757@cindex symbolic label
2758@cindex @code{symbol_ref}, RTL sharing
2759@item
2760For any symbolic label, there is only one @code{symbol_ref} object
2761referring to it.
2762
2763@cindex @code{const_int}, RTL sharing
2764@item
2765There is only one @code{const_int} expression with value 0, only
2766one with value 1, and only one with value @minus{}1.
2767Some other integer values are also stored uniquely.
2768
2769@cindex @code{pc}, RTL sharing
2770@item
2771There is only one @code{pc} expression.
2772
2773@cindex @code{cc0}, RTL sharing
2774@item
2775There is only one @code{cc0} expression.
2776
2777@cindex @code{const_double}, RTL sharing
2778@item
2779There is only one @code{const_double} expression with value 0 for
2780each floating point mode.  Likewise for values 1 and 2.
2781
2782@cindex @code{label_ref}, RTL sharing
2783@cindex @code{scratch}, RTL sharing
2784@item
2785No @code{label_ref} or @code{scratch} appears in more than one place in
2786the RTL structure; in other words, it is safe to do a tree-walk of all
2787the insns in the function and assume that each time a @code{label_ref}
2788or @code{scratch} is seen it is distinct from all others that are seen.
2789
2790@cindex @code{mem}, RTL sharing
2791@item
2792Only one @code{mem} object is normally created for each static
2793variable or stack slot, so these objects are frequently shared in all
2794the places they appear.  However, separate but equal objects for these
2795variables are occasionally made.
2796
2797@cindex @code{asm_operands}, RTL sharing
2798@item
2799When a single @code{asm} statement has multiple output operands, a
2800distinct @code{asm_operands} expression is made for each output operand.
2801However, these all share the vector which contains the sequence of input
2802operands.  This sharing is used later on to test whether two
2803@code{asm_operands} expressions come from the same statement, so all
2804optimizations must carefully preserve the sharing if they copy the
2805vector at all.
2806
2807@item
2808No RTL object appears in more than one place in the RTL structure
2809except as described above.  Many passes of the compiler rely on this
2810by assuming that they can modify RTL objects in place without unwanted
2811side-effects on other insns.
2812
2813@findex unshare_all_rtl
2814@item
2815During initial RTL generation, shared structure is freely introduced.
2816After all the RTL for a function has been generated, all shared
2817structure is copied by @code{unshare_all_rtl} in @file{emit-rtl.c},
2818after which the above rules are guaranteed to be followed.
2819
2820@findex copy_rtx_if_shared
2821@item
2822During the combiner pass, shared structure within an insn can exist
2823temporarily.  However, the shared structure is copied before the
2824combiner is finished with the insn.  This is done by calling
2825@code{copy_rtx_if_shared}, which is a subroutine of
2826@code{unshare_all_rtl}.
2827@end itemize
2828
2829@node Reading RTL
2830@section Reading RTL
2831
2832To read an RTL object from a file, call @code{read_rtx}.  It takes one
2833argument, a stdio stream, and returns a single RTL object.
2834
2835Reading RTL from a file is very slow.  This is not currently a
2836problem since reading RTL occurs only as part of building the
2837compiler.
2838
2839People frequently have the idea of using RTL stored as text in a file as
2840an interface between a language front end and the bulk of GNU CC.  This
2841idea is not feasible.
2842
2843GNU CC was designed to use RTL internally only.  Correct RTL for a given
2844program is very dependent on the particular target machine.  And the RTL
2845does not contain all the information about the program.
2846
2847The proper way to interface GNU CC to a new language front end is with
2848the ``tree'' data structure.  There is no manual for this data
2849structure, but it is described in the files @file{tree.h} and
2850@file{tree.def}.
Note: See TracBrowser for help on using the repository browser.