Side Effect Expressions#
The expression codes described so far represent values, not actions. But machine instructions never produce values; they are meaningful only for their side effects on the state of the machine. Special expression codes are used to represent side effects.
The body of an instruction is always one of these side effect codes; the codes described above, which represent values, appear only as the operands of these.
(set lval x)Represents the action of storing the value of
xinto the place represented bylval.lvalmust be an expression representing a place that can be stored in:reg(orsubreg,strict_low_partorzero_extract),mem,pc, orparallel.If
lvalis areg,subregormem, it has a machine mode; thenxmust be valid for that mode.If
lvalis aregwhose machine mode is less than the full width of the register, then it means that the part of the register specified by the machine mode is given the specified value and the rest of the register receives an undefined value. Likewise, iflvalis asubregwhose machine mode is narrower than the mode of the register, the rest of the register can be changed in an undefined way.If
lvalis astrict_low_partof a subreg, then the part of the register specified by the machine mode of thesubregis given the valuexand the rest of the register is not changed.If
lvalis azero_extract, then the referenced part of the bit-field (a memory or register reference) specified by thezero_extractis given the valuexand the rest of the bit-field is not changed. Note thatsign_extractcannot appear inlval.If
lvalis aparallel, it is used to represent the case of a function returning a structure in multiple registers. Each element of theparallelis anexpr_listwhose first operand is aregand whose second operand is aconst_intrepresenting the offset (in bytes) into the structure at which the data in that register corresponds. The first element may be null to indicate that the structure is also passed partly in memory.If
lvalis(pc), we have a jump instruction, and the possibilities forxare very limited. It may be alabel_refexpression (unconditional jump). It may be anif_then_else(conditional jump), in which case either the second or the third operand must be(pc)(for the case which does not jump) and the other of the two must be alabel_ref(for the case which does jump).xmay also be amemor(plus:SI (pc) y), whereymay be aregor amem; these unusual patterns are used to represent jumps through branch tables.If
lvalis not(pc), the mode oflvalmust not beVOIDmodeand the mode ofxmust be valid for the mode oflval.lvalis customarily accessed with theSET_DESTmacro andxwith theSET_SRCmacro.(return)As the sole expression in a pattern, represents a return from the current function, on machines where this can be done with one instruction, such as VAXen. On machines where a multi-instruction ‘epilogue’ must be executed in order to return from the function, returning is done by jumping to a label which precedes the epilogue, and the
returnexpression code is never used.Inside an
if_then_elseexpression, represents the value to be placed inpcto return to the caller.Note that an insn pattern of
(return)is logically equivalent to(set (pc) (return)), but the latter form is never used.(simple_return)Like
(return), but truly represents only a function return, while(return)may represent an insn that also performs other functions of the function epilogue. Like(return), this may also occur in conditional jumps.(call function nargs)Represents a function call.
functionis amemexpression whose address is the address of the function to be called.nargsis an expression which can be used for two purposes: on some machines it represents the number of bytes of stack argument; on others, it represents the number of argument registers.Each machine has a standard machine mode which
functionmust have. The machine description defines macroFUNCTION_MODEto expand into the requisite mode name. The purpose of this mode is to specify what kind of addressing is allowed, on machines where the allowed kinds of addressing depend on the machine mode being addressed.(clobber x)Represents the storing or possible storing of an unpredictable, undescribed value into
x, which must be areg,scratch,parallelormemexpression.One place this is used is in string instructions that store standard values into particular hard registers. It may not be worth the trouble to describe the values that are stored, but it is essential to inform the compiler that the registers will be altered, lest it attempt to keep data in them across the string instruction.
If
xis(mem:BLK (const_int 0))or(mem:BLK (scratch)), it means that all memory locations must be presumed clobbered. Ifxis aparallel, it has the same meaning as aparallelin asetexpression.Note that the machine description classifies certain hard registers as ‘call-clobbered’. All function call instructions are assumed by default to clobber these registers, so there is no need to use
clobberexpressions to indicate this fact. Also, each function call is assumed to have the potential to alter any memory location, unless the function is declaredconst.If the last group of expressions in a
parallelare each aclobberexpression whose arguments areregormatch_scratch(see RTL Template) expressions, the combiner phase can add the appropriateclobberexpressions to an insn it has constructed when doing so will cause a pattern to be matched.This feature can be used, for example, on a machine that whose multiply and add instructions don’t use an MQ register but which has an add-accumulate instruction that does clobber the MQ register. Similarly, a combined instruction might require a temporary register while the constituent instructions might not.
When a
clobberexpression for a register appears inside aparallelwith other side effects, the register allocator guarantees that the register is unoccupied both before and after that insn if it is a hard register clobber. For pseudo-register clobber, the register allocator and the reload pass do not assign the same hard register to the clobber and the input operands if there is an insn alternative containing the&constraint (see Constraint Modifier Characters) for the clobber and the hard register is in register classes of the clobber in the alternative. You can clobber either a specific hard register, a pseudo register, or ascratchexpression; in the latter two cases, GCC will allocate a hard register that is available there for use as a temporary.For instructions that require a temporary register, you should use
scratchinstead of a pseudo-register because this will allow the combiner phase to add theclobberwhen required. You do this by coding (clobber(match_scratch…)). If you do clobber a pseudo register, use one which appears nowhere else—generate a new one each time. Otherwise, you may confuse CSE.There is one other known use for clobbering a pseudo register in a
parallel: when one of the input operands of the insn is also clobbered by the insn. In this case, using the same pseudo register in the clobber and elsewhere in the insn produces the expected results.(use x)Represents the use of the value of
x. It indicates that the value inxat this point in the program is needed, even though it may not be apparent why this is so. Therefore, the compiler will not attempt to delete previous instructions whose only effect is to store a value inx.xmust be aregexpression.In some situations, it may be tempting to add a
useof a register in aparallelto describe a situation where the value of a special register will modify the behavior of the instruction. A hypothetical example might be a pattern for an addition that can either wrap around or use saturating addition depending on the value of a special control register:(parallel [(set (reg:SI 2) (unspec:SI [(reg:SI 3) (reg:SI 4)] 0)) (use (reg:SI 1))])
This will not work, several of the optimizers only look at expressions locally; it is very likely that if you have multiple insns with identical inputs to the
unspec, they will be optimized away even if register 1 changes in between.This means that
usecan only be used to describe that the register is live. You should think twice before addingusestatements, more often you will want to useunspecinstead. TheuseRTX is most commonly useful to describe that a fixed register is implicitly used in an insn. It is also safe to use in patterns where the compiler knows for other reasons that the result of the whole pattern is variable, such ascpymemmorcallpatterns.During the reload phase, an insn that has a
useas pattern can carry a reg_equal note. Theseuseinsns will be deleted before the reload phase exits.During the delayed branch scheduling phase,
xmay be an insn. This indicates thatxpreviously was located at this place in the code and its data dependencies need to be taken into account. Theseuseinsns will be deleted before the delayed branch scheduling phase exits.(parallel [x0 x1 ...])Represents several side effects performed in parallel. The square brackets stand for a vector; the operand of
parallelis a vector of expressions.x0,x1and so on are individual side effect expressions—expressions of codeset,call,return,simple_return,clobberoruse.‘In parallel’ means that first all the values used in the individual side-effects are computed, and second all the actual side-effects are performed. For example,
(parallel [(set (reg:SI 1) (mem:SI (reg:SI 1))) (set (mem:SI (reg:SI 1)) (reg:SI 1))])
says unambiguously that the values of hard register 1 and the memory location addressed by it are interchanged. In both places where
(reg:SI 1)appears as a memory address it refers to the value in register 1 before the execution of the insn.It follows that it is incorrect to use
paralleland expect the result of onesetto be available for the next one. For example, people sometimes attempt to represent a jump-if-zero instruction this way:(parallel [(set (reg:CC CC_REG) (reg:SI 34)) (set (pc) (if_then_else (eq (reg:CC CC_REG) (const_int 0)) (label_ref ...) (pc)))])
But this is incorrect, because it says that the jump condition depends on the condition code value before this instruction, not on the new value that is set by this instruction.
Peephole optimization, which takes place together with final assembly code output, can produce insns whose patterns consist of a
parallelwhose elements are the operands needed to output the resulting assembler code—oftenreg,memor constant expressions. This would not be well-formed RTL at any other stage in compilation, but it is OK then because no further optimization remains to be done.(cond_exec [cond expr])Represents a conditionally executed expression. The
expris executed only if thecondis nonzero. Thecondexpression must not have side-effects, but theexprmay very well have side-effects.(sequence [insns ...])Represents a sequence of insns. If a
sequenceappears in the chain of insns, then each of theinsnsthat appears in the sequence must be suitable for appearing in the chain of insns, i.e. must satisfy theINSN_Ppredicate.After delay-slot scheduling is completed, an insn and all the insns that reside in its delay slots are grouped together into a
sequence. The insn requiring the delay slot is the first insn in the vector; subsequent insns are to be placed in the delay slot.INSN_ANNULLED_BRANCH_Pis set on an insn in a delay slot to indicate that a branch insn should be used that will conditionally annul the effect of the insns in the delay slots. In such a case,INSN_FROM_TARGET_Pindicates that the insn is from the target of the branch and should be executed only if the branch is taken; otherwise the insn should be executed only if the branch is not taken. See Delay Slot Scheduling.Some back ends also use
sequenceobjects for purposes other than delay-slot groups. This is not supported in the common parts of the compiler, which treat such sequences as delay-slot groups.DWARF2 Call Frame Address (CFA) adjustments are sometimes also expressed using
sequenceobjects as the value of aRTX_FRAME_RELATED_Pnote. This only happens if the CFA adjustments cannot be easily derived from the pattern of the instruction to which the note is attached. In such cases, the value of the note is used instead of best-guesing the semantics of the instruction. The back end can attach notes containing asequenceofsetpatterns that express the effect of the parent instruction.These expression codes appear in place of a side effect, as the body of an insn, though strictly speaking they do not always describe side effects as such:
(asm_input s)Represents literal assembler code as described by the string
s.(unspec [operands ...] index)(unspec_volatile [operands ...] index)Represents a machine-specific operation on
operands.indexselects between multiple machine-specific operations.unspec_volatileis used for volatile operations and operations that may trap;unspecis used for other operations.These codes may appear inside a
patternof an insn, inside aparallel, or inside an expression.(addr_vec:m [lr0 lr1 ...])Represents a table of jump addresses. The vector elements
lr0, etc., arelabel_refexpressions. The modemspecifies how much space is given to each address; normallymwould bePmode.(addr_diff_vec:m base [lr0 lr1 ...] min max flags)Represents a table of jump addresses expressed as offsets from
base. The vector elementslr0, etc., arelabel_refexpressions and so isbase. The modemspecifies how much space is given to each address-difference.minandmaxare set up by branch shortening and hold a label with a minimum and a maximum address, respectively.flagsindicates the relative position ofbase,minandmaxto the containing insn and ofminandmaxtobase. See rtl.def for details.(prefetch:m addr rw locality)Represents prefetch of memory at address
addr. Operandrwis 1 if the prefetch is for data to be written, 0 otherwise; targets that do not support write prefetches should treat this as a normal prefetch. Operandlocalityspecifies the amount of temporal locality; 0 if there is none or 1, 2, or 3 for increasing levels of temporal locality; targets that do not support locality hints should ignore this.This insn is used to minimize cache-miss latency by moving data into a cache before it is accessed. It should use only non-faulting data prefetch instructions.