ARC Options#
The following options control the architecture variant for which code is being compiled:
- -mbarrel-shifter#
Generate instructions supported by barrel shifter. This is the default unless -mcpu=ARC601 or -mcpu=ARCEM is in effect.
- -mjli-always#
Force function calls to use the jli_s instruction. This option is valid only for the ARCv2 architecture.
- -mcpu=cpu#
Set architecture type, register usage, and instruction scheduling parameters for cpu. There are also shortcut alias options available for backward compatibility and convenience. Supported values for cpu are:
  - arc600: Compile for ARC600. Aliases: -mA6, -mARC600.
  - arc601: Compile for ARC601. Alias: -mARC601.
  - arc700: Compile for ARC700. Aliases: -mA7, -mARC700. This is the default when configured with --with-cpu=arc700.
  - arcem: Compile for ARC EM.
  - archs: Compile for ARC HS.
  - em: Compile for ARC EM CPU with no hardware extensions.
  - em4: Compile for ARC EM4 CPU.
  - em4_dmips: Compile for ARC EM4 DMIPS CPU.
  - em4_fpus: Compile for ARC EM4 DMIPS CPU with the single-precision floating-point extension.
  - em4_fpuda: Compile for ARC EM4 DMIPS CPU with single-precision floating-point and double assist instructions.
  - hs: Compile for ARC HS CPU with no hardware extensions except the atomic instructions.
  - hs34: Compile for ARC HS34 CPU.
  - hs38: Compile for ARC HS38 CPU.
  - hs38_linux: Compile for ARC HS38 CPU with all hardware extensions on.
  - hs4x: Compile for ARC HS4x CPU.
  - hs4xd: Compile for ARC HS4xD CPU.
  - hs4x_rel31: Compile for ARC HS4x CPU release 3.10a.
  - arc600_norm: Compile for ARC 600 CPU with norm instructions enabled.
  - arc600_mul32x16: Compile for ARC 600 CPU with norm and 32x16-bit multiply instructions enabled.
  - arc600_mul64: Compile for ARC 600 CPU with norm and mul64-family instructions enabled.
  - arc601_norm: Compile for ARC 601 CPU with norm instructions enabled.
  - arc601_mul32x16: Compile for ARC 601 CPU with norm and 32x16-bit multiply instructions enabled.
  - arc601_mul64: Compile for ARC 601 CPU with norm and mul64-family instructions enabled.
  - nps400: Compile for ARC 700 on NPS400 chip.
  - em_mini: Compile for ARC EM minimalist configuration featuring reduced register set.
- -mdpfp, -mdpfp-compact#
Generate double-precision FPX instructions, tuned for the compact implementation.
- -mdpfp-fast#
Generate double-precision FPX instructions, tuned for the fast implementation.
- -mno-dpfp-lrsr#
Disable lr and sr instructions from using FPX extension aux registers.
- -mea#
Generate extended arithmetic instructions. Currently only divaw, adds, subs, and sat16 are supported. Only valid for -mcpu=ARC700.
- -mno-mpy#
Do not generate mpy-family instructions for ARC700. This option is deprecated.
- -mmul32x16#
Generate 32x16-bit multiply and multiply-accumulate instructions.
- -mmul64#
Generate mul64 and mulu64 instructions. Only valid for -mcpu=ARC600.
- -mnorm#
Generate norm instructions. This is the default if -mcpu=ARC700 is in effect.
- -mspfp, -mspfp-compact#
Generate single-precision FPX instructions, tuned for the compact implementation.
- -mspfp-fast#
Generate single-precision FPX instructions, tuned for the fast implementation.
- -msimd#
Enable generation of ARC SIMD instructions via target-specific builtins. Only valid for
-mcpu=ARC700.
- -msoft-float#
This option is ignored; it is provided for compatibility purposes only. Software floating-point code is emitted by default, and this default can be overridden by FPX options: -mspfp, -mspfp-compact, or -mspfp-fast for single precision, and -mdpfp, -mdpfp-compact, or -mdpfp-fast for double precision.
- -mswap#
Generate swap instructions.
- -matomic#
This enables use of the locked load/store conditional extension to implement atomic memory built-in functions. Not available for ARC 6xx or ARC EM cores.
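As an illustration of the kind of source code this option affects, the sketch below maintains a shared counter with GCC's generic `__atomic` built-in functions. The helper names `counter_bump` and `counter_read` are hypothetical. How the built-ins are expanded (inline locked load/store conditional sequences or something else) depends on the target and on whether -matomic is in effect; the C source is the same either way.

```c
#include <stdint.h>

/* Hypothetical helper: atomically add 1 to *counter and return the
   value it held before the addition. */
uint32_t counter_bump(uint32_t *counter)
{
    return __atomic_fetch_add(counter, 1u, __ATOMIC_SEQ_CST);
}

/* Hypothetical helper: atomically read the current counter value. */
uint32_t counter_read(uint32_t *counter)
{
    return __atomic_load_n(counter, __ATOMIC_SEQ_CST);
}
```

Sequentially consistent ordering is used here only because it is the simplest to reason about; a weaker memory order may be enough for a plain statistics counter.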
- -mdiv-rem#
Enable div and rem instructions for ARCv2 cores.
- -mcode-density#
Enable code density instructions for ARC EM. This option is on by default for ARC HS.
- -mll64#
Enable double load/store operations for ARC HS cores.
- -mtp-regno=regno#
Specify thread pointer register number.
- -mmpy-option=multo#
Compile ARCv2 code with a multiplier design option. You can specify the option using either a string or numeric value for multo. wlh1 is the default value. The recognized values are:
  - 0, none: No multiplier available.
  - 1, w: 16x16 multiplier, fully pipelined. The following instructions are enabled: mpyw and mpyuw.
  - 2, wlh1: 32x32 multiplier, fully pipelined (1 stage). The following instructions are additionally enabled: mpy, mpyu, mpym, mpymu, and mpy_s.
  - 3, wlh2: 32x32 multiplier, fully pipelined (2 stages). The following instructions are additionally enabled: mpy, mpyu, mpym, mpymu, and mpy_s.
  - 4, wlh3: Two 16x16 multipliers, blocking, sequential. The following instructions are additionally enabled: mpy, mpyu, mpym, mpymu, and mpy_s.
  - 5, wlh4: One 16x16 multiplier, blocking, sequential. The following instructions are additionally enabled: mpy, mpyu, mpym, mpymu, and mpy_s.
  - 6, wlh5: One 32x4 multiplier, blocking, sequential. The following instructions are additionally enabled: mpy, mpyu, mpym, mpymu, and mpy_s.
  - 7, plus_dmpy: ARC HS SIMD support.
  - 8, plus_macd: ARC HS SIMD support.
  - 9, plus_qmacw: ARC HS SIMD support.
This option is only available for ARCv2 cores.
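Because each multo value has both a numeric and a string spelling, a build script or wrapper tool may want to translate between the two. The following is a minimal sketch of such a lookup, using only the pairs listed for -mmpy-option. The helper names `mpy_option_number` and `mpy_option_name` are hypothetical and are not part of GCC.

```c
#include <stddef.h>
#include <string.h>

/* String spellings of the -mmpy-option values, indexed by their
   numeric equivalents 0..9. */
static const char *const mpy_option_names[] = {
    "none", "w", "wlh1", "wlh2", "wlh3",
    "wlh4", "wlh5", "plus_dmpy", "plus_macd", "plus_qmacw"
};

/* Hypothetical helper: return the numeric value for a string spelling,
   or -1 if the spelling is not recognized. */
int mpy_option_number(const char *name)
{
    size_t count = sizeof mpy_option_names / sizeof mpy_option_names[0];
    for (size_t i = 0; i < count; i++) {
        if (strcmp(name, mpy_option_names[i]) == 0)
            return (int)i;
    }
    return -1;
}

/* Hypothetical helper: return the string spelling for a numeric value,
   or NULL if the value is outside 0..9. */
const char *mpy_option_name(int number)
{
    if (number < 0 || number > 9)
        return NULL;
    return mpy_option_names[number];
}
```

With this table, -mmpy-option=2 and -mmpy-option=wlh1 map to the same entry, which is also the documented default.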
- -mfpu=fpu#
Enables support for specific floating-point hardware extensions for ARCv2 cores. Supported values for fpu are:
  - fpus: Enables support for single-precision floating-point hardware extensions.
  - fpud: Enables support for double-precision floating-point hardware extensions. The single-precision floating-point extension is also enabled. Not available for ARC EM.
  - fpuda: Enables support for double-precision floating-point hardware extensions using double-precision assist instructions. The single-precision floating-point extension is also enabled. This option is only available for ARC EM.
  - fpuda_div: Enables support for double-precision floating-point hardware extensions using double-precision assist instructions. The single-precision floating-point, square-root, and divide extensions are also enabled. This option is only available for ARC EM.
  - fpuda_fma: Enables support for double-precision floating-point hardware extensions using double-precision assist instructions. The single-precision floating-point and fused multiply and add hardware extensions are also enabled. This option is only available for ARC EM.
  - fpuda_all: Enables support for double-precision floating-point hardware extensions using double-precision assist instructions. All single-precision floating-point hardware extensions are also enabled. This option is only available for ARC EM.
  - fpus_div: Enables support for single-precision floating-point, square-root and divide hardware extensions.
  - fpud_div: Enables support for double-precision floating-point, square-root and divide hardware extensions. This option includes option fpus_div. Not available for ARC EM.
  - fpus_fma: Enables support for single-precision floating-point and fused multiply and add hardware extensions.
  - fpud_fma: Enables support for double-precision floating-point and fused multiply and add hardware extensions. This option includes option fpus_fma. Not available for ARC EM.
  - fpus_all: Enables support for all single-precision floating-point hardware extensions.
  - fpud_all: Enables support for all single- and double-precision floating-point hardware extensions. Not available for ARC EM.
- -mirq-ctrl-saved=register-range,blink,lp_count#
Specifies general-purpose registers that the processor automatically saves/restores on interrupt entry and exit. register-range is specified as two registers separated by a dash. The register range always starts with r0; the upper limit is the fp register. blink and lp_count are optional. This option is only valid for ARC EM and ARC HS cores.
- -mrgf-banked-regs=number#
Specifies the number of registers replicated in the second register bank on entry to a fast interrupt. Fast interrupts are interrupts with the highest priority level P0. These interrupts save only PC and STATUS32 registers to avoid memory transactions during interrupt entry and exit sequences. Use this option when you are using fast interrupts in an ARCv2 family processor. Permitted values are 4, 8, 16, and 32.
- -mlpc-width=width#
Specify the width of the lp_count register. Valid values for width are 8, 16, 20, 24, 28 and 32 bits. The default width is fixed to 32 bits. If the width is less than 32, the compiler does not attempt to transform loops in your program to use the zero-delay loop mechanism unless it is known that the lp_count register can hold the required loop-counter value. Depending on the width specified, the compiler and run-time library might continue to use the loop mechanism for various needs. This option defines macro __ARC_LPC_WIDTH__ with the value of width.
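The "can hold the required loop-counter value" condition can be pictured as a simple range check. The sketch below is an illustration only and does not reproduce the compiler's actual analysis. It assumes an unsigned lp_count of width bits holds counts from 0 to 2^width - 1. The names `LPC_WIDTH` and `lp_count_can_hold` are hypothetical, and the fallback to 32 exists only so that the code also compiles when __ARC_LPC_WIDTH__ is not defined.

```c
#include <stdint.h>

/* Use the macro defined by -mlpc-width=width when present; otherwise
   fall back to the documented default of 32 bits (assumption made so
   the sketch builds for any target). */
#ifdef __ARC_LPC_WIDTH__
#define LPC_WIDTH __ARC_LPC_WIDTH__
#else
#define LPC_WIDTH 32
#endif

/* Hypothetical helper: nonzero if an unsigned register of `width` bits
   can represent `trip_count`. */
int lp_count_can_hold(uint64_t trip_count, unsigned width)
{
    if (width >= 64)
        return 1;                       /* every uint64_t value fits */
    return trip_count < (UINT64_C(1) << width);
}
```

For example, with -mlpc-width=8 a loop known to run 200 times passes this check, while one that may run 1000 times does not.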
- -mrf16#
This option instructs the compiler to generate code for a 16-entry register file. This option defines the __ARC_RF16__ preprocessor macro.
- -mbranch-index#
Enable use of bi or bih instructions to implement jump tables.
The following options are passed through to the assembler, and also define preprocessor macro symbols.
- -mdsp-packa#
Passed down to the assembler to enable the DSP Pack A extensions. Also sets the preprocessor symbol
__Xdsp_packa. This option is deprecated.
- -mdvbf#
Passed down to the assembler to enable the dual Viterbi butterfly extension. Also sets the preprocessor symbol
__Xdvbf. This option is deprecated.
- -mlock#
Passed down to the assembler to enable the locked load/store conditional extension. Also sets the preprocessor symbol
__Xlock.
- -mmac-d16#
Passed down to the assembler. Also sets the preprocessor symbol
__Xxmac_d16. This option is deprecated.
- -mmac-24#
Passed down to the assembler. Also sets the preprocessor symbol
__Xxmac_24. This option is deprecated.
- -mrtsc#
Passed down to the assembler to enable the 64-bit time-stamp counter extension instruction. Also sets the preprocessor symbol
__Xrtsc. This option is deprecated.
- -mswape#
Passed down to the assembler to enable the swap byte ordering extension instruction. Also sets the preprocessor symbol
__Xswape.
- -mtelephony#
Passed down to the assembler to enable dual- and single-operand instructions for telephony. Also sets the preprocessor symbol
__Xtelephony. This option is deprecated.
- -mxy#
Passed down to the assembler to enable the XY memory extension. Also sets the preprocessor symbol
__Xxy.
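Source code can test the preprocessor symbols set by these pass-through options to select extension-specific paths at compile time. The following is a minimal sketch that checks three of the non-deprecated symbols (__Xlock, __Xswape, __Xxy). The function name `arc_extension_mask` and the `ARC_EXT_*` constants are hypothetical. When none of the options is given, or when building for a different target, the function returns 0.

```c
/* Hypothetical bit flags, one per extension checked below. */
enum {
    ARC_EXT_LOCK  = 1,  /* -mlock  defines __Xlock  */
    ARC_EXT_SWAPE = 2,  /* -mswape defines __Xswape */
    ARC_EXT_XY    = 4   /* -mxy    defines __Xxy    */
};

/* Hypothetical helper: report which of the pass-through options were
   in effect when this translation unit was compiled. */
unsigned arc_extension_mask(void)
{
    unsigned mask = 0;
#ifdef __Xlock
    mask |= ARC_EXT_LOCK;
#endif
#ifdef __Xswape
    mask |= ARC_EXT_SWAPE;
#endif
#ifdef __Xxy
    mask |= ARC_EXT_XY;
#endif
    return mask;
}
```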
The following options control how the assembly code is annotated:
- -misize#
Annotate assembler instructions with estimated addresses.
- -mannotate-align#
Explain what alignment considerations lead to the decision to make an instruction short or long.
The following options are passed through to the linker:
- -marclinux#
Passed through to the linker, to specify use of the arclinux emulation. This option is enabled by default in tool chains built for arc-linux-uclibc and arceb-linux-uclibc targets when profiling is not requested.
- -marclinux_prof#
Passed through to the linker, to specify use of the arclinux_prof emulation. This option is enabled by default in tool chains built for arc-linux-uclibc and arceb-linux-uclibc targets when profiling is requested.
The following options control the semantics of generated code:
- -mlong-calls#
Generate calls as register indirect calls, thus providing access to the full 32-bit address range.
- -mmedium-calls#
Don’t use less than 25-bit addressing range for calls, which is the offset available for an unconditional branch-and-link instruction. Conditional execution of function calls is suppressed, to allow use of the 25-bit range, rather than the 21-bit range with conditional branch-and-link. This is the default for tool chains built for arc-linux-uclibc and arceb-linux-uclibc targets.
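To make the difference between the two ranges concrete, the sketch below checks whether a call offset fits in each of them. It assumes the 25-bit and 21-bit ranges are signed byte offsets and ignores any alignment requirement on the target address. All function names are hypothetical.

```c
#include <stdint.h>

/* Hypothetical helper: nonzero if `offset` fits in a signed field of
   `bits` bits, that is -2^(bits-1) <= offset < 2^(bits-1).
   Intended for 1 <= bits <= 63. */
int fits_signed_range(int64_t offset, unsigned bits)
{
    int64_t limit = INT64_C(1) << (bits - 1);
    return offset >= -limit && offset < limit;
}

/* Unconditional branch-and-link: 25-bit range. */
int reachable_unconditional(int64_t offset)
{
    return fits_signed_range(offset, 25);
}

/* Conditional branch-and-link: 21-bit range. */
int reachable_conditional(int64_t offset)
{
    return fits_signed_range(offset, 21);
}
```

Under these assumptions a callee 2 MiB away is out of reach for a conditional call but well within reach of an unconditional one, which is the situation -mmedium-calls is meant to handle.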
- -G num#
Put definitions of externally-visible data in a small data section if that data is no bigger than num bytes. The default value of num is 4 for any ARC configuration, or 8 when double load/store operations are available.
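The size rule for -G can be sketched as a comparison against the threshold. The code below models only the "no bigger than num bytes" test and the two documented defaults; it does not model any other condition the compiler may apply. The function names are hypothetical.

```c
#include <stddef.h>

/* Hypothetical helper: default value of num, 8 when double load/store
   operations are available and 4 otherwise. */
size_t default_sdata_threshold(int has_double_load_store)
{
    return has_double_load_store ? 8 : 4;
}

/* Hypothetical helper: nonzero if an object of `size` bytes is no
   bigger than the threshold `num`. */
int qualifies_for_sdata(size_t size, size_t num)
{
    return size <= num;
}
```

For instance, an 8-byte `long long` exceeds the default threshold of 4 but meets the threshold of 8 that applies when double load/store operations are available.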
- -mno-sdata#
Do not generate sdata references. This is the default for tool chains built for arc-linux-uclibc and arceb-linux-uclibc targets.
- -msdata#
Default setting; overrides
-mno-sdata.
- -mvolatile-cache#
Use ordinarily cached memory accesses for volatile references. This is the default.
- -mno-volatile-cache#
Enable cache bypass for volatile references.
- -mvolatile-cache#
Default setting; overrides
-mno-volatile-cache.
The following options fine tune code generation:
- -malign-call#
Does nothing. Preserved for backward compatibility.
- -mauto-modify-reg#
Enable the use of pre/post modify with register displacement.
- -mbbit-peephole#
Enable bbit peephole2.
- -mno-brcc#
This option disables a target-specific pass in arc_reorg to generate compare-and-branch (brcc) instructions. It has no effect on generation of these instructions driven by the combiner pass.
- -mcase-vector-pcrel#
Use PC-relative switch case tables to enable case table shortening. This is the default for
-Os.
- -mcompact-casesi#
Enable compact casesi pattern. This is the default for -Os, and only available for ARCv1 cores. This option is deprecated.
- -mno-cond-exec#
Disable the ARCompact-specific pass to generate conditional execution instructions.
Due to delay slot scheduling and interactions between operand numbers, literal sizes, instruction lengths, and the support for conditional execution, the target-independent pass to generate conditional execution is often lacking, so the ARC port has kept a special pass around that tries to find more conditional execution generation opportunities after register allocation, branch shortening, and delay slot scheduling have been done. This pass generally, but not always, improves performance and code size, at the cost of extra compilation time, which is why there is an option to switch it off. If you have a problem with call instructions exceeding their allowable offset range because they are conditionalized, you should consider using
-mmedium-calls instead.
- -mearly-cbranchsi#
Enable pre-reload use of the cbranchsi pattern.
- -mexpand-adddi#
Expand adddi3 and subdi3 at RTL generation time into add.f, adc etc. This option is deprecated.
- -mindexed-loads#
Enable the use of indexed loads. This can be problematic because some optimizers then assume that indexed stores exist, which is not the case.
- -mlra#
Enable Local Register Allocation. This is still experimental for ARC, so by default the compiler uses standard reload (i.e.
-mno-lra).
- -mlra-priority-none#
Don’t indicate any priority for target registers.
- -mlra-priority-compact#
Indicate target register priority for
r0..r3/r12..r15.
- -mlra-priority-noncompact#
Reduce target register priority for
r0..r3/r12..r15.
- -mmillicode#
When optimizing for size (using -Os), prologues and epilogues that have to save or restore a large number of registers are often shortened by using a call to a special function in libgcc; this is referred to as a millicode call. As these calls can pose performance issues, and/or cause linking issues when linking in a nonstandard way, this option is provided to turn on or off millicode call generation.
- -mcode-density-frame#
This option enables the compiler to emit enter and leave instructions. These instructions are only valid for CPUs with the code-density feature.
- -mmixed-code#
Does nothing. Preserved for backward compatibility.
- -mq-class#
This option is deprecated. Enable q instruction alternatives. This is the default for -Os.
- -mRcq#
Does nothing. Preserved for backward compatibility.
- -mRcw#
Does nothing. Preserved for backward compatibility.
- -msize-level=level#
Fine-tune size optimization with regards to instruction lengths and alignment. The recognized values for level are:
  - 0: No size optimization. This level is deprecated and treated like 1.
  - 1: Short instructions are used opportunistically.
  - 2: In addition, alignment of loops and of code after barriers is dropped.
  - 3: In addition, optional data alignment is dropped, and the option -Os is enabled.
This defaults to 3 when -Os is in effect. Otherwise, the behavior when this is not set is equivalent to level 1.
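The defaulting rules for -msize-level can be summarized as a small decision function. This is a sketch of the rules as stated for this option, not the compiler's own code. The sentinel `SIZE_LEVEL_UNSET` and the function `effective_size_level` are hypothetical names.

```c
/* Hypothetical sentinel meaning that -msize-level was not given. */
#define SIZE_LEVEL_UNSET (-1)

/* Hypothetical helper: return the size level that takes effect.
   `requested` is 0..3 or SIZE_LEVEL_UNSET; `optimizing_for_size` is
   nonzero when -Os is in effect. */
int effective_size_level(int requested, int optimizing_for_size)
{
    if (requested == SIZE_LEVEL_UNSET)
        return optimizing_for_size ? 3 : 1;  /* documented defaults */
    if (requested == 0)
        return 1;  /* level 0 is deprecated and treated like 1 */
    return requested;
}
```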
- -mtune=cpu#
Set instruction scheduling parameters for cpu, overriding any implied by -mcpu=. Supported values for cpu are:
  - ARC600: Tune for ARC600 CPU.
  - ARC601: Tune for ARC601 CPU.
  - ARC700: Tune for ARC700 CPU with standard multiplier block.
  - ARC700-xmac: Tune for ARC700 CPU with XMAC block.
  - ARC725D: Tune for ARC725D CPU.
  - ARC750D: Tune for ARC750D CPU.
  - core3: Tune for ARCv2 core3 type CPU. This option enables use of the dbnz instruction.
  - release31a: Tune for ARC4x release 3.10a.
- -mmultcost=num#
Cost to assume for a multiply instruction, with 4 being equal to a normal instruction.
- -munalign-prob-threshold=probability#
Does nothing. Preserved for backward compatibility.
The following options are maintained for backward compatibility, but are now deprecated and will be removed in a future release:
- -margonaut#
Obsolete FPX.
- -mbig-endian, -EB#
Compile code for big-endian targets. Use of these options is now deprecated. Big-endian code is supported by configuring GCC to build arceb-elf32 and arceb-linux-uclibc targets, for which big endian is the default.
- -mlittle-endian, -EL#
Compile code for little-endian targets. Use of these options is now deprecated. Little-endian code is supported by configuring GCC to build arc-elf32 and arc-linux-uclibc targets, for which little endian is the default.
- -mbarrel_shifter#
Replaced by
-mbarrel-shifter.
- -mdpfp_compact#
Replaced by
-mdpfp-compact.
- -mdpfp_fast#
Replaced by
-mdpfp-fast.
- -mdsp_packa#
Replaced by
-mdsp-packa.
- -mspfp_compact#
Replaced by
-mspfp-compact.
- -mspfp_fast#
Replaced by
-mspfp-fast.
- -mtune=cpu#
Values arc600, arc601, arc700 and arc700-xmac for cpu are replaced by ARC600, ARC601, ARC700 and ARC700-xmac respectively.
- -multcost=num#
Replaced by
-mmultcost.