Re: [PATCH 13/14] doc, x86: Clean up x86 options documentation [PR122243]

Uros Bizjak Sun, 11 Jan 2026 23:24:21 -0800

On Mon, Jan 12, 2026 at 6:10 AM Hongtao Liu <[email protected]> wrote:
>
> On Fri, Jan 9, 2026 at 3:45 AM Sandra Loosemore <[email protected]> 
> wrote:
> >
> > Besides the usual fixes in this series to make the options summary
> > agree with the options listed in the detailed documentation and add
> > missing @opindex entries, I decided it was not very helpful to users
> > to have dozens of ISA extension options documented as a group spanning
> > multiple pages in the manual.  I broke that up so each of those
> > options is described separately, using the documentation string from
> > the .opt file.
>
> The patch looks fine to me, but please wait a few days before
> committing in case Uros has some comments.


Please go ahead, LGTM too, a much needed cleanup.

Thanks,
Uros.

>
> >
> > gcc/ChangeLog
> >         PR other/122243
> >         * config/i386/i386.opt (malign-functions): Mark undocumented/unused
> >         option as Undocumented.
> >         (malign-jumps): Likewise.
> >         (malign-loops): Likewise.
> >         (mbranch-cost, mforce-drap): Mark undocumented options likely
> >         intended for developer use only as Undocumented.
> >         (mstv): Correct sense of option in doc string.
> >         (mavx512cd): Remove extra "and" from doc string.
> >         (mavx512dq): Likewise.
> >         (mavx512bw): Likewise.
> >         (mavx512vl): Likewise.
> >         (mavx512ifma): Likewise.
> >         (mavx512bvmi): Likewise.
> >         * gcc/doc/invoke.texi (Options Summary) <x86 Options>: Add
> >         missing options.  Correct whitespace and re-wrap long lines.
> >         Remove -mthreads which is now classed as a MinGW option.
> >         (Cygwin and MinGW Options): Replace existing documentation of
> >         -mthreads with the more detailed text moved from x86 Options.
> >         (x86 Options): Move introductory text about ISA extensions before
> >         the individual options instead of after.  Document them all
> >         individually instead of as a group, and move immediately after
> >         -march/-mtune documentation.  Rewrap long lines.  Document
> >         interaction between SSE and AVX with -mfpmath=sse.  Move -masm
> >         documentation farther down instead of grouped with options
> >         affecting floating-point behavior.  Add missing @opindex
> >         entries.  Rewrite the -mdaz-ftz documentation.  Document
> >         -mstack-arg-probe.  Copy-editing.  Document -mstv.  Remove
> >         obsolete warning about -mskip-rax-setup in very old GCC versions.
> >         Rewrite the -mapx-inline-asm-use-gpr32 documentation.
> >         Document -mgather and -mscatter.  Split -miamcu documentation
> >         from -m32/-m64/etc.  Rewrite -munroll-only-small-loops 
> > documentation.
> >         Document -mdispatch-scheduler.
> > ---
> >  gcc/config/i386/i386.opt |   34 +-
> >  gcc/doc/invoke.texi      | 1232 ++++++++++++++++++++++++--------------
> >  2 files changed, 793 insertions(+), 473 deletions(-)
> >
> > diff --git a/gcc/config/i386/i386.opt b/gcc/config/i386/i386.opt
> > index 99bb674812b..4942f512417 100644
> > --- a/gcc/config/i386/i386.opt
> > +++ b/gcc/config/i386/i386.opt
> > @@ -225,16 +225,19 @@ malign-double
> >  Target Mask(ALIGN_DOUBLE) Save
> >  Align some doubles on dword boundary.
> >
> > +; Does nothing.
> >  malign-functions=
> > -Target RejectNegative Joined UInteger
> > +Target Undocumented RejectNegative Joined UInteger
> >  Function starts are aligned to this power of 2.
> >
> > +; Does nothing.
> >  malign-jumps=
> > -Target RejectNegative Joined UInteger
> > +Target Undocumented RejectNegative Joined UInteger
> >  Jump targets are aligned to this power of 2.
> >
> > +; Does nothing.
> >  malign-loops=
> > -Target RejectNegative Joined UInteger
> > +Target Undocumented RejectNegative Joined UInteger
> >  Loop code aligned to this power of 2.
> >
> >  malign-stringops
> > @@ -277,7 +280,7 @@ EnumValue
> >  Enum(asm_dialect) String(att) Value(ASM_ATT)
> >
> >  mbranch-cost=
> > -Target RejectNegative Joined UInteger Var(ix86_branch_cost) 
> > IntegerRange(0, 5)
> > +Target Undocumented RejectNegative Joined UInteger Var(ix86_branch_cost) 
> > IntegerRange(0, 5)
> >  Branches are this expensive (arbitrary units).
> >
> >  mlarge-data-threshold=
> > @@ -328,8 +331,13 @@ mfancy-math-387
> >  Target RejectNegative InverseMask(NO_FANCY_MATH_387, USE_FANCY_MATH_387) 
> > Save
> >  Generate sin, cos, sqrt for FPU.
> >
> > +; This option seems deliberately undocumented as other options added in
> > +; the same commit were properly documented in the manual.
> > +; DRAP is usually used only in functions that do dynamic stack
> > +; allocation (e.g. alloca), this makes it happen everywhere.  Maybe it
> > +; was intended for debugging?
> >  mforce-drap
> > -Target Var(ix86_force_drap)
> > +Target Undocumented Var(ix86_force_drap)
> >  Always use Dynamic Realigned Argument Pointer (DRAP) to realign stack.
> >
> >  mfp-ret-in-387
> > @@ -599,8 +607,8 @@ the function.
> >
> >  mstv
> >  Target Mask(STV) Save
> > -Disable Scalar to Vector optimization pass transforming 64-bit integer
> > -computations into a vector ones.
> > +Enable Scalar to Vector optimization pass transforming 64-bit integer
> > +computations into vector ones.
> >
> >  -param=x86-stv-max-visits=
> >  Target Joined UInteger Var(x86_stv_max_visits) Init(10000) IntegerRange(1, 
> > 1000000) Param
> > @@ -726,27 +734,27 @@ Support MMX, SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, 
> > AVX, AVX2 and AVX512F built
> >
> >  mavx512cd
> >  Target Mask(ISA_AVX512CD) Var(ix86_isa_flags) Save
> > -Support MMX, SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2 and AVX512F 
> > and AVX512CD built-in functions and code generation.
> > +Support MMX, SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2, AVX512F 
> > and AVX512CD built-in functions and code generation.
> >
> >  mavx512dq
> >  Target Mask(ISA_AVX512DQ) Var(ix86_isa_flags) Save
> > -Support MMX, SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2 and AVX512F 
> > and AVX512DQ built-in functions and code generation.
> > +Support MMX, SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2, AVX512F 
> > and AVX512DQ built-in functions and code generation.
> >
> >  mavx512bw
> >  Target Mask(ISA_AVX512BW) Var(ix86_isa_flags) Save
> > -Support MMX, SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2 and AVX512F 
> > and AVX512BW built-in functions and code generation.
> > +Support MMX, SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2, AVX512F 
> > and AVX512BW built-in functions and code generation.
> >
> >  mavx512vl
> >  Target Mask(ISA_AVX512VL) Var(ix86_isa_flags) Save
> > -Support MMX, SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2 and AVX512F 
> > and AVX512VL built-in functions and code generation.
> > +Support MMX, SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2, AVX512F 
> > and AVX512VL built-in functions and code generation.
> >
> >  mavx512ifma
> >  Target Mask(ISA_AVX512IFMA) Var(ix86_isa_flags) Save
> > -Support MMX, SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2 and AVX512F 
> > and AVX512IFMA built-in functions and code generation.
> > +Support MMX, SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2, AVX512F 
> > and AVX512IFMA built-in functions and code generation.
> >
> >  mavx512vbmi
> >  Target Mask(ISA_AVX512VBMI) Var(ix86_isa_flags) Save
> > -Support MMX, SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2 and AVX512F 
> > and AVX512VBMI built-in functions and code generation.
> > +Support MMX, SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2, AVX512F 
> > and AVX512VBMI built-in functions and code generation.
> >
> >  mavx512vpopcntdq
> >  Target Mask(ISA_AVX512VPOPCNTDQ) Var(ix86_isa_flags) Save
> > diff --git a/gcc/doc/invoke.texi b/gcc/doc/invoke.texi
> > index 8de21d7b193..c48a73650cc 100644
> > --- a/gcc/doc/invoke.texi
> > +++ b/gcc/doc/invoke.texi
> > @@ -1503,15 +1503,15 @@ See RS/6000 and PowerPC Options.
> >  -mtune-ctrl=@var{feature-list}  -mdump-tune-features  -mno-default
> >  -mfpmath=@var{unit}
> >  -masm=@var{dialect}  -mno-fancy-math-387
> > --mno-fp-ret-in-387  -m80387  -mhard-float  -msoft-float
> > --mno-wide-multiply  -mrtd  -malign-double
> > +-mno-fp-ret-in-387  -m80387  -mhard-float  -msoft-float  -mieee-fp
> > +-mrtd  -malign-double
> >  -mpreferred-stack-boundary=@var{num}
> >  -mincoming-stack-boundary=@var{num}
> > --mcld  -mcx16  -msahf  -mmovbe  -mcrc32 -mmwait
> > +-mcld  -mcx16  -msahf  -mmovbe  -mcrc32  -mmwait
> >  -mrecip  -mrecip=@var{opt}
> > --mvzeroupper  -mprefer-avx128  -mprefer-vector-width=@var{opt}
> > +-mvzeroupper  -mstv  -mprefer-avx128  -mprefer-vector-width=@var{opt}
> >  -mpartial-vector-fp-math
> > --mmove-max=@var{bits} -mstore-max=@var{bits}
> > +-mmove-max=@var{bits}  -mstore-max=@var{bits}
> >  -mnoreturn-no-callee-saved-registers
> >  -mmmx  -msse  -msse2  -msse3  -mssse3  -msse4.1  -msse4.2  -msse4  -mavx
> >  -mavx2  -mavx512f  -mavx512cd  -mavx512vl
> > @@ -1520,40 +1520,47 @@ See RS/6000 and PowerPC Options.
> >  -mptwrite  -mclflushopt  -mclwb  -mxsavec  -mxsaves
> >  -msse4a  -m3dnow  -m3dnowa  -mpopcnt  -mabm  -mbmi  -mtbm  -mfma4  -mxop
> >  -madx  -mlzcnt  -mbmi2  -mfxsr  -mxsave  -mxsaveopt  -mrtm  -mhle  -mlwp
> > --mmwaitx  -mclzero  -mpku  -mthreads  -mgfni  -mvaes  -mwaitpkg
> > --mshstk -mmanual-endbr -mcet-switch -mforce-indirect-call
> > --mavx512vbmi2 -mavx512bf16 -menqcmd
> > +-mmwaitx  -mclzero  -mpku  -mgfni  -mvaes  -mwaitpkg
> > +-mshstk  -mmanual-endbr  -mcet-switch  -mforce-indirect-call
> > +-mavx512vbmi2  -mavx512bf16  -menqcmd
> >  -mvpclmulqdq  -mavx512bitalg  -mmovdiri  -mmovdir64b  -mavx512vpopcntdq
> >  -mavx512vnni  -mprfchw  -mrdpid
> > --mrdseed  -msgx -mavx512vp2intersect -mserialize -mtsxldtrk
> > --mamx-tile  -mamx-int8  -mamx-bf16 -muintr -mhreset -mavxvnni -mamx-fp8
> > --mavx512fp16 -mavxifma -mavxvnniint8 -mavxneconvert -mcmpccxadd -mamx-fp16
> > --mprefetchi -mraoint -mamx-complex -mavxvnniint16 -msm3 -msha512 -msm4 
> > -mapxf
> > --musermsr -mavx10.1 -mavx10.2 -mamx-avx512 -mamx-tf32 -mmovrs -mamx-movrs
> > --mavx512bmm -mcldemote -mms-bitfields -mno-align-stringops 
> > -minline-all-stringops
> > +-mrdseed  -msgx  -mavx512vp2intersect  -mserialize  -mtsxldtrk
> > +-mamx-tile  -mamx-int8  -mamx-bf16  -muintr  -mhreset
> > +-mavxvnni  -mamx-fp8  -mavx512fp16  -mavxifma  -mavxvnniint8
> > +-mavxneconvert  -mcmpccxadd  -mamx-fp16  -mprefetchi  -mraoint
> > +-mamx-complex  -mavxvnniint16  -msm3  -msha512  -msm4  -mapxf
> > +-musermsr  -mavx10.1  -mavx10.2  -mamx-avx512  -mamx-tf32  -mmovrs
> > +-mamx-movrs  -mavx512bmm  -mcldemote  -mms-bitfields
> > +-mno-align-stringops  -minline-all-stringops
> >  -minline-stringops-dynamically  -mstringop-strategy=@var{alg}
> > --mkl -mwidekl
> > +-mkl  -mwidekl
> >  -mmemcpy-strategy=@var{strategy}  -mmemset-strategy=@var{strategy}
> >  -mpush-args  -maccumulate-outgoing-args  -m128bit-long-double
> >  -m96bit-long-double  -mlong-double-64  -mlong-double-80  -mlong-double-128
> >  -mregparm=@var{num}  -msseregparm
> >  -mveclibabi=@var{type}  -mvect8-ret-in-mem
> > --mpc32  -mpc64  -mpc80  -mdaz-ftz -mstackrealign
> > +-mpc32  -mpc64  -mpc80  -mdaz-ftz  -mstackrealign  -mstack-arg-probe
> >  -momit-leaf-frame-pointer  -mno-red-zone  -mno-tls-direct-seg-refs
> >  -mcmodel=@var{code-model}  -mabi=@var{name}  -maddress-mode=@var{mode}
> >  -m32  -m64  -mx32  -m16  -miamcu  -mlarge-data-threshold=@var{num}
> >  -msse2avx  -mfentry  -mrecord-mcount  -mnop-mcount  -m8bit-idiv
> > --minstrument-return=@var{type} -mfentry-name=@var{name} 
> > -mfentry-section=@var{name}
> > +-minstrument-return=@var{type}  -mrecord-return
> > +-mfentry-name=@var{name}  -mfentry-section=@var{name}
> > +-mskip-rax-setup
> >  -mavx256-split-unaligned-load  -mavx256-split-unaligned-store
> >  -malign-data=@var{type}  -mstack-protector-guard=@var{guard}
> >  -mstack-protector-guard-reg=@var{reg}
> >  -mstack-protector-guard-offset=@var{offset}
> >  -mstack-protector-guard-symbol=@var{symbol}
> > --mgeneral-regs-only  -mcall-ms2sysv-xlogues -mrelax-cmpxchg-loop
> > +-mgeneral-regs-only  -mcall-ms2sysv-xlogues  -mtls-dialect=@var{type}
> > +-mrelax-cmpxchg-loop
> >  -mindirect-branch=@var{choice}  -mfunction-return=@var{choice}
> > --mindirect-branch-register -mharden-sls=@var{choice}
> > --mindirect-branch-cs-prefix -mneeded -mno-direct-extern-access
> > --munroll-only-small-loops -mlam=@var{choice}}
> > +-mindirect-branch-register  -mharden-sls=@var{choice}
> > +-mindirect-branch-cs-prefix  -mapx-inline-asm-use-gpr32
> > +-mgather  -mscatter
> > +-mneeded  -mno-direct-extern-access
> > +-munroll-only-small-loops  -mdispatch-scheduler  -mlam=@var{choice}}
> >
> >  @emph{x86 Windows Options}
> >  See Cygwin and MinGW Options.
> > @@ -26781,8 +26788,11 @@ specifies that the @code{dllimport} attribute 
> > should be ignored.
> >
> >  @opindex mthreads
> >  @item -mthreads
> > -This option is available for MinGW targets. It specifies
> > -that MinGW-specific thread support is to be used.
> > +Support thread-safe exception handling on MinGW.  Programs that rely
> > +on thread-safe exception handling must compile and link all code with the
> > +@option{-mthreads} option.  When compiling, @option{-mthreads} defines
> > +@option{-D_MT}; when linking, it links in a special thread helper library
> > +@option{-lmingwthrd} which cleans up per-thread exception-handling data.
> >
> >  @opindex municode
> >  @opindex mno-unicode
> > @@ -35717,7 +35727,11 @@ is defined for compatibility with Diab.
> >  @subsection x86 Options
> >  @cindex x86 Options
> >
> > -These @samp{-m} options are defined for the x86 family of computers.
> > +This section documents @option{-m} options available for the x86 family
> > +of computers.
> > +
> > +The following group of options allows compilation to target a specific
> > +processor.
> >
> >  @table @gcctabopt
> >
> > @@ -36338,10 +36352,583 @@ instruction set applicable to all processors.  
> > In contrast,
> >  @option{-mtune} indicates the processor (or, in this case, collection of
> >  processors) for which the code is optimized.
> >  @end table
> > +@end table
> >
> > -@opindex mcpu
> > -@item -mcpu=@var{cpu-type}
> > -A deprecated synonym for @option{-mtune}.
> > +The following options allow more detailed control over which instruction
> > +set extensions are targeted by GCC.
> > +Each has a corresponding @option{-mno-} option to disable use of these
> > +instructions.
> > +
> > +These extensions are also available as built-in functions: see
> > +@ref{x86 Built-in Functions}, for details of the functions enabled and
> > +disabled by these switches.
> > +
> > +These options enable GCC to use these extended instructions in
> > +generated code.  Applications that
> > +perform run-time CPU detection must compile separate files for each
> > +supported architecture, using the appropriate flags.  In particular,
> > +the file containing the CPU detection code should be compiled without
> > +these options.
> > +
> > +To control whether 387 or SSE/AVX instructions are generated
> > +automatically for floating-point arithmetic, see @option{-mfpmath=}, below.
> > +
> > +@table @gcctabopt
> > +@opindex mmmx
> > +@opindex mno-mmx
> > +@item -mmmx
> > +Support MMX built-in functions.
> > +
> > +@opindex msse
> > +@opindex mno-sse
> > +@item -msse
> > +Support MMX and SSE built-in functions and code generation.
> > +
> > +@opindex msse2
> > +@opindex mno-sse2
> > +@item -msse2
> > +Support MMX, SSE and SSE2 built-in functions and code generation.
> > +
> > +@opindex msse3
> > +@opindex mno-sse3
> > +@item -msse3
> > +Support MMX, SSE, SSE2 and SSE3 built-in functions and code generation.
> > +
> > +@opindex mssse3
> > +@opindex mno-ssse3
> > +@item -mssse3
> > +Support MMX, SSE, SSE2, SSE3 and SSSE3 built-in functions and code 
> > generation.
> > +
> > +@opindex msse4.1
> > +@opindex mno-sse4.1
> > +@item -msse4.1
> > +Support MMX, SSE, SSE2, SSE3, SSSE3 and SSE4.1 built-in functions and
> > +code generation.
> > +
> > +@opindex msse4.2
> > +@opindex mno-sse4.2
> > +@item -msse4.2
> > +Support MMX, SSE, SSE2, SSE3, SSSE3, SSE4.1 and SSE4.2 built-in functions
> > +and code generation.
> > +
> > +@opindex msse4
> > +@opindex mno-sse4
> > +@item -msse4
> > +Support MMX, SSE, SSE2, SSE3, SSSE3, SSE4.1 and SSE4.2 built-in functions
> > +and code generation.
> > +
> > +Note that @option{-msse4} enables both SSE4.1 and SSE4.2 support,
> > +while @option{-mno-sse4} turns off those features; neither form of the
> > +option affects SSE4A support, controlled separately by
> > +@option{-msse4a}.
> > +
> > +@opindex msse4a
> > +@opindex mno-sse4a
> > +@item -msse4a
> > +Support MMX, SSE, SSE2, SSE3 and SSE4A built-in functions and code 
> > generation.
> > +
> > +@opindex mavx
> > +@opindex mno-avx
> > +@item -mavx
> > +Support MMX, SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2 and AVX built-in
> > +functions and code generation.
> > +
> > +@opindex mavx2
> > +@opindex mno-avx2
> > +@item -mavx2
> > +Support MMX, SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX and AVX2
> > +built-in functions and code generation.
> > +
> > +@opindex mavx512f
> > +@opindex mno-avx512f
> > +@item -mavx512f
> > +Support MMX, SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2 and
> > +AVX512F built-in functions and code generation.
> > +
> > +@opindex mavx512cd
> > +@opindex mno-avx512cd
> > +@item -mavx512cd
> > +Support MMX, SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2,
> > +AVX512F and AVX512CD built-in functions and code generation.
> > +
> > +@opindex mavx512vl
> > +@opindex mno-avx512vl
> > +@item -mavx512vl
> > +Support MMX, SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2,
> > +AVX512F and AVX512VL built-in functions and code generation.
> > +
> > +@opindex mavx512bw
> > +@opindex mno-avx512bw
> > +@item -mavx512bw
> > +Support MMX, SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2,
> > +AVX512F and AVX512BW built-in functions and code generation.
> > +
> > +@opindex mavx512dq
> > +@opindex mno-avx512dq
> > +@item -mavx512dq
> > +Support MMX, SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2,
> > +AVX512F and AVX512DQ built-in functions and code generation.
> > +
> > +@opindex mavx512ifma
> > +@opindex mno-avx512ifma
> > +@item -mavx512ifma
> > +Support MMX, SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2,
> > +AVX512F and AVX512IFMA built-in functions and code generation.
> > +
> > +@opindex mavx512vbmi
> > +@opindex mno-avx512vbmi
> > +@item -mavx512vbmi
> > +Support MMX, SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2,
> > +AVX512F and AVX512VBMI built-in functions and code generation.
> > +
> > +@opindex mavx512vpopcntdq
> > +@opindex mno-avx512vpopcntdq
> > +@item -mavx512vpopcntdq
> > +Support MMX, SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2,
> > +AVX512F and AVX512VPOPCNTDQ built-in functions and code generation.
> > +
> > +@opindex mavx512vp2intersect
> > +@opindex mno-avx512vp2intersect
> > +@item -mavx512vp2intersect
> > +Support AVX512VP2INTERSECT built-in functions and code generation.
> > +
> > +@opindex mavx512vnni
> > +@opindex mno-avx512vnni
> > +@item -mavx512vnni
> > +Support AVX512VNNI built-in functions and code generation.
> > +
> > +@opindex mavx512vbmi2
> > +@opindex mno-avx512vbmi2
> > +@item -mavx512vbmi2
> > +Support MMX, SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2, AVX512F
> > +and AVX512VBMI2 built-in functions and code generation.
> > +
> > +@opindex mavx512bf16
> > +@opindex mno-avx512bf16
> > +@item -mavx512bf16
> > +Support MMX, SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2, AVX512F and
> > +AVX512BF16 built-in functions and code generation.
> > +
> > +@opindex mavx512fp16
> > +@opindex mno-avx512fp16
> > +@item -mavx512fp16
> > +Support MMX, SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2, AVX512F and
> > +AVX512-FP16 built-in functions and code generation.
> > +
> > +@opindex mavx512bitalg
> > +@opindex mno-avx512bitalg
> > +@item -mavx512bitalg
> > +Support MMX, SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2, AVX512F and
> > +AVX512BITALG built-in functions and code generation.
> > +
> > +@opindex mavx512bmm
> > +@opindex mno-avx512bmm
> > +@item -mavx512bmm
> > +Support AVX512BMM built-in functions and code generation.
> > +
> > +@opindex mavxvnni
> > +@opindex mno-avxvnni
> > +@item -mavxvnni
> > +Support MMX, SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2, and
> > +AVXVNNI built-in functions and code generation.
> > +
> > +@opindex mavxifma
> > +@opindex mno-avxifma
> > +@item -mavxifma
> > +Support MMX, SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2, and
> > +AVXIFMA built-in functions and code generation.
> > +
> > +@opindex mavxvnniint8
> > +@opindex mno-avxvnniint8
> > +@item -mavxvnniint8
> > +Support MMX, SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2 and
> > +AVXVNNIINT8 built-in functions and code generation.
> > +
> > +@opindex mavxneconvert
> > +@opindex mno-avxneconvert
> > +@item -mavxneconvert
> > +Support MMX, SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2, and
> > +AVXNECONVERT build-in functions and code generation.
> > +
> > +@opindex mavxvnniint16
> > +@opindex mno-avxvnniint16
> > +@item -mavxvnniint16
> > +Support MMX, SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2 and
> > +AVXVNNIINT16 built-in functions and code generation.
> > +
> > +@opindex mavx10.1
> > +@opindex mno-avx10.1
> > +@item -mavx10.1
> > +Support MMX, SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2,
> > +and AVX10.1 built-in functions and code generation.
> > +
> > +@opindex mavx10.2
> > +@opindex mno-avx10.2
> > +@item -mavx10.2
> > +Support MMX, SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2,
> > +AVX10.1 and AVX10.2 built-in functions and code generation.
> > +
> > +@opindex msha
> > +@opindex mno-sha
> > +@item -msha
> > +Support SHA1 and SHA256 built-in functions and code generation.
> > +
> > +@opindex maes
> > +@opindex mno-aes
> > +@item -maes
> > +Support AES built-in functions and code generation.
> > +
> > +@opindex mpclmul
> > +@opindex mno-pclmul
> > +@item -mpclmul
> > +Support PCLMUL built-in functions and code generation.
> > +
> > +@opindex mclflushopt
> > +@opindex mno-clflushopt
> > +@item -mclflushopt
> > +Support CLFLUSHOPT instructions.
> > +
> > +@opindex mclwb
> > +@opindex mno-clwb
> > +@item -mclwb
> > +Support CLWB instruction.
> > +
> > +@opindex mfsgsbase
> > +@opindex mno-fsgsbase
> > +@item -mfsgsbase
> > +Support FSGSBASE built-in functions and code generation.
> > +
> > +@opindex mptwrite
> > +@opindex mno-ptwrite
> > +@item -mptwrite
> > +Support PTWRITE built-in functions and code generation.
> > +
> > +@opindex mrdrnd
> > +@opindex mno-rdrnd
> > +@item -mrdrnd
> > +Support RDRND built-in functions and code generation.
> > +
> > +@opindex mf16c
> > +@opindex mno-f16c
> > +@item -mf16c
> > +Support F16C built-in functions and code generation.
> > +
> > +@opindex mfma
> > +@opindex mno-fma
> > +@item -mfma
> > +Support MMX, SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX and FMA
> > +built-in functions and code generation.
> > +
> > +@opindex mfma4
> > +@opindex mno-fma4
> > +@item -mfma4
> > +Support FMA4 built-in functions and code generation.
> > +
> > +@opindex mpconfig
> > +@opindex mno-pconfig
> > +@item -mpconfig
> > +Support PCONFIG built-in functions and code generation.
> > +
> > +@opindex mwbnoinvd
> > +@opindex mno-wbnoinvd
> > +@item -mwbnoinvd
> > +Support WBNOINVD built-in functions and code generation.
> > +
> > +@opindex mprfchw
> > +@opindex mno-prfchw
> > +@item -mprfchw
> > +Support PREFETCHW instruction.
> > +
> > +@opindex mrdpid
> > +@opindex mno-rdpid
> > +@item -mrdpid
> > +Support RDPID built-in functions and code generation.
> > +
> > +@opindex mrdseed
> > +@opindex mno-rdseed
> > +@item -mrdseed
> > +Support RDSEED instruction.
> > +
> > +@opindex msgx
> > +@opindex mno-sgx
> > +@item -msgx
> > +Support SGX built-in functions and code generation.
> > +
> > +@opindex mxop
> > +@opindex mno-xop
> > +@item -mxop
> > +Support XOP built-in functions and code generation.
> > +
> > +@opindex mlwp
> > +@opindex mno-lwp
> > +@item -mlwp
> > +Support LWP built-in functions and code generation.
> > +
> > +@opindex m3dnow
> > +@opindex mno-3dnow
> > +@item -m3dnow
> > +Support 3DNow! built-in functions.
> > +
> > +@opindex m3dnowa
> > +@opindex mno-3dnowa
> > +@item -m3dnowa
> > +Support Athlon 3Dnow! built-in functions.
> > +
> > +@opindex mpopcnt
> > +@opindex mno-popcnt
> > +@item -mpopcnt
> > +Support code generation of popcnt instruction.
> > +
> > +@opindex mabm
> > +@opindex mno-abm
> > +@item -mabm
> > +Support code generation of Advanced Bit Manipulation (ABM) instructions.
> > +
> > +@opindex madx
> > +@opindex mno-adx
> > +@item -madx
> > +Support flag-preserving add-carry instructions.
> > +
> > +@opindex mbmi
> > +@opindex mno-bmi
> > +@item -mbmi
> > +Support BMI built-in functions and code generation.
> > +
> > +@opindex mbmi2
> > +@opindex mno-bmi2
> > +@item -mbmi2
> > +Support BMI2 built-in functions and code generation.
> > +
> > +@opindex mlzcnt
> > +@opindex mno-lzcnt
> > +@item -mlzcnt
> > +Support LZCNT built-in function and code generation.
> > +
> > +@opindex mfxsr
> > +@opindex mno-fxsr
> > +@item -mfxsr
> > +Support FXSAVE and FXRSTOR instructions.
> > +
> > +@opindex mxsave
> > +@opindex mno-xsave
> > +@item -mxsave
> > +Support XSAVE and XRSTOR instructions.
> > +
> > +@opindex mxsaveopt
> > +@opindex mno-xsaveopt
> > +@item -mxsaveopt
> > +Support XSAVEOPT instruction.
> > +
> > +@opindex mxsavec
> > +@opindex mno-xsavec
> > +@item -mxsavec
> > +Support XSAVEC instructions.
> > +
> > +@opindex mxsaves
> > +@opindex mno-xsaves
> > +@item -mxsaves
> > +Support XSAVES and XRSTORS instructions.
> > +
> > +@opindex mrtm
> > +@opindex mno-rtm
> > +@item -mrtm
> > +Support RTM built-in functions and code generation.
> > +
> > +@opindex mhle
> > +@opindex mno-hle
> > +@item -mhle
> > +Support Hardware Lock Elision prefixes.
> > +
> > +@opindex mtbm
> > +@opindex mno-tbm
> > +@item -mtbm
> > +Support TBM built-in functions and code generation.
> > +
> > +@opindex mmwaitx
> > +@opindex mno-mwaitx
> > +@item -mmwaitx
> > +Support MWAITX and MONITORX built-in functions and code generation.
> > +
> > +@opindex mclzero
> > +@opindex mno-clzero
> > +@item -mclzero
> > +Support CLZERO built-in functions and code generation.
> > +
> > +@opindex mpku
> > +@opindex mno-pku
> > +@item -mpku
> > +Support PKU built-in functions and code generation.
> > +
> > +@opindex mgfni
> > +@opindex mno-gfni
> > +@item -mgfni
> > +Support GFNI built-in functions and code generation.
> > +
> > +@opindex mvaes
> > +@opindex mno-vaes
> > +@item -mvaes
> > +Support VAES built-in functions and code generation.
> > +
> > +@opindex mwaitpkg
> > +@opindex mno-waitpkg
> > +@item -mwaitpkg
> > +Support WAITPKG built-in functions and code generation.
> > +
> > +@opindex mvpclmulqdq
> > +@opindex mno-vpclmulqdq
> > +@item -mvpclmulqdq
> > +Support VPCLMULQDQ built-in functions and code generation.
> > +
> > +@opindex mmovdiri
> > +@opindex mno-movdiri
> > +@item -mmovdiri
> > +Support MOVDIRI built-in functions and code generation.
> > +
> > +@opindex mmovdir64b
> > +@opindex mno-movdir64b
> > +@item -mmovdir64b
> > +Support MOVDIR64B built-in functions and code generation.
> > +
> > +@opindex menqcmd
> > +@opindex mno-enqcmd
> > +@item -menqcmd
> > +Support ENQCMD built-in functions and code generation.
> > +
> > +@opindex muintr
> > +@opindex mno-uintr
> > +@item -muintr
> > +Support UINTR built-in functions and code generation.
> > +
> > +@opindex mtsxldtrk
> > +@opindex mno-tsxldtrk
> > +@item -mtsxldtrk
> > +Support TSXLDTRK built-in functions and code generation.
> > +
> > +@opindex mcldemote
> > +@opindex mno-cldemote
> > +@item -mcldemote
> > +Support CLDEMOTE built-in functions and code generation.
> > +
> > +@opindex mserialize
> > +@opindex mno-serialize
> > +@item -mserialize
> > +Support SERIALIZE built-in functions and code generation.
> > +
> > +@opindex mamx-tile
> > +@opindex mno-amx-tile
> > +@item -mamx-tile
> > +Support AMX-TILE built-in functions and code generation.
> > +
> > +@opindex mamx-int8
> > +@opindex mno-amx-int8
> > +@item -mamx-int8
> > +Support AMX-INT8 built-in functions and code generation.
> > +
> > +@opindex mamx-bf16
> > +@opindex mno-amx-bf16
> > +@item -mamx-bf16
> > +Support AMX-BF16 built-in functions and code generation.
> > +
> > +@opindex mhreset
> > +@opindex mno-hreset
> > +@item -mhreset
> > +Support HRESET built-in functions and code generation.
> > +
> > +@opindex mkl
> > +@opindex mno-kl
> > +@item -mkl
> > +Support KL built-in functions and code generation.
> > +
> > +@opindex mwidekl
> > +@opindex mno-widekl
> > +@item -mwidekl
> > +Support WIDEKL built-in functions and code generation.
> > +
> > +@opindex mcmpccxadd
> > +@opindex mno-cmpccxadd
> > +@item -mcmpccxadd
> > +Support MMX, SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2, and
> > +CMPCCXADD build-in functions and code generation.
> > +
> > +@opindex mamx-fp16
> > +@opindex mno-amx-fp16
> > +@item -mamx-fp16
> > +Support AMX-FP16 built-in functions and code generation.
> > +
> > +@opindex mprefetchi
> > +@opindex mno-prefetchi
> > +@item -mprefetchi
> > +Support PREFETCHI built-in functions and code generation.
> > +
> > +@opindex mraoint
> > +@opindex mno-raoint
> > +@item -mraoint
> > +Support RAOINT built-in functions and code generation.
> > +
> > +@opindex mamx-complex
> > +@opindex mno-amx-complex
> > +@item -mamx-complex
> > +Support AMX-COMPLEX built-in functions and code generation.
> > +
> > +@opindex msm3
> > +@opindex mno-sm3
> > +@item -msm3
> > +Support MMX, SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX and
> > +SM3 built-in functions and code generation.
> > +
> > +@opindex msm4
> > +@opindex mno-sm4
> > +@item -msm4
> > +Support MMX, SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX and
> > +SM4 built-in functions and code generation.
> > +
> > +@opindex msha512
> > +@opindex mno-sha512
> > +@item -msha512
> > +Support MMX, SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX and
> > +SHA512 built-in functions and code generation.
> > +
> > +@opindex mapxf
> > +@opindex mno-apxf
> > +@item -mapxf
> > +Support code generation for APX features, including EGPR, PUSH2POP2,
> > +NDD, PPX, NF, CCMP and ZU.
> > +
> > +@opindex musermsr
> > +@opindex mno-usermsr
> > +@item -musermsr
> > +Support USER_MSR built-in functions and code generation.
> > +
> > +@opindex mamx-avx512
> > +@opindex mno-amx-avx512
> > +@item -mamx-avx512
> > +Support MMX, SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2,
> > +AVX10.1, AVX10.2 and AMX-AVX512 built-in functions and code generation.
> > +
> > +@opindex mamx-tf32
> > +@opindex mno-amx-tf32
> > +@item -mamx-tf32
> > +Support AMX-TF32 built-in functions and code generation.
> > +
> > +@opindex mamx-fp8
> > +@opindex mno-amx-fp8
> > +@item -mamx-fp8
> > +Support AMX-FP8 built-in functions and code generation.
> > +
> > +@opindex mmovrs
> > +@opindex mno-movrs
> > +@item -mmovrs
> > +Support MOVRS built-in functions and code generation.
> > +
> > +@opindex mamx-movrs
> > +@opindex mno-amx-movrs
> > +@item -mamx-movrs
> > +Support AMX-MOVRS built-in functions and code generation.
> > +@end table
> > +
> > +These additional options are available for the x86 processor family.
> > +
> > +@table @gcctabopt
> >
> >  @opindex mfpmath
> >  @item -mfpmath=@var{unit}
> > @@ -36350,8 +36937,9 @@ for @var{unit} are:
> >
> >  @table @samp
> >  @item 387
> > -Use the standard 387 floating-point coprocessor present on the majority of 
> > chips and
> > -emulated otherwise.  Code compiled with this option runs almost everywhere.
> > +Use the standard 387 floating-point coprocessor present on the majority
> > +of chips and emulated otherwise.
> > +Code compiled with this option runs almost everywhere.
> >  The temporary results are computed in 80-bit precision instead of the 
> > precision
> >  specified by the type, resulting in slightly different results compared to 
> > most
> >  of other chips.  See @option{-ffloat-store} for more detailed description.
> > @@ -36361,18 +36949,20 @@ This is the default choice for non-Darwin x86-32 
> > targets.
> >  @item sse
> >  Use scalar floating-point instructions present in the SSE instruction set.
> >  This instruction set is supported by Pentium III and newer chips,
> > -and in the AMD line
> > -by Athlon-4, Athlon XP and Athlon MP chips.  The earlier version of the SSE
> > +and in the AMD line by Athlon-4, Athlon XP and Athlon MP chips.
> > +The earlier version of the SSE
> >  instruction set supports only single-precision arithmetic, thus the double 
> > and
> > -extended-precision arithmetic are still done using 387.  A later version, 
> > present
> > -only in Pentium 4 and AMD x86-64 chips, supports double-precision
> > -arithmetic too.
> > +extended-precision arithmetic are still done using 387.
> > +A later version, present only in Pentium 4 and AMD x86-64 chips,
> > +supports double-precision arithmetic too.
> >
> > -For the x86-32 compiler, you must use @option{-march=@var{cpu-type}}, 
> > @option{-msse}
> > +For the x86-32 compiler, you must use @option{-march=@var{cpu-type}},
> > +@option{-msse}
> >  or @option{-msse2} switches to enable SSE extensions and make this option
> >  effective.  For the x86-64 compiler, these extensions are enabled by 
> > default.
> >
> > -The resulting code should be considerably faster in the majority of cases 
> > and avoid
> > +The resulting code should be considerably faster in the majority of cases
> > +and avoid
> >  the numerical instability problems of 387 code, but may break some existing
> >  code that expects temporaries to be 80 bits.
> >
> > @@ -36380,6 +36970,11 @@ This is the default choice for the x86-64 
> > compiler, Darwin x86-32 targets,
> >  and the default choice for x86-32 targets with the SSE2 instruction set
> >  when @option{-ffast-math} is enabled.
> >
> > +GCC depresses SSE instructions when @option{-mavx} (or another option
> > +enabling AVX extensions) is used.  Instead, it
> > +generates new AVX instructions or AVX equivalents for all SSE instructions
> > +when needed.
> > +
> >  @item sse,387
> >  @itemx sse+387
> >  @itemx both
> > @@ -36390,14 +36985,6 @@ still experimental, because the GCC register 
> > allocator does not model separate
> >  functional units well, resulting in unstable performance.
> >  @end table
> >
> > -@opindex masm=@var{dialect}
> > -@item -masm=@var{dialect}
> > -Output assembly instructions using selected @var{dialect}.  Also affects
> > -which dialect is used for basic @code{asm} (@pxref{Basic Asm}) and
> > -extended @code{asm} (@pxref{Extended Asm}). Supported choices (in dialect
> > -order) are @samp{att} or @samp{intel}. The default is @samp{att}. Darwin 
> > does
> > -not support @samp{intel}.
> > -
> >  @opindex mieee-fp
> >  @opindex mno-ieee-fp
> >  @item -mieee-fp
> > @@ -36412,7 +36999,7 @@ comparison is unordered.
> >  @itemx -mhard-float
> >  Generate output containing 80387 instructions for floating point.
> >
> > -@opindex no-80387
> > +@opindex mno-80387
> >  @opindex msoft-float
> >  @item -mno-80387
> >  @itemx -msoft-float
> > @@ -36533,6 +37120,7 @@ objects larger than @var{threshold} are placed in 
> > large data sections.  The
> >  default is 65535.
> >
> >  @opindex mrtd
> > +@opindex mno-rtd
> >  @item -mrtd
> >  Use a different function-calling convention, in which functions that
> >  take a fixed number of arguments return with the @code{ret @var{num}}
> > @@ -36583,6 +37171,7 @@ modules with the same value, including any 
> > libraries.  This includes
> >  the system libraries and startup modules.
> >
> >  @opindex mvect8-ret-in-mem
> > +@opindex mno-vect8-ret-in-mem
> >  @item -mvect8-ret-in-mem
> >  Return 8-byte vectors in memory instead of MMX registers.  This is the
> >  default on VxWorks to match the ABI of the Sun Studio compilers until
> > @@ -36615,16 +37204,20 @@ loss of accuracy, typically through so-called 
> > ``catastrophic cancellation'',
> >  when this option is used to set the precision to less than extended 
> > precision.
> >
> >  @opindex mdaz-ftz
> > +@opindex mno-daz-ftz
> >  @item -mdaz-ftz
> > -
> > -The flush-to-zero (FTZ) and denormals-are-zero (DAZ) flags in the MXCSR 
> > register
> > -are used to control floating-point calculations.SSE and AVX instructions
> > -including scalar and vector instructions could benefit from enabling the 
> > FTZ
> > -and DAZ flags when @option{-mdaz-ftz} is specified. Don't set FTZ/DAZ flags
> > -when @option{-mno-daz-ftz} or @option{-shared} is specified, 
> > @option{-mdaz-ftz}
> > -will set FTZ/DAZ flags even with @option{-shared}.
> > +@itemx -mno-daz-ftz
> > +The flush-to-zero (FTZ) and denormals-are-zero (DAZ) flags in the
> > +MXCSR register are used to control floating-point calculations.  The
> > +@option{-Ofast}, @option{-ffast-math}, or
> > +@option{-funsafe-math-optimizations} options normally link in startup code
> > +that sets these flags except when building a shared library
> > +(@option{-shared}).  You can use the @option{-mdaz-ftz} and
> > +@option{-mno-daz-ftz} options to explicitly enable or disable setting
> > +these flags, regardless of other options passed to GCC.
> >
> >  @opindex mstackrealign
> > +@opindex mno-stackrealign
> >  @item -mstackrealign
> >  Realign the stack at entry.  On the x86, the @option{-mstackrealign}
> >  option generates an alternate prologue and epilogue that realigns the
> > @@ -36633,6 +37226,11 @@ run-time stack if necessary.  This supports mixing 
> > legacy codes that keep
> >  SSE compatibility.  See also the attribute @code{force_align_arg_pointer},
> >  applicable to individual functions.
> >
> > +@opindex mstack-arg-probe
> > +@opindex mno-stack-arg-probe
> > +@item -mstack-arg-probe
> > +Emit stack probing code in the function prologue.
> > +
> >  @opindex mpreferred-stack-boundary
> >  @item -mpreferred-stack-boundary=@var{num}
> >  Attempt to keep the stack boundary aligned to a 2 raised to @var{num}
> > @@ -36679,349 +37277,6 @@ increases code size.  Code that is sensitive to 
> > stack space usage, such
> >  as embedded systems and operating system kernels, may want to reduce the
> >  preferred alignment to @option{-mpreferred-stack-boundary=2}.
> >
> > -@need 200
> > -@opindex mmmx
> > -@item -mmmx
> > -@need 200
> > -@opindex msse
> > -@itemx -msse
> > -@need 200
> > -@opindex msse2
> > -@itemx -msse2
> > -@need 200
> > -@opindex msse3
> > -@itemx -msse3
> > -@need 200
> > -@opindex mssse3
> > -@itemx -mssse3
> > -@need 200
> > -@opindex msse4
> > -@itemx -msse4
> > -@need 200
> > -@opindex msse4a
> > -@itemx -msse4a
> > -@need 200
> > -@opindex msse4.1
> > -@itemx -msse4.1
> > -@need 200
> > -@opindex msse4.2
> > -@itemx -msse4.2
> > -@need 200
> > -@opindex mavx
> > -@itemx -mavx
> > -@need 200
> > -@opindex mavx2
> > -@itemx -mavx2
> > -@need 200
> > -@opindex mavx512f
> > -@itemx -mavx512f
> > -@need 200
> > -@opindex mavx512cd
> > -@itemx -mavx512cd
> > -@need 200
> > -@opindex mavx512vl
> > -@itemx -mavx512vl
> > -@need 200
> > -@opindex mavx512bw
> > -@itemx -mavx512bw
> > -@need 200
> > -@opindex mavx512dq
> > -@itemx -mavx512dq
> > -@need 200
> > -@opindex mavx512ifma
> > -@itemx -mavx512ifma
> > -@need 200
> > -@opindex mavx512vbmi
> > -@itemx -mavx512vbmi
> > -@need 200
> > -@opindex msha
> > -@itemx -msha
> > -@need 200
> > -@opindex maes
> > -@itemx -maes
> > -@need 200
> > -@opindex mpclmul
> > -@itemx -mpclmul
> > -@need 200
> > -@opindex mclflushopt
> > -@itemx -mclflushopt
> > -@need 200
> > -@opindex mclwb
> > -@itemx -mclwb
> > -@need 200
> > -@opindex mfsgsbase
> > -@itemx -mfsgsbase
> > -@need 200
> > -@opindex mptwrite
> > -@itemx -mptwrite
> > -@need 200
> > -@opindex mrdrnd
> > -@itemx -mrdrnd
> > -@need 200
> > -@opindex mf16c
> > -@itemx -mf16c
> > -@need 200
> > -@opindex mfma
> > -@itemx -mfma
> > -@need 200
> > -@opindex mpconfig
> > -@itemx -mpconfig
> > -@need 200
> > -@opindex mwbnoinvd
> > -@itemx -mwbnoinvd
> > -@need 200
> > -@opindex mfma4
> > -@itemx -mfma4
> > -@need 200
> > -@opindex mprfchw
> > -@itemx -mprfchw
> > -@need 200
> > -@opindex mrdpid
> > -@itemx -mrdpid
> > -@need 200
> > -@opindex mrdseed
> > -@itemx -mrdseed
> > -@need 200
> > -@opindex msgx
> > -@itemx -msgx
> > -@need 200
> > -@opindex mxop
> > -@itemx -mxop
> > -@need 200
> > -@opindex mlwp
> > -@itemx -mlwp
> > -@need 200
> > -@opindex m3dnow
> > -@itemx -m3dnow
> > -@need 200
> > -@opindex m3dnowa
> > -@itemx -m3dnowa
> > -@need 200
> > -@opindex mpopcnt
> > -@itemx -mpopcnt
> > -@need 200
> > -@opindex mabm
> > -@itemx -mabm
> > -@need 200
> > -@opindex madx
> > -@itemx -madx
> > -@need 200
> > -@opindex mbmi
> > -@itemx -mbmi
> > -@need 200
> > -@opindex mbmi2
> > -@itemx -mbmi2
> > -@need 200
> > -@opindex mlzcnt
> > -@itemx -mlzcnt
> > -@need 200
> > -@opindex mfxsr
> > -@itemx -mfxsr
> > -@need 200
> > -@opindex mxsave
> > -@itemx -mxsave
> > -@need 200
> > -@opindex mxsaveopt
> > -@itemx -mxsaveopt
> > -@need 200
> > -@opindex mxsavec
> > -@itemx -mxsavec
> > -@need 200
> > -@opindex mxsaves
> > -@itemx -mxsaves
> > -@need 200
> > -@opindex mrtm
> > -@itemx -mrtm
> > -@need 200
> > -@opindex mhle
> > -@itemx -mhle
> > -@need 200
> > -@opindex mtbm
> > -@itemx -mtbm
> > -@need 200
> > -@opindex mmwaitx
> > -@itemx -mmwaitx
> > -@need 200
> > -@opindex mclzero
> > -@itemx -mclzero
> > -@need 200
> > -@opindex mpku
> > -@itemx -mpku
> > -@need 200
> > -@opindex mavx512vbmi2
> > -@itemx -mavx512vbmi2
> > -@need 200
> > -@opindex mavx512bf16
> > -@itemx -mavx512bf16
> > -@need 200
> > -@opindex mavx512fp16
> > -@itemx -mavx512fp16
> > -@need 200
> > -@opindex mgfni
> > -@itemx -mgfni
> > -@need 200
> > -@opindex mvaes
> > -@itemx -mvaes
> > -@need 200
> > -@opindex mwaitpkg
> > -@itemx -mwaitpkg
> > -@need 200
> > -@opindex mvpclmulqdq
> > -@itemx -mvpclmulqdq
> > -@need 200
> > -@opindex mavx512bitalg
> > -@itemx -mavx512bitalg
> > -@need 200
> > -@opindex mmovdiri
> > -@itemx -mmovdiri
> > -@need 200
> > -@opindex mmovdir64b
> > -@itemx -mmovdir64b
> > -@need 200
> > -@opindex menqcmd
> > -@opindex muintr
> > -@itemx -menqcmd
> > -@itemx -muintr
> > -@need 200
> > -@opindex mtsxldtrk
> > -@itemx -mtsxldtrk
> > -@need 200
> > -@opindex mavx512vpopcntdq
> > -@itemx -mavx512vpopcntdq
> > -@need 200
> > -@opindex mavx512vp2intersect
> > -@itemx -mavx512vp2intersect
> > -@need 200
> > -@opindex mavx512vnni
> > -@itemx -mavx512vnni
> > -@need 200
> > -@opindex mavxvnni
> > -@itemx -mavxvnni
> > -@need 200
> > -@opindex mcldemote
> > -@itemx -mcldemote
> > -@need 200
> > -@opindex mserialize
> > -@itemx -mserialize
> > -@need 200
> > -@opindex mamx-tile
> > -@itemx -mamx-tile
> > -@need 200
> > -@opindex mamx-int8
> > -@itemx -mamx-int8
> > -@need 200
> > -@opindex mamx-bf16
> > -@itemx -mamx-bf16
> > -@need 200
> > -@opindex mhreset
> > -@opindex mkl
> > -@itemx -mhreset
> > -@itemx -mkl
> > -@need 200
> > -@opindex mwidekl
> > -@itemx -mwidekl
> > -@need 200
> > -@opindex mavxifma
> > -@itemx -mavxifma
> > -@need 200
> > -@opindex mavxvnniint8
> > -@itemx -mavxvnniint8
> > -@need 200
> > -@opindex mavxneconvert
> > -@itemx -mavxneconvert
> > -@need 200
> > -@opindex mcmpccxadd
> > -@itemx -mcmpccxadd
> > -@need 200
> > -@opindex mamx-fp16
> > -@itemx -mamx-fp16
> > -@need 200
> > -@opindex mprefetchi
> > -@itemx -mprefetchi
> > -@need 200
> > -@opindex mraoint
> > -@itemx -mraoint
> > -@need 200
> > -@opindex mamx-complex
> > -@itemx -mamx-complex
> > -@need 200
> > -@opindex mavxvnniint16
> > -@itemx -mavxvnniint16
> > -@need 200
> > -@opindex msm3
> > -@itemx -msm3
> > -@need 200
> > -@opindex msha512
> > -@itemx -msha512
> > -@need 200
> > -@opindex msm4
> > -@itemx -msm4
> > -@need 200
> > -@opindex mapxf
> > -@itemx -mapxf
> > -@need 200
> > -@opindex musermsr
> > -@itemx -musermsr
> > -@need 200
> > -@opindex mavx10.1
> > -@itemx -mavx10.1
> > -@need 200
> > -@opindex mavx10.2
> > -@itemx -mavx10.2
> > -@need 200
> > -@opindex mamx-avx512
> > -@itemx -mamx-avx512
> > -@need 200
> > -@opindex mamx-tf32
> > -@itemx -mamx-tf32
> > -@need 200
> > -@itemx -mamx-fp8
> > -@opindex mamx-fp8
> > -@need 200
> > -@opindex mmovrs
> > -@itemx -mmovrs
> > -@need 200
> > -@opindex mamx-movrs
> > -@itemx -mamx-movrs
> > -@need 200
> > -@opindex mavx512bmm
> > -@itemx -mavx512bmm
> > -These switches enable the use of instructions in the MMX, SSE,
> > -AVX512CD, AVX512VL, AVX512BW, AVX512DQ, AVX512IFMA, AVX512VBMI, SHA, AES,
> > -PCLMUL, CLFLUSHOPT, CLWB, FSGSBASE, PTWRITE, RDRND, F16C, FMA, PCONFIG,
> > -WBNOINVD, FMA4, PREFETCHW, RDPID, RDSEED, SGX, XOP, LWP, 3DNow!@:,
> > -enhanced 3DNow!@:, POPCNT, ABM, ADX, BMI, BMI2, LZCNT, FXSR, XSAVE, 
> > XSAVEOPT,
> > -XSAVEC, XSAVES, RTM, HLE, TBM, MWAITX, CLZERO, PKU, AVX512VBMI2, GFNI, 
> > VAES,
> > -WAITPKG, VPCLMULQDQ, AVX512BITALG, MOVDIRI, MOVDIR64B, AVX512BF16, ENQCMD,
> > -AVX512VPOPCNTDQ, AVX512VNNI, SERIALIZE, UINTR, HRESET, AMXTILE, AMXINT8,
> > -AMXBF16, KL, WIDEKL, AVXVNNI, AVX512-FP16, AVXIFMA, AVXVNNIINT8, 
> > AVXNECONVERT,
> > -CMPCCXADD, AMX-FP16, PREFETCHI, RAOINT, AMX-COMPLEX, AVXVNNIINT16, SM3, 
> > SHA512,
> > -SM4, APX_F, USER_MSR, AVX10.1, AVX10.2, AMX-AVX512, AMX-TF32, AMX-FP8, 
> > MOVRS,
> > -AMX-MOVRS, AVX512BMM or CLDEMOTE extended instruction sets. Each has a
> > -corresponding @option{-mno-} option to disable use of these instructions.
> > -
> > -These extensions are also available as built-in functions: see
> > -@ref{x86 Built-in Functions}, for details of the functions enabled and
> > -disabled by these switches.
> > -
> > -Note that @option{-msse4} enables both SSE4.1 and SSE4.2 support,
> > -while @option{-mno-sse4} turns off those features; neither form of the
> > -option affects SSE4A support, controlled separately by
> > -@option{-msse4a}.
> > -
> > -To generate SSE/SSE2 instructions automatically from floating-point
> > -code (as opposed to 387 instructions), see @option{-mfpmath=sse}.
> > -
> > -GCC depresses SSEx instructions when @option{-mavx} is used. Instead, it
> > -generates new AVX instructions or AVX equivalence for all SSEx instructions
> > -when needed.
> > -
> > -These options enable GCC to use these extended instructions in
> > -generated code, even without @option{-mfpmath=sse}.  Applications that
> > -perform run-time CPU detection must compile separate files for each
> > -supported architecture, using the appropriate flags.  In particular,
> > -the file containing the CPU detection code should be compiled without
> > -these options.
> >
> >  @opindex mdump-tune-features
> >  @item -mdump-tune-features
> > @@ -37031,10 +37286,10 @@ tuning features and default settings. The names 
> > can be used in
> >
> >  @opindex mtune-ctrl=@var{feature-list}
> >  @item -mtune-ctrl=@var{feature-list}
> > -This option is used to do fine grain control of x86 code generation 
> > features.
> > -@var{feature-list} is a comma separated list of @var{feature} names. See 
> > also
> > -@option{-mdump-tune-features}. When specified, the @var{feature} is turned
> > -on if it is not preceded with @samp{^}, otherwise, it is turned off.
> > +This option is used to do fine-grain control of x86 code generation 
> > features.
> > +@var{feature-list} is a comma-separated list of @var{feature} names. See 
> > also
> > +@option{-mdump-tune-features}.  When specified, the @var{feature} is turned
> > +on if it is not preceded with @samp{^}; otherwise, it is turned off.
> >  @option{-mtune-ctrl=@var{feature-list}} is intended to be used by GCC
> >  developers. Using it may lead to code paths not covered by testing and can
> >  potentially result in compiler ICEs or runtime errors.
> > @@ -37045,6 +37300,7 @@ This option instructs GCC to turn off all tunable 
> > features. See also
> >  @option{-mtune-ctrl=@var{feature-list}} and @option{-mdump-tune-features}.
> >
> >  @opindex mcld
> > +@opindex mno-cld
> >  @item -mcld
> >  This option instructs GCC to emit a @code{cld} instruction in the prologue
> >  of functions that use string instructions.  String instructions depend on
> > @@ -37059,13 +37315,23 @@ instructions can be suppressed with the 
> > @option{-mno-cld} compiler option
> >  in this case.
> >
> >  @opindex mvzeroupper
> > +@opindex mno-vzeroupper
> >  @item -mvzeroupper
> >  This option instructs GCC to emit a @code{vzeroupper} instruction
> >  before a transfer of control flow out of the function to minimize
> >  the AVX to SSE transition penalty as well as remove unnecessary 
> > @code{zeroupper}
> >  intrinsics.
> >
> > +@opindex mstv
> > +@opindex mno-stv
> > +@item -mstv
> > +@itemx -mno-stv
> > +Enable/disable Scalar to Vectorization pass transforming 64-bit
> > +integer computation into vector ones.  This optimization is restricted
> > +to @option{-O2} and higher.
> > +
> >  @opindex mprefer-avx128
> > +@opindex mno-prefer-avx128
> >  @item -mprefer-avx128
> >  This option instructs GCC to use 128-bit AVX instructions instead of
> >  256-bit AVX instructions in the auto-vectorizer.
> > @@ -37076,7 +37342,9 @@ This option instructs GCC to use @var{opt}-bit 
> > vector width in instructions
> >  instead of default on the selected platform.
> >
> >  @opindex mpartial-vector-fp-math
> > +@opindex mno-partial-vector-fp-math
> >  @item -mpartial-vector-fp-math
> > +@itemx -mno-partial-vector-fp-math
> >  This option enables GCC to generate floating-point operations that might
> >  affect the set of floating-point status flags on partial vectors, where
> >  vector elements reside in the low part of the 128-bit SSE register.  Unless
> > @@ -37119,6 +37387,7 @@ Prefer 512-bit vector width for instructions.
> >  @end table
> >
> >  @opindex mnoreturn-no-callee-saved-registers
> > +@opindex mno-noreturn-no-callee-saved-registers
> >  @item -mnoreturn-no-callee-saved-registers
> >  This option optimizes functions with @code{noreturn} attribute or
> >  @code{_Noreturn} specifier by not saving in the function prologue 
> > callee-saved
> > @@ -37128,6 +37397,7 @@ register).  This option can interfere with 
> > debugging of the caller of the
> >  is not enabled by default.
> >
> >  @opindex mcx16
> > +@opindex mno-cx16
> >  @item -mcx16
> >  This option enables GCC to generate @code{CMPXCHG16B} instructions in 
> > 64-bit
> >  code to implement compare-and-exchange operations on 16-byte aligned 
> > 128-bit
> > @@ -37137,6 +37407,7 @@ machine word in size.  The compiler uses this 
> > instruction to implement
> >  128-bit integers, a library call is always used.
> >
> >  @opindex msahf
> > +@opindex mno-sahf
> >  @item -msahf
> >  This option enables generation of @code{SAHF} instructions in 64-bit code.
> >  Early Intel Pentium 4 CPUs with Intel 64 support,
> > @@ -37149,28 +37420,33 @@ In 64-bit mode, the @code{SAHF} instruction is 
> > used to optimize @code{fmod},
> >  see @ref{Other Builtins} for details.
> >
> >  @opindex mmovbe
> > +@opindex mno-movbe
> >  @item -mmovbe
> >  This option enables use of the @code{movbe} instruction to optimize
> >  byte swapping of four and eight byte entities.
> >
> >  @opindex mshstk
> > +@opindex mno-shstk
> >  @item -mshstk
> >  The @option{-mshstk} option enables shadow stack built-in functions
> >  from x86 Control-flow Enforcement Technology (CET).
> >
> >  @opindex mcrc32
> > +@opindex mno-crc32
> >  @item -mcrc32
> >  This option enables built-in functions @code{__builtin_ia32_crc32qi},
> >  @code{__builtin_ia32_crc32hi}, @code{__builtin_ia32_crc32si} and
> >  @code{__builtin_ia32_crc32di} to generate the @code{crc32} machine 
> > instruction.
> >
> >  @opindex mmwait
> > +@opindex mno-mwait
> >  @item -mmwait
> >  This option enables built-in functions @code{__builtin_ia32_monitor},
> >  and @code{__builtin_ia32_mwait} to generate the @code{monitor} and
> >  @code{mwait} machine instructions.
> >
> >  @opindex mrecip
> > +@opindex mno-recip
> >  @item -mrecip
> >  This option enables use of @code{RCPSS} and @code{RSQRTSS} instructions
> >  (and their vectorized variants @code{RCPPS} and @code{RSQRTPS})
> > @@ -37193,7 +37469,7 @@ for vectorized single-float division and vectorized 
> > @code{sqrtf(@var{x})}
> >  already with @option{-ffast-math} (or the above option combination), and
> >  doesn't need @option{-mrecip}.
> >
> > -@opindex mrecip=opt
> > +@opindex mrecip=
> >  @item -mrecip=@var{opt}
> >  This option controls which reciprocal estimate instructions
> >  may be used.  @var{opt} is a comma-separated list of options, which may
> > @@ -37289,13 +37565,23 @@ You can control this behavior for specific 
> > functions by
> >  using the function attributes @code{ms_abi} and @code{sysv_abi}.
> >  @xref{Function Attributes}.
> >
> > +@opindex masm=@var{dialect}
> > +@item -masm=@var{dialect}
> > +Output assembly instructions using selected @var{dialect}.  Also affects
> > +which dialect is used for basic @code{asm} (@pxref{Basic Asm}) and
> > +extended @code{asm} (@pxref{Extended Asm}). Supported choices (in dialect
> > +order) are @samp{att} or @samp{intel}. The default is @samp{att}. Darwin 
> > does
> > +not support @samp{intel}.
> > +
> >  @opindex mforce-indirect-call
> > +@opindex mno-force-indirect-call
> >  @item -mforce-indirect-call
> >  Force all calls to functions to be indirect. This is useful
> >  when using Intel Processor Trace where it generates more precise timing
> >  information for function calls.
> >
> >  @opindex mmanual-endbr
> > +@opindex mno-manual-endbr
> >  @item -mmanual-endbr
> >  Insert ENDBR instruction at function entry only via the @code{cf_check}
> >  function attribute. This is useful when used with the option
> > @@ -37303,6 +37589,7 @@ function attribute. This is useful when used with 
> > the option
> >  function entry.
> >
> >  @opindex mcet-switch
> > +@opindex mno-cet-switch
> >  @item -mcet-switch
> >  By default, CET instrumentation is turned off on switch statements that
> >  use a jump table and indirect branch track is disabled.  Since jump
> > @@ -37341,6 +37628,7 @@ by default.  In some cases disabling it may improve 
> > performance because of
> >  improved scheduling and reduced dependencies.
> >
> >  @opindex maccumulate-outgoing-args
> > +@opindex mno-accumulate-outgoing-args
> >  @item -maccumulate-outgoing-args
> >  If enabled, the maximum amount of space required for outgoing arguments is
> >  computed in the function prologue.  This is faster on most modern CPUs
> > @@ -37348,14 +37636,6 @@ because of reduced dependencies, improved 
> > scheduling and reduced stack usage
> >  when the preferred stack boundary is not equal to 2.  The drawback is a 
> > notable
> >  increase in code size.  This switch implies @option{-mno-push-args}.
> >
> > -@opindex mthreads
> > -@item -mthreads
> > -Support thread-safe exception handling on MinGW.  Programs that rely
> > -on thread-safe exception handling must compile and link all code with the
> > -@option{-mthreads} option.  When compiling, @option{-mthreads} defines
> > -@option{-D_MT}; when linking, it links in a special thread helper library
> > -@option{-lmingwthrd} which cleans up per-thread exception-handling data.
> > -
> >  @opindex mms-bitfields
> >  @opindex mno-ms-bitfields
> >  @item -mms-bitfields
> > @@ -37458,11 +37738,11 @@ Taking this into account, it is important to note 
> > the following:
> >  @enumerate
> >  @item If a zero-length bit-field follows a normal bit-field, the type of 
> > the
> >  zero-length bit-field may affect the alignment of the structure as whole. 
> > For
> > -example, @code{t2} has a size of 4 bytes, since the zero-length bit-field 
> > follows a
> > -normal bit-field, and is of type short.
> > +example, @code{t2} has a size of 4 bytes, since the zero-length bit-field
> > +follows a normal bit-field, and is of type short.
> >
> > -@item Even if a zero-length bit-field is not followed by a normal 
> > bit-field, it may
> > -still affect the alignment of the structure:
> > +@item Even if a zero-length bit-field is not followed by a normal 
> > bit-field,
> > +it may still affect the alignment of the structure:
> >
> >  @smallexample
> >  struct
> > @@ -37500,6 +37780,7 @@ code size and improves performance in case the 
> > destination is already aligned,
> >  but GCC doesn't know about it.
> >
> >  @opindex minline-all-stringops
> > +@opindex mno-inline-all-stringops
> >  @item -minline-all-stringops
> >  By default GCC inlines string operations only when the destination is
> >  known to be aligned to least a 4-byte boundary.
> > @@ -37510,6 +37791,7 @@ The option enables inline expansion of 
> > @code{strlen} for all
> >  pointer alignments.
> >
> >  @opindex minline-stringops-dynamically
> > +@opindex mno-inline-stringops-dynamically
> >  @item -minline-stringops-dynamically
> >  For string operations of unknown size, use run-time checks with
> >  inline code for small blocks and a library call for large blocks.
> > @@ -37536,31 +37818,34 @@ Always use a library call.
> >
> >  @opindex mmemcpy-strategy=@var{strategy}
> >  @item -mmemcpy-strategy=@var{strategy}
> > -Override the internal decision heuristic to decide if 
> > @code{__builtin_memcpy}
> > -should be inlined and what inline algorithm to use when the expected size
> > -of the copy operation is known. @var{strategy}
> > -is a comma-separated list of @var{alg}:@var{max_size}:@var{dest_align} 
> > triplets.
> > -@var{alg} is specified in @option{-mstringop-strategy}, @var{max_size} 
> > specifies
> > -the max byte size with which inline algorithm @var{alg} is allowed.  For 
> > the last
> > -triplet, the @var{max_size} must be @code{-1}. The @var{max_size} of the 
> > triplets
> > -in the list must be specified in increasing order.  The minimal byte size 
> > for
> > -@var{alg} is @code{0} for the first triplet and @code{@var{max_size} + 1} 
> > of the
> > -preceding range.
> > +Override the internal decision heuristic to decide if
> > +@code{__builtin_memcpy} should be inlined and what inline algorithm to
> > +use when the expected size of the copy operation is known.
> > +@var{strategy} is a comma-separated list of
> > +@var{alg}:@var{max_size}:@var{dest_align} triplets.  @var{alg} is
> > +specified in @option{-mstringop-strategy}, @var{max_size} specifies
> > +the max byte size with which inline algorithm @var{alg} is allowed.
> > +For the last triplet, the @var{max_size} must be @code{-1}.
> > +The @var{max_size} of the triplets in the list must be specified in
> > +increasing order.  The minimal byte size for @var{alg} is @code{0} for
> > +the first triplet and @code{@var{max_size} + 1} of the preceding range.
> >
> >  @opindex mmemset-strategy=@var{strategy}
> >  @item -mmemset-strategy=@var{strategy}
> > -The option is similar to @option{-mmemcpy-strategy=} except that it is to 
> > control
> > -@code{__builtin_memset} expansion.
> > +This option is similar to @option{-mmemcpy-strategy=} except that it
> > +controls the @code{__builtin_memset} expansion.
> >
> >  @opindex momit-leaf-frame-pointer
> > +@opindex mno-omit-leaf-frame-pointer
> >  @item -momit-leaf-frame-pointer
> >  Don't keep the frame pointer in a register for leaf functions.  This
> >  avoids the instructions to save, set up, and restore frame pointers and
> >  makes an extra register available in leaf functions.  The option
> > -@option{-momit-leaf-frame-pointer} removes the frame pointer for leaf 
> > functions,
> > -which might make debugging harder.
> > +@option{-momit-leaf-frame-pointer} removes the frame pointer for leaf
> > +functions, which might make debugging harder.
> >
> >  @opindex mtls-direct-seg-refs
> > +@opindex mno-tls-direct-seg-refs
> >  @item -mtls-direct-seg-refs
> >  @itemx -mno-tls-direct-seg-refs
> >  Controls whether TLS variables may be accessed with offsets from the
> > @@ -37572,12 +37857,14 @@ segment to cover the entire TLS area.
> >  For systems that use the GNU C Library, the default is on.
> >
> >  @opindex msse2avx
> > +@opindex mno-sse2avx
> >  @item -msse2avx
> >  @itemx -mno-sse2avx
> >  Specify that the assembler should encode SSE instructions with VEX
> >  prefix.  The option @option{-mavx} turns this on by default.
> >
> >  @opindex mfentry
> > +@opindex mno-fentry
> >  @item -mfentry
> >  @itemx -mno-fentry
> >  If profiling is active (@option{-pg}), put the profiling
> > @@ -37586,55 +37873,58 @@ Note: On x86 architectures the attribute 
> > @code{ms_hook_prologue}
> >  isn't possible at the moment for @option{-mfentry} and @option{-pg}.
> >
> >  @opindex mrecord-mcount
> > +@opindex mno-record-mcount
> >  @item -mrecord-mcount
> >  @itemx -mno-record-mcount
> > -If profiling is active (@option{-pg}), generate a __mcount_loc section
> > -that contains pointers to each profiling call. This is useful for
> > -automatically patching and out calls.
> > +If profiling is active (@option{-pg}), generate a section that contains
> > +pointers to each profiling call; this is useful for automatically patching
> > +the calls.  You can use the @option{mfentry-section=} option to set the
> > +name of the section; it defaults to @samp{__mcount_loc}.
> >
> >  @opindex mnop-mcount
> > +@opindex mno-nop-mcount
> >  @item -mnop-mcount
> >  @itemx -mno-nop-mcount
> >  If profiling is active (@option{-pg}), generate the calls to
> > -the profiling functions as NOPs. This is useful when they
> > +the profiling functions as NOPs.  This is useful when they
> >  should be patched in later dynamically. This is likely only
> >  useful together with @option{-mrecord-mcount}.
> >
> >  @opindex minstrument-return
> >  @item -minstrument-return=@var{type}
> > -Instrument function exit in -pg -mfentry instrumented functions with
> > -call to specified function. This only instruments true returns ending
> > -with ret, but not sibling calls ending with jump. Valid types
> > -are @var{none} to not instrument, @var{call} to generate a call to 
> > __return__,
> > -or @var{nop5} to generate a 5 byte nop.
> > +With @option{-pg -mfentry}, instrument function exits according to
> > +@var{type}, which may be one of @samp{none} to not instrument,
> > +@samp{call} to generate a call to @code{__return__}, or @samp{nop5} to
> > +generate a 5-byte nop sequence.  This option only instruments true
> > +returns ending with a @code{ret}, not sibling calls via a jump.
> >
> >  @opindex mrecord-return
> > +@opindex mno-record-return
> >  @item -mrecord-return
> >  @itemx -mno-record-return
> > -Generate a __return_loc section pointing to all return instrumentation 
> > code.
> > +Generate a @code{__return_loc} section pointing to all return 
> > instrumentation
> > +code.
> >
> >  @opindex mfentry-name
> >  @item -mfentry-name=@var{name}
> > -Set name of __fentry__ symbol called at function entry for -pg -mfentry 
> > functions.
> > +Set name of @code{__fentry__} symbol called at function entry for
> > +@option{-pg -mfentry} functions.
> >
> >  @opindex mfentry-section
> >  @item -mfentry-section=@var{name}
> > -Set name of section to record -mrecord-mcount calls (default __mcount_loc).
> > +Set name of section to record @option{-mrecord-mcount} calls.  The
> > +default is @samp{__mcount_loc}.
> >
> >  @opindex mskip-rax-setup
> > +@opindex mno-skip-rax-setup
> >  @item -mskip-rax-setup
> >  @itemx -mno-skip-rax-setup
> >  When generating code for the x86-64 architecture with SSE extensions
> >  disabled, @option{-mskip-rax-setup} can be used to skip setting up RAX
> >  register when there are no variable arguments passed in vector registers.
> >
> > -@strong{Warning:} Since RAX register is used to avoid unnecessarily
> > -saving vector registers on stack when passing variable arguments, the
> > -impacts of this option are callees may waste some stack space,
> > -misbehave or jump to a random location.  GCC 4.4 or newer don't have
> > -those issues, regardless the RAX register value.
> > -
> >  @opindex m8bit-idiv
> > +@opindex mno-8bit-idiv
> >  @item -m8bit-idiv
> >  @itemx -mno-8bit-idiv
> >  On some processors, like Intel Atom, 8-bit unsigned integer divide is
> > @@ -37644,7 +37934,9 @@ to 255, 8-bit unsigned integer divide is used 
> > instead of
> >  32-bit/64-bit integer divide.
> >
> >  @opindex mavx256-split-unaligned-load
> > +@opindex mno-avx256-split-unaligned-load
> >  @opindex mavx256-split-unaligned-store
> > +@opindex mno-avx256-split-unaligned-store
> >  @item -mavx256-split-unaligned-load
> >  @itemx -mavx256-split-unaligned-store
> >  Split 32-byte AVX unaligned load and store.
> > @@ -37679,6 +37971,7 @@ prevents the compiler from using floating-point, 
> > vector, mask and bound
> >  registers.
> >
> >  @opindex mrelax-cmpxchg-loop
> > +@opindex mno-relax-cmpxchg-loop
> >  @item -mrelax-cmpxchg-loop
> >  When emitting a compare-and-swap loop for @ref{__sync Builtins}
> >  and @ref{__atomic Builtins} lacking a native instruction, optimize
> > @@ -37728,6 +38021,7 @@ not be reachable in the large code model.
> >
> >
> >  @opindex mindirect-branch-register
> > +@opindex mno-indirect-branch-register
> >  @item -mindirect-branch-register
> >  Force indirect call and jump via register.
> >
> > @@ -37740,20 +38034,27 @@ hardening.  @samp{return} enables SLS hardening 
> > for function returns.
> >  @samp{all} enables all SLS hardening.
> >
> >  @opindex mindirect-branch-cs-prefix
> > +@opindex mno-indirect-branch-cs-prefix
> >  @item -mindirect-branch-cs-prefix
> >  Add CS prefix to call and jmp to indirect thunk with branch target in
> > -r8-r15 registers so that the call and jmp instruction length is 6 bytes
> > -to allow them to be replaced with @samp{lfence; call *%r8-r15} or
> > -@samp{lfence; jmp *%r8-r15} at run-time.
> > +r8-r15 registers, so that the call and jmp instruction length is 6 bytes.
> > +This allows them to potentially be replaced with @samp{lfence; call 
> > *%r8-r15}
> > +or @samp{lfence; jmp *%r8-r15} at run time.
> >
> >  @opindex mapx-inline-asm-use-gpr32
> > +@opindex mno-apx-inline-asm-use-gpr32
> >  @item -mapx-inline-asm-use-gpr32
> > -For inline asm support with APX, by default the EGPR feature was
> > -disabled to prevent potential illegal instruction with EGPR occurs.
> > -To invoke egpr usage in inline asm, use new compiler option
> > --mapx-inline-asm-use-gpr32 and user should ensure the instruction
> > -supports EGPR.
> > +By default, EGPR usage is disabled in inline asm constraints as GCC cannot
> > +be aware of whether the asm instructions support GP32 or not.
> > +If your inline asm can handle EGPR, use 
> > @option{-mapx-inline-asm-use-gpr32}.
> >
> > +@opindex mgather
> > +@opindex mno-gather
> > +@opindex mscatter
> > +@opindex mno-scatter
> > +@item -mgather
> > +@itemx -mscatter
> > +Enable vectorization for gather and scatter instructions, respectively.
> >  @end table
> >
> >  These @samp{-m} switches are supported in addition to the above
> > @@ -37764,12 +38065,10 @@ on x86-64 processors in 64-bit environments.
> >  @opindex m64
> >  @opindex mx32
> >  @opindex m16
> > -@opindex miamcu
> >  @item -m32
> >  @itemx -m64
> >  @itemx -mx32
> >  @itemx -m16
> > -@itemx -miamcu
> >  Generate code for a 16-bit, 32-bit or 64-bit environment.
> >  The @option{-m32} option sets @code{int}, @code{long}, and pointer types
> >  to 32 bits, and
> > @@ -37788,7 +38087,10 @@ The @option{-m16} option is the same as 
> > @option{-m32}, except for that
> >  it outputs the @code{.code16gcc} assembly directive at the beginning of
> >  the assembly output so that the binary can run in 16-bit mode.
> >
> > -The @option{-miamcu} option generates code which conforms to Intel MCU
> > +@opindex miamcu
> > +@opindex mno-iamcu
> > +@item -miamcu
> > +The @option{-miamcu} option generates code that conforms to Intel MCU
> >  psABI.  It requires the @option{-m32} option to be turned on.
> >
> >  @opindex mno-red-zone
> > @@ -37840,6 +38142,7 @@ and x32 environments.  It is the default address 
> > mode for 32-bit and
> >  x32 environments.
> >
> >  @opindex mneeded
> > +@opindex mno-needed
> >  @item -mneeded
> >  @itemx -mno-needed
> >  Emit GNU_PROPERTY_X86_ISA_1_NEEDED GNU property for Linux target to
> > @@ -37848,7 +38151,7 @@ indicate the micro-architecture ISA level required 
> > to execute the binary.
> >  @opindex mno-direct-extern-access
> >  @opindex mdirect-extern-access
> >  @item -mno-direct-extern-access
> > -Without @option{-fpic} nor @option{-fPIC}, always use the GOT pointer
> > +Without @option{-fpic} or @option{-fPIC}, always use the GOT pointer
> >  to access external symbols.  With @option{-fpic} or @option{-fPIC},
> >  treat access to protected symbols as local symbols.  The default is
> >  @option{-mdirect-extern-access}.
> > @@ -37861,10 +38164,19 @@ protected symbols are used in shared libraries 
> > and executable.
> >  @opindex munroll-only-small-loops
> >  @opindex mno-unroll-only-small-loops
> >  @item -munroll-only-small-loops
> > -Controls conservative small loop unrolling. It is default enabled by
> > -O2, and unrolls loop with less than 4 insns by 1 time. Explicit
> > --f[no-]unroll-[all-]loops would disable this flag to avoid any
> > -unintended unrolling behavior that user does not want.
> > +@itemx -mno-unroll-only-small-loops
> > +Controls conservative small loop unrolling.  It is enabled by default with
> > +@option{-O2}, and unrolls loops with less than 4 instructions by 1 time.
> > +This gives better utilization of the instruction decoding pipeline on 
> > modern
> > +processors.  You can disable this with 
> > @option{-mno-unroll-only-small-loops},
> > +and it is also disabled if the more general options @option{-funroll-loops}
> > +or @option{-funroll-all-loops} are either enabled or explicitly disabled.
> > +
> > +@opindex mdispatch-scheduler
> > +@item -mdispatch-scheduler
> > +Enable instruction scheduling.  This is only supported on @samp{bdver1},
> > +@samp{bdver2}, @samp{bdver3}, @samp{bdver4}, and @samp{znver1} processors 
> > and
> > +additionally requires @option{-fschedule-insns -fsched-pressure}.
> >
> >  @opindex mlam
> >  @item -mlam=@var{choice}
> > --
> > 2.39.5
> >
>
>
> --
> BR,
> Hongtao

Re: [PATCH 13/14] doc, x86: Clean up x86 options documentation [PR122243]

Reply via email to