[v2] Add TARGET_MOVE_WITH_MODE_P

  On Wed, Mar 02, 2022 at 09:51:26AM +0100, Richard Biener wrote:
> On Tue, Mar 1, 2022 at 11:41 PM H.J. Lu via Gcc-patches
> <gcc-patches@gcc.gnu.org> wrote:
> >
> > Add TARGET_FOLD_MEMCPY_MAX for the maximum number of bytes to fold memcpy.
> > The default is
> >
> > MOVE_MAX * MOVE_RATIO (optimize_function_for_size_p (cfun))
> >
> > For x86, it is MOVE_MAX to restore the old behavior before
> 
> I know we've discussed this to death in the PR, I just want to repeat here
> that the GIMPLE folding expects to generate a single load and a single
> store (that is what it does on the GIMPLE level) which is why MOVE_MAX
> was chosen originally (it's documented to what a "single instruction" does).
> In practice MOVE_MAX does not seem to cover vector register sizes
> so Richard pulled MOVE_RATIO which is really intended to cover
> the case of using multiple instructions for moving memory (but then I
> don't remember whether for the ARM case the single load/store GIMPLE
> will be expanded to multiple load/store instructions).
> 
> TARGET_FOLD_MEMCPY_MAX sounds like a stop-gap solution,
> being very specific for memcpy folding (we also fold memmove btw).
> 
> There is also MOVE_MAX_PIECES which _might_ be more appropriate
> than MOVE_MAX here and still honor the idea of single instructions.
> Now neither arm nor aarch64 define this and it defaults to MOVE_MAX,
> not MOVE_MAX * MOVE_RATIO.
> 
> So if we need a new hook then that hook should at least get the
> 'speed' argument of MOVE_RATIO and it should get a better name.
> 
> I still think that it should be possible to improve the insn check to
> avoid use of "disabled" modes, maybe that's also a point to add
> a new hook like .move_with_mode_p or so?  To quote, we do

Here is the v2 patch to add TARGET_MOVE_WITH_MODE_P.

> 
>               scalar_int_mode mode;
>               if (int_mode_for_size (ilen * 8, 0).exists (&mode)
>                   && GET_MODE_SIZE (mode) * BITS_PER_UNIT == ilen * 8
>                   && have_insn_for (SET, mode)
>                   /* If the destination pointer is not aligned we must be able
>                      to emit an unaligned store.  */
>                   && (dest_align >= GET_MODE_ALIGNMENT (mode)
>                       || !targetm.slow_unaligned_access (mode, dest_align)
>                       || (optab_handler (movmisalign_optab, mode)
>                           != CODE_FOR_nothing)))
> 
> where I understand the ISA is enabled and if the user explicitely
> uses it that's OK but -mprefer-avx128 should tell GCC to never
> generate AVX256 code where the user was not explicitely using it
> (still for example glibc might happily use AVX256 code to implement
> the memcpy we are folding!)
> 
> Note the BB vectorizer also might end up with using AVX256 because
> in places it also relies on optab queries and the vector_mode_supported_p
> check (but the memcpy folding uses the fake integer modes).  So
> x86 might need to implement the related_mode hook to avoid "auto"-using
> a larger vector mode which the default implementation would happily do.
> 
> Richard.

OK for master?

Thanks.

H.J.
---
Add TARGET_MOVE_WITH_MODE_P to return true if move with mode can be
generated implicitly.  The default definition returns true.  The x86
version returns true if the mode size <= MOVE_MAX, which is the max
number of bytes we can move in one reasonably fast instruction.

gcc/

	PR target/103393
	* gimple-fold.cc (gimple_fold_builtin_memory_op): Call
	targetm.move_with_mode_p to check if move with mode can be
	generated implicitly.
	* target.def: Add move_with_mode_p.
	* targhooks.cc (default_move_with_mode_p): New.
	* targhooks.h (default_move_with_mode_p): Likewise.
	* config/i386/i386.cc (ix86_move_with_mode_p): New.
	(TARGET_MOVE_WITH_MODE_P): Likewise.
	* doc/tm.texi.in: Add TARGET_MOVE_WITH_MODE_P.
	* doc/tm.texi: Regenerate.

gcc/testsuite/

	PR target/103393
	* gcc.target/i386/pr103393-1.c: New test.
	* gcc.target/i386/pr103393-2.c: Likewise.
	* gcc.target/i386/pr103393-3.c: Likewise.
	* gcc.target/i386/pr103393-4.c: Likewise.
	* gcc.target/i386/pr103393-5.c: Likewise.
---
 gcc/config/i386/i386.cc                    | 12 ++++++++++++
 gcc/doc/tm.texi                            |  5 +++++
 gcc/doc/tm.texi.in                         |  2 ++
 gcc/gimple-fold.cc                         |  1 +
 gcc/target.def                             |  7 +++++++
 gcc/testsuite/gcc.target/i386/pr103393-1.c | 16 ++++++++++++++++
 gcc/testsuite/gcc.target/i386/pr103393-2.c | 16 ++++++++++++++++
 gcc/testsuite/gcc.target/i386/pr103393-3.c | 16 ++++++++++++++++
 gcc/testsuite/gcc.target/i386/pr103393-4.c | 16 ++++++++++++++++
 gcc/testsuite/gcc.target/i386/pr103393-5.c | 16 ++++++++++++++++
 10 files changed, 107 insertions(+)
 create mode 100644 gcc/testsuite/gcc.target/i386/pr103393-1.c
 create mode 100644 gcc/testsuite/gcc.target/i386/pr103393-2.c
 create mode 100644 gcc/testsuite/gcc.target/i386/pr103393-3.c
 create mode 100644 gcc/testsuite/gcc.target/i386/pr103393-4.c
 create mode 100644 gcc/testsuite/gcc.target/i386/pr103393-5.c

Message ID	Yh/fD6YGRT+G3Ltt@gmail.com
State	New
Headers	DMARC-Filter: OpenDMARC Filter v1.4.1 sourceware.org 5B2883858D39 Date: Wed, 2 Mar 2022 13:18:07 -0800 To: Richard Biener <richard.guenther@gmail.com> Subject: [PATCH v2] Add TARGET_MOVE_WITH_MODE_P Message-ID: <Yh/fD6YGRT+G3Ltt@gmail.com> References: <20220301224100.910199-1-hjl.tools@gmail.com> <CAFiYyc1akc2HWZaFLcgQM0qeYQYi0hFMxbwOC=7JP-9H0RqGFg@mail.gmail.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <CAFiYyc1akc2HWZaFLcgQM0qeYQYi0hFMxbwOC=7JP-9H0RqGFg@mail.gmail.com> Precedence: list From: "H.J. Lu via Gcc-patches" <gcc-patches@gcc.gnu.org> Reply-To: "H.J. Lu" <hjl.tools@gmail.com> Cc: GCC Patches <gcc-patches@gcc.gnu.org>, Richard Earnshaw <rearnsha@arm.com> Errors-To: gcc-patches-bounces+patchwork=sourceware.org@gcc.gnu.org Sender: "Gcc-patches" <gcc-patches-bounces+patchwork=sourceware.org@gcc.gnu.org>
Series	[v2] Add TARGET_MOVE_WITH_MODE_P \| [v2] Add TARGET_MOVE_WITH_MODE_P

[v2] Add TARGET_MOVE_WITH_MODE_P

Commit Message

Comments

Patch