From patchwork Mon Jan 24 12:11:30 2022
Content-Type: text/plain; charset="utf-8"
MIME-Version: 1.0
Content-Transfer-Encoding: 7bit
X-Patchwork-Submitter: Richard Biener <rguenther@suse.de>
X-Patchwork-Id: 50388
Return-Path: <gcc-patches-bounces+patchwork=sourceware.org@gcc.gnu.org>
X-Original-To: patchwork@sourceware.org
Delivered-To: patchwork@sourceware.org
Received: from server2.sourceware.org (localhost [IPv6:::1])
	by sourceware.org (Postfix) with ESMTP id 57D983858002
	for <patchwork@sourceware.org>; Mon, 24 Jan 2022 12:12:02 +0000 (GMT)
DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org 57D983858002
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gcc.gnu.org;
	s=default; t=1643026322;
	bh=OOhC9+G1iYt+8tQOpgfVQUK4QBDMHwrzCCF51HVJdyY=;
	h=Date:To:Subject:List-Id:List-Unsubscribe:List-Archive:List-Post:
	 List-Help:List-Subscribe:From:Reply-To:From;
	b=lo+EYsBApqbRiaJJo+vthNzO7QXpuRWRnML3Ov0gpIrTIDNMToddvl1Yz9Uro7ZGa
	 En1IJkqvpZ1C/F2bpjum4WUDa5tBykhCSIhl2aVTVnaHqAY428gZ7fB9e/QZsAbMvN
	 j7YyCZYVQ7yeqXFVCGvOcj+8F6pMyHtbGP23X2mE=
X-Original-To: gcc-patches@gcc.gnu.org
Delivered-To: gcc-patches@gcc.gnu.org
Received: from smtp-out1.suse.de (smtp-out1.suse.de [195.135.220.28])
 by sourceware.org (Postfix) with ESMTPS id A1FAA3858D3C
 for <gcc-patches@gcc.gnu.org>; Mon, 24 Jan 2022 12:11:32 +0000 (GMT)
DMARC-Filter: OpenDMARC Filter v1.4.1 sourceware.org A1FAA3858D3C
Received: from imap2.suse-dmz.suse.de (imap2.suse-dmz.suse.de
 [192.168.254.74])
 (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits)
 key-exchange X25519 server-signature ECDSA (P-521) server-digest SHA512)
 (No client certificate requested)
 by smtp-out1.suse.de (Postfix) with ESMTPS id 7600F218E5;
 Mon, 24 Jan 2022 12:11:31 +0000 (UTC)
Received: from imap2.suse-dmz.suse.de (imap2.suse-dmz.suse.de
 [192.168.254.74])
 (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits)
 key-exchange X25519 server-signature ECDSA (P-521) server-digest SHA512)
 (No client certificate requested)
 by imap2.suse-dmz.suse.de (Postfix) with ESMTPS id 5B36313B2C;
 Mon, 24 Jan 2022 12:11:31 +0000 (UTC)
Received: from dovecot-director2.suse.de ([192.168.254.65])
 by imap2.suse-dmz.suse.de with ESMTPSA id Y7jhFHOX7mFZJwAAMHmgww
 (envelope-from <rguenther@suse.de>); Mon, 24 Jan 2022 12:11:31 +0000
Date: Mon, 24 Jan 2022 13:11:30 +0100 (CET)
To: gcc-patches@gcc.gnu.org
Subject: [PATCH] tree-optimization/102131 - fix niter analysis wrt overflow
MIME-Version: 1.0
Message-Id: <20220124121131.5B36313B2C@imap2.suse-dmz.suse.de>
X-Spam-Status: No, score=-11.8 required=5.0 tests=BAYES_00, DKIM_SIGNED,
 DKIM_VALID, DKIM_VALID_AU, DKIM_VALID_EF, GIT_PATCH_0, SPF_HELO_NONE,
 SPF_PASS,
 TXREP autolearn=ham autolearn_force=no version=3.4.4
X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on
 server2.sourceware.org
X-BeenThere: gcc-patches@gcc.gnu.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Gcc-patches mailing list <gcc-patches.gcc.gnu.org>
List-Unsubscribe: <https://gcc.gnu.org/mailman/options/gcc-patches>,
 <mailto:gcc-patches-request@gcc.gnu.org?subject=unsubscribe>
List-Archive: <https://gcc.gnu.org/pipermail/gcc-patches/>
List-Post: <mailto:gcc-patches@gcc.gnu.org>
List-Help: <mailto:gcc-patches-request@gcc.gnu.org?subject=help>
List-Subscribe: <https://gcc.gnu.org/mailman/listinfo/gcc-patches>,
 <mailto:gcc-patches-request@gcc.gnu.org?subject=subscribe>
X-Patchwork-Original-From: Richard Biener via Gcc-patches
 <gcc-patches@gcc.gnu.org>
From: Richard Biener <rguenther@suse.de>
Reply-To: Richard Biener <rguenther@suse.de>
Errors-To: gcc-patches-bounces+patchwork=sourceware.org@gcc.gnu.org
Sender: "Gcc-patches"
 <gcc-patches-bounces+patchwork=sourceware.org@gcc.gnu.org>

This fixes the overflow issues seen with analyzing
BASE0 + STEP0 cmp BASE1 + STEP1 as BASE0 + STEP0 - STEP1 cmp BASE1
by following the logic we have when simplifying comparisons.

Bootstrapped and tested on x86_64-unknown-linux-gnu.

I've built SPEC 2017 with extra statistics and in all of it there
are 518 cases where the new condition triggers (that's of course
usually multiple times per actual loop).  I don't have the number
of cases where IV analysis failed upon further analysis with the
old logic though.  Those 518 cases are out of a total number of
4744090 cases that run into the simplification out of which 4722153
return successfully from number_of_iterations_cond.  619 cases fail
with the old test (non-wrapping IVs and constant step).

Based on this analysis I have pushed this change now.

Richard.

2022-01-24  Richard Biener  <rguenther@suse.de>
	    Jiufu Guo <guojiufu@linux.ibm.com>

	PR tree-optimization/100740
	PR tree-optimization/101508
	PR tree-optimization/101972
	PR tree-optimization/102131
	* tree-ssa-loop-niter.cc (): Properly constrain
	BASE0 + STEP0 cmp BASE1 + STEP1 to BASE0 + STEP0 - STEP1 cmp BASE1
	transform.

	* gcc.dg/torture/pr100740.c: New testcase.
	* gcc.dg/torture/pr101508.c: Likewise.
	* gcc.dg/torture/pr101972.c: Likewise.
	* gcc.dg/torture/pr102131-1.c: Likewise.
	* gcc.dg/torture/pr102131-2.c: Likewise.
	* gcc.dg/torture/pr102131-3.c: Likewise.
	* gcc.dg/torture/pr102131-4.c: Likewise.
---
 gcc/testsuite/gcc.dg/torture/pr100740.c   | 12 +++++++
 gcc/testsuite/gcc.dg/torture/pr101508.c   | 13 ++++++++
 gcc/testsuite/gcc.dg/torture/pr101972.c   | 39 +++++++++++++++++++++++
 gcc/testsuite/gcc.dg/torture/pr102131-1.c | 16 ++++++++++
 gcc/testsuite/gcc.dg/torture/pr102131-2.c | 15 +++++++++
 gcc/testsuite/gcc.dg/torture/pr102131-3.c | 11 +++++++
 gcc/testsuite/gcc.dg/torture/pr102131-4.c | 15 +++++++++
 gcc/tree-ssa-loop-niter.cc                | 29 +++++++++++------
 8 files changed, 141 insertions(+), 9 deletions(-)
 create mode 100644 gcc/testsuite/gcc.dg/torture/pr100740.c
 create mode 100644 gcc/testsuite/gcc.dg/torture/pr101508.c
 create mode 100644 gcc/testsuite/gcc.dg/torture/pr101972.c
 create mode 100644 gcc/testsuite/gcc.dg/torture/pr102131-1.c
 create mode 100644 gcc/testsuite/gcc.dg/torture/pr102131-2.c
 create mode 100644 gcc/testsuite/gcc.dg/torture/pr102131-3.c
 create mode 100644 gcc/testsuite/gcc.dg/torture/pr102131-4.c

diff --git a/gcc/testsuite/gcc.dg/torture/pr100740.c b/gcc/testsuite/gcc.dg/torture/pr100740.c
new file mode 100644
index 00000000000..a85855ebe18
--- /dev/null
+++ b/gcc/testsuite/gcc.dg/torture/pr100740.c
@@ -0,0 +1,12 @@
+/* { dg-do run } */
+
+unsigned a, b;
+int main()
+{
+  unsigned c = 0;
+  for (a = 0; a < 2; a++)
+    for (b = 0; b < 2; b++)
+      if (++c < a)
+	__builtin_abort ();
+  return 0;
+}
diff --git a/gcc/testsuite/gcc.dg/torture/pr101508.c b/gcc/testsuite/gcc.dg/torture/pr101508.c
new file mode 100644
index 00000000000..e1cb2645a31
--- /dev/null
+++ b/gcc/testsuite/gcc.dg/torture/pr101508.c
@@ -0,0 +1,13 @@
+/* { dg-do run } */
+
+int
+main ()
+{
+  unsigned i;
+  for (i = 0; i < 3; ++i)
+    {
+      if (i > i * 2)
+        __builtin_abort ();
+    }
+  return 0;
+}
diff --git a/gcc/testsuite/gcc.dg/torture/pr101972.c b/gcc/testsuite/gcc.dg/torture/pr101972.c
new file mode 100644
index 00000000000..3adb4828ea5
--- /dev/null
+++ b/gcc/testsuite/gcc.dg/torture/pr101972.c
@@ -0,0 +1,39 @@
+/* { dg-do run  } */
+/* { dg-require-effective-target int32plus } */
+
+int a, b, c, d, f;
+static short e = 63891;
+char g = 30;
+unsigned h(int i, int j) { return i << j; }
+int *l(int *);
+void m()
+{
+  a = 0;
+  for (; a >= 0; a--)
+    {
+      int *k = &b;
+      *k = e < 0;
+    }
+  c = b;
+  l(&c);
+}
+int *l(int *i)
+{
+  d = 2;
+  for (; d <= 6; d++)
+    {
+      if (h(d, *i) <= d)
+	;
+      else
+	continue;
+      g = 0;
+      return &f;
+    }
+  return (void *)0;
+}
+int main()
+{
+  m();
+  if (g != 30)
+    __builtin_abort ();
+}
diff --git a/gcc/testsuite/gcc.dg/torture/pr102131-1.c b/gcc/testsuite/gcc.dg/torture/pr102131-1.c
new file mode 100644
index 00000000000..5ff576d8634
--- /dev/null
+++ b/gcc/testsuite/gcc.dg/torture/pr102131-1.c
@@ -0,0 +1,16 @@
+/* { dg-do run } */
+
+int a;
+int main()
+{
+  unsigned b = 0;
+  int c = 1;
+  for (; b < 3; b++)
+    {
+      while (c < b)
+	__builtin_abort ();
+      for (a = 0; a < 3; a++)
+	c++;
+    }
+  return 0;
+}
diff --git a/gcc/testsuite/gcc.dg/torture/pr102131-2.c b/gcc/testsuite/gcc.dg/torture/pr102131-2.c
new file mode 100644
index 00000000000..830c72c23d9
--- /dev/null
+++ b/gcc/testsuite/gcc.dg/torture/pr102131-2.c
@@ -0,0 +1,15 @@
+/* { dg-do run } */
+
+int a;
+int main()
+{
+  unsigned b = 0;
+  int c = 1;
+  for (;b < 3; b++)
+    {
+      if (c < b)
+	__builtin_abort ();
+      c+=3;
+    }
+  return 0;
+}
diff --git a/gcc/testsuite/gcc.dg/torture/pr102131-3.c b/gcc/testsuite/gcc.dg/torture/pr102131-3.c
new file mode 100644
index 00000000000..aed10c973e6
--- /dev/null
+++ b/gcc/testsuite/gcc.dg/torture/pr102131-3.c
@@ -0,0 +1,11 @@
+/* { dg-do run } */
+
+int a;
+int main()
+{
+  unsigned b = 0;
+  for (a = 2; a < 8; a += 2)
+    if (++b > a)
+      __builtin_abort();
+  return 0;
+}
diff --git a/gcc/testsuite/gcc.dg/torture/pr102131-4.c b/gcc/testsuite/gcc.dg/torture/pr102131-4.c
new file mode 100644
index 00000000000..c63c08b2137
--- /dev/null
+++ b/gcc/testsuite/gcc.dg/torture/pr102131-4.c
@@ -0,0 +1,15 @@
+/* { dg-do run } */
+/* { dg-require-effective-target int32plus } */
+
+unsigned a;
+int main()
+{
+  unsigned b = 1;
+  for (; b < 4; b++) {
+      a = (a ^ 2000000000) * -b;
+      if (b > a)
+	__builtin_abort ();
+      a = 3000000000;
+  }
+  return 0;
+}
diff --git a/gcc/tree-ssa-loop-niter.cc b/gcc/tree-ssa-loop-niter.cc
index 21cc257c91b..747d631149b 100644
--- a/gcc/tree-ssa-loop-niter.cc
+++ b/gcc/tree-ssa-loop-niter.cc
@@ -1903,17 +1903,28 @@ number_of_iterations_cond (class loop *loop,
       tree step = fold_binary_to_constant (MINUS_EXPR, step_type,
 					   iv0->step, iv1->step);
 
-      /* No need to check sign of the new step since below code takes care
-	 of this well.  */
-      if (code != NE_EXPR
-	  && (TREE_CODE (step) != INTEGER_CST
-	      || !iv0->no_overflow || !iv1->no_overflow))
-	return false;
+      /* For code other than NE_EXPR we have to ensure moving the evolution
+	 of IV1 to that of IV0 does not introduce overflow.  */
+      if (TREE_CODE (step) != INTEGER_CST
+	  || !iv0->no_overflow || !iv1->no_overflow)
+	{
+	  if (code != NE_EXPR)
+	    return false;
+	  iv0->no_overflow = false;
+	}
+      /* If the new step of IV0 has changed sign or is of greater
+	 magnitude then we do not know whether IV0 does overflow
+	 and thus the transform is not valid for code other than NE_EXPR  */
+      else if (tree_int_cst_sign_bit (step) != tree_int_cst_sign_bit (iv0->step)
+	       || wi::gtu_p (wi::abs (wi::to_widest (step)),
+			     wi::abs (wi::to_widest (iv0->step))))
+	{
+	  if (code != NE_EXPR)
+	    return false;
+	  iv0->no_overflow = false;
+	}
 
       iv0->step = step;
-      if (!POINTER_TYPE_P (type))
-	iv0->no_overflow = false;
-
       iv1->step = build_int_cst (step_type, 0);
       iv1->no_overflow = true;
     }