[v4] I386: Add more testcases for unsigned SAT_ADD vector pattern

Message ID 20241127015742.4011511-1-pan2.li@intel.com
State New
Series [v4] I386: Add more testcases for unsigned SAT_ADD vector pattern

Checks

Context                                         Check    Description
linaro-tcwg-bot/tcwg_gcc_build--master-arm      success  Build passed
linaro-tcwg-bot/tcwg_gcc_build--master-aarch64  success  Build passed
linaro-tcwg-bot/tcwg_gcc_check--master-arm      success  Test passed
linaro-tcwg-bot/tcwg_gcc_check--master-aarch64  success  Test passed

Commit Message

Li, Pan2 Nov. 27, 2024, 1:57 a.m. UTC
  From: Pan Li <pan2.li@intel.com>

Some forms like the one below failed to be recognized as a SAT_ADD
pattern for target i386.  This was related to match pattern
extraction and was fixed by the refactor of the SAT_ADD pattern.
Thus, add testcases to ensure we won't have similar issues in the
future.

  #define DEF_SAT_ADD(T)   \
  T sat_add_##T (T x, T y) \
  {                        \
    T res;                 \
    res = x + y;           \
    res |= -(T)(res < x);  \
    return res;            \
  }

  #define VEC_DEF_SAT_ADD(T)                       \
  void vec_sat_add(T * restrict a, T * restrict b) \
  {                                                \
    for (int i = 0; i < 8; i++)                    \
      b[i] = sat_add_##T (a[i], b[i]);             \
  }

  DEF_SAT_ADD (uint32_t)
  VEC_DEF_SAT_ADD (uint32_t)

The test suites below pass with this patch.
make -k check-gcc RUNTESTFLAGS="--target_board=unix\{,-m32\} i386.exp=pr112600-5a-*.c"

	PR target/112600

gcc/testsuite/ChangeLog:

	* gcc.target/i386/pr112600-5a-u16.c: New test.
	* gcc.target/i386/pr112600-5a-u32.c: New test.
	* gcc.target/i386/pr112600-5a-u64.c: New test.
	* gcc.target/i386/pr112600-5a-u8.c: New test.
	* gcc.target/i386/pr112600-5a.h: New test.

Signed-off-by: Pan Li <pan2.li@intel.com>
---
 .../gcc.target/i386/pr112600-5a-u16.c         | 10 +++++++++
 .../gcc.target/i386/pr112600-5a-u32.c         |  9 ++++++++
 .../gcc.target/i386/pr112600-5a-u64.c         | 10 +++++++++
 .../gcc.target/i386/pr112600-5a-u8.c          | 10 +++++++++
 gcc/testsuite/gcc.target/i386/pr112600-5a.h   | 22 +++++++++++++++++++
 5 files changed, 61 insertions(+)
 create mode 100644 gcc/testsuite/gcc.target/i386/pr112600-5a-u16.c
 create mode 100644 gcc/testsuite/gcc.target/i386/pr112600-5a-u32.c
 create mode 100644 gcc/testsuite/gcc.target/i386/pr112600-5a-u64.c
 create mode 100644 gcc/testsuite/gcc.target/i386/pr112600-5a-u8.c
 create mode 100644 gcc/testsuite/gcc.target/i386/pr112600-5a.h
  

Comments

Uros Bizjak Nov. 27, 2024, 8:15 a.m. UTC | #1
On Wed, Nov 27, 2024 at 3:00 AM <pan2.li@intel.com> wrote:
>
> From: Pan Li <pan2.li@intel.com>
>
> Some forms like below failed to recog the SAT_ADD pattern

... failed to be recognized as a SAT_ADD pattern ...

> for target i386.  It is related to some match pattern
> extraction but get fixed after the refactor of the SAT_ADD
> pattern.  Thus, add testcases to ensure we may have similar
> issue in futrue.

... to ensure we won't have similar issues in the future.

>
>   #define DEF_SAT_ADD(T)   \
>   T sat_add_##T (T x, T y) \
>   {                        \
>     T res;                 \
>     res = x + y;           \
>     res |= -(T)(res < x);  \
>     return res;            \
>   }
>
>   #define VEC_DEF_SAT_ADD(T)                       \
>   void vec_sat_add(T * restrict a, T * restrict b) \
>   {                                                \
>     for (int i = 0; i < 8; i++)                    \
>       b[i] = sat_add_##T (a[i], b[i]);             \
>   }
>
>   DEF_SAT_ADD (uint32_t)
>   VEC_DEF_SAT_ADD (uint32_t)
>
> The below test suites are passed for this patch.
> make -k check-gcc RUNTESTFLAGS="--target_board=unix\{,-m32\} i386.exp=pr112600-5a-*.c"
>
>         PR target/112600
>
> gcc/testsuite/ChangeLog:
>
>         * gcc.target/i386/pr112600-5a-u16.c: New test.
>         * gcc.target/i386/pr112600-5a-u32.c: New test.
>         * gcc.target/i386/pr112600-5a-u64.c: New test.
>         * gcc.target/i386/pr112600-5a-u8.c: New test.
>         * gcc.target/i386/pr112600-5a.h: New test.

OK, but please drop the "a" suffix from the new files (when I added the
original pr112600 testcases, the "a" suffix was for char type and "b" was
for short type, but this is not the case with your testcases).

Thanks,
Uros.

>
> Signed-off-by: Pan Li <pan2.li@intel.com>
> ---
>  .../gcc.target/i386/pr112600-5a-u16.c         | 10 +++++++++
>  .../gcc.target/i386/pr112600-5a-u32.c         |  9 ++++++++
>  .../gcc.target/i386/pr112600-5a-u64.c         | 10 +++++++++
>  .../gcc.target/i386/pr112600-5a-u8.c          | 10 +++++++++
>  gcc/testsuite/gcc.target/i386/pr112600-5a.h   | 22 +++++++++++++++++++
>  5 files changed, 61 insertions(+)
>  create mode 100644 gcc/testsuite/gcc.target/i386/pr112600-5a-u16.c
>  create mode 100644 gcc/testsuite/gcc.target/i386/pr112600-5a-u32.c
>  create mode 100644 gcc/testsuite/gcc.target/i386/pr112600-5a-u64.c
>  create mode 100644 gcc/testsuite/gcc.target/i386/pr112600-5a-u8.c
>  create mode 100644 gcc/testsuite/gcc.target/i386/pr112600-5a.h
>
> diff --git a/gcc/testsuite/gcc.target/i386/pr112600-5a-u16.c b/gcc/testsuite/gcc.target/i386/pr112600-5a-u16.c
> new file mode 100644
> index 00000000000..f462bfa4800
> --- /dev/null
> +++ b/gcc/testsuite/gcc.target/i386/pr112600-5a-u16.c
> @@ -0,0 +1,10 @@
> +/* PR target/112600 */
> +/* { dg-do compile } */
> +/* { dg-options "-O2 -msse2 -fdump-tree-optimized" } */
> +
> +#include "pr112600-5a.h"
> +
> +DEF_SAT_ADD (uint16_t)
> +VEC_DEF_SAT_ADD (uint16_t)
> +
> +/* { dg-final { scan-tree-dump-times ".SAT_ADD " 3 "optimized" } } */
> diff --git a/gcc/testsuite/gcc.target/i386/pr112600-5a-u32.c b/gcc/testsuite/gcc.target/i386/pr112600-5a-u32.c
> new file mode 100644
> index 00000000000..5797c97ebe9
> --- /dev/null
> +++ b/gcc/testsuite/gcc.target/i386/pr112600-5a-u32.c
> @@ -0,0 +1,9 @@
> +/* PR target/112600 */
> +/* { dg-do compile } */
> +/* { dg-options "-O2 -msse2 -fdump-tree-optimized" } */
> +
> +#include "pr112600-5a.h"
> +
> +DEF_SAT_ADD (uint32_t)
> +
> +/* { dg-final { scan-tree-dump-times ".SAT_ADD " 1 "optimized" } } */
> diff --git a/gcc/testsuite/gcc.target/i386/pr112600-5a-u64.c b/gcc/testsuite/gcc.target/i386/pr112600-5a-u64.c
> new file mode 100644
> index 00000000000..d5f81f72ed5
> --- /dev/null
> +++ b/gcc/testsuite/gcc.target/i386/pr112600-5a-u64.c
> @@ -0,0 +1,10 @@
> +/* PR target/112600 */
> +/* { dg-do compile  { target { ! ia32 } } } */
> +/* { dg-options "-O2 -msse2 -fdump-tree-optimized" } */
> +
> +#include "pr112600-5a.h"
> +
> +DEF_SAT_ADD (uint64_t)
> +
> +
> +/* { dg-final { scan-tree-dump-times ".SAT_ADD " 1 "optimized" } } */
> diff --git a/gcc/testsuite/gcc.target/i386/pr112600-5a-u8.c b/gcc/testsuite/gcc.target/i386/pr112600-5a-u8.c
> new file mode 100644
> index 00000000000..cb8657ecd86
> --- /dev/null
> +++ b/gcc/testsuite/gcc.target/i386/pr112600-5a-u8.c
> @@ -0,0 +1,10 @@
> +/* PR target/112600 */
> +/* { dg-do compile } */
> +/* { dg-options "-O2 -msse2 -fdump-tree-optimized" } */
> +
> +#include "pr112600-5a.h"
> +
> +DEF_SAT_ADD (uint8_t)
> +VEC_DEF_SAT_ADD (uint8_t)
> +
> +/* { dg-final { scan-tree-dump-times ".SAT_ADD " 2 "optimized" } } */
> diff --git a/gcc/testsuite/gcc.target/i386/pr112600-5a.h b/gcc/testsuite/gcc.target/i386/pr112600-5a.h
> new file mode 100644
> index 00000000000..482c865e953
> --- /dev/null
> +++ b/gcc/testsuite/gcc.target/i386/pr112600-5a.h
> @@ -0,0 +1,22 @@
> +#ifndef HAVE_DEFINED_PR112600_5A_H
> +#define HAVE_DEFINED_PR112600_5A_H
> +
> +#include <stdint.h>
> +
> +#define DEF_SAT_ADD(T)   \
> +T sat_add_##T (T x, T y) \
> +{                        \
> +  T res;                 \
> +  res = x + y;           \
> +  res |= -(T)(res < x);  \
> +  return res;            \
> +}
> +
> +#define VEC_DEF_SAT_ADD(T)                       \
> +void vec_sat_add(T * restrict a, T * restrict b) \
> +{                                                \
> +  for (int i = 0; i < 16; i++)                   \
> +    b[i] = sat_add_##T (a[i], b[i]);             \
> +}
> +
> +#endif
> --
> 2.43.0
>
  
Li, Pan2 Nov. 27, 2024, 2:25 p.m. UTC | #2
> OK, but please drop "a" suffix from new files (when I added original
> pr112600 testcases, "a" suffix was for char type and "b" was for short
> type, but this is not the case with your testcases).

Thanks Uros, will commit with these changes if testing shows no surprises.

Pan


Patch

diff --git a/gcc/testsuite/gcc.target/i386/pr112600-5a-u16.c b/gcc/testsuite/gcc.target/i386/pr112600-5a-u16.c
new file mode 100644
index 00000000000..f462bfa4800
--- /dev/null
+++ b/gcc/testsuite/gcc.target/i386/pr112600-5a-u16.c
@@ -0,0 +1,10 @@ 
+/* PR target/112600 */
+/* { dg-do compile } */
+/* { dg-options "-O2 -msse2 -fdump-tree-optimized" } */
+
+#include "pr112600-5a.h"
+
+DEF_SAT_ADD (uint16_t)
+VEC_DEF_SAT_ADD (uint16_t)
+
+/* { dg-final { scan-tree-dump-times ".SAT_ADD " 3 "optimized" } } */
diff --git a/gcc/testsuite/gcc.target/i386/pr112600-5a-u32.c b/gcc/testsuite/gcc.target/i386/pr112600-5a-u32.c
new file mode 100644
index 00000000000..5797c97ebe9
--- /dev/null
+++ b/gcc/testsuite/gcc.target/i386/pr112600-5a-u32.c
@@ -0,0 +1,9 @@ 
+/* PR target/112600 */
+/* { dg-do compile } */
+/* { dg-options "-O2 -msse2 -fdump-tree-optimized" } */
+
+#include "pr112600-5a.h"
+
+DEF_SAT_ADD (uint32_t)
+
+/* { dg-final { scan-tree-dump-times ".SAT_ADD " 1 "optimized" } } */
diff --git a/gcc/testsuite/gcc.target/i386/pr112600-5a-u64.c b/gcc/testsuite/gcc.target/i386/pr112600-5a-u64.c
new file mode 100644
index 00000000000..d5f81f72ed5
--- /dev/null
+++ b/gcc/testsuite/gcc.target/i386/pr112600-5a-u64.c
@@ -0,0 +1,10 @@ 
+/* PR target/112600 */
+/* { dg-do compile  { target { ! ia32 } } } */
+/* { dg-options "-O2 -msse2 -fdump-tree-optimized" } */
+
+#include "pr112600-5a.h"
+
+DEF_SAT_ADD (uint64_t)
+
+
+/* { dg-final { scan-tree-dump-times ".SAT_ADD " 1 "optimized" } } */
diff --git a/gcc/testsuite/gcc.target/i386/pr112600-5a-u8.c b/gcc/testsuite/gcc.target/i386/pr112600-5a-u8.c
new file mode 100644
index 00000000000..cb8657ecd86
--- /dev/null
+++ b/gcc/testsuite/gcc.target/i386/pr112600-5a-u8.c
@@ -0,0 +1,10 @@ 
+/* PR target/112600 */
+/* { dg-do compile } */
+/* { dg-options "-O2 -msse2 -fdump-tree-optimized" } */
+
+#include "pr112600-5a.h"
+
+DEF_SAT_ADD (uint8_t)
+VEC_DEF_SAT_ADD (uint8_t)
+
+/* { dg-final { scan-tree-dump-times ".SAT_ADD " 2 "optimized" } } */
diff --git a/gcc/testsuite/gcc.target/i386/pr112600-5a.h b/gcc/testsuite/gcc.target/i386/pr112600-5a.h
new file mode 100644
index 00000000000..482c865e953
--- /dev/null
+++ b/gcc/testsuite/gcc.target/i386/pr112600-5a.h
@@ -0,0 +1,22 @@ 
+#ifndef HAVE_DEFINED_PR112600_5A_H
+#define HAVE_DEFINED_PR112600_5A_H
+
+#include <stdint.h>
+
+#define DEF_SAT_ADD(T)   \
+T sat_add_##T (T x, T y) \
+{                        \
+  T res;                 \
+  res = x + y;           \
+  res |= -(T)(res < x);  \
+  return res;            \
+}
+
+#define VEC_DEF_SAT_ADD(T)                       \
+void vec_sat_add(T * restrict a, T * restrict b) \
+{                                                \
+  for (int i = 0; i < 16; i++)                   \
+    b[i] = sat_add_##T (a[i], b[i]);             \
+}
+
+#endif
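
The tests in the patch are compile-only and check the optimized tree dump.
As a complementary runtime sanity check, the following sketch is one way to
exercise the vector loop.  It is not part of the patch.  It copies the two
macros from the header above, and count_mismatches is a hypothetical helper
that compares each of the 16 lanes with a plain widen-and-clamp reference.

```c
#include <assert.h>
#include <stdint.h>

/* Macros copied from the header in the patch.  */
#define DEF_SAT_ADD(T)   \
T sat_add_##T (T x, T y) \
{                        \
  T res;                 \
  res = x + y;           \
  res |= -(T)(res < x);  \
  return res;            \
}

#define VEC_DEF_SAT_ADD(T)                       \
void vec_sat_add(T * restrict a, T * restrict b) \
{                                                \
  for (int i = 0; i < 16; i++)                   \
    b[i] = sat_add_##T (a[i], b[i]);             \
}

DEF_SAT_ADD (uint8_t)
VEC_DEF_SAT_ADD (uint8_t)

/* Hypothetical helper: run the loop once and return the number of lanes
   that differ from the clamped reference.  */
int count_mismatches (void)
{
  uint8_t a[16], b[16], expect[16];
  int bad = 0;

  for (int i = 0; i < 16; i++)
    {
      a[i] = (uint8_t) (i * 17);       /* 0, 17, ..., 255.  */
      b[i] = (uint8_t) (255 - i * 5);  /* 255, 250, ..., 180.  */
      int sum = a[i] + b[i];           /* Computed in int, so no wrap.  */
      expect[i] = sum > UINT8_MAX ? UINT8_MAX : (uint8_t) sum;
    }

  vec_sat_add (a, b);

  for (int i = 0; i < 16; i++)
    bad += (b[i] != expect[i]);
  return bad;
}
```

A driver can assert that count_mismatches () returns 0.  This says nothing
about whether the loop was vectorized; that is what the scan-tree-dump-times
directives in the tests cover.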