[AArch64] ILP32 math changes

Message ID

1504041387.5204.20.camel@cavium.com

State

New, archived

Headers

Mailing-List: contact libc-alpha-help@sourceware.org; run by ezmlm
Precedence: bulk
Sender: libc-alpha-owner@sourceware.org
Message-ID: <1504041387.5204.20.camel@cavium.com>
Subject: [PATCH] [AArch64]  ILP32 math changes
From: Steve Ellcey <sellcey@cavium.com>
Reply-To: sellcey@cavium.com
To: libc-alpha <libc-alpha@sourceware.org>
Cc: Szabolcs Nagy <szabolcs.nagy@arm.com>, rth@twiddle.net, Joseph Myers
	<joseph@codesourcery.com>, Wilco Dijkstra <Wilco.Dijkstra@arm.com>, nd
	<nd@arm.com>
Date: Tue, 29 Aug 2017 14:16:27 -0700
Content-Type: multipart/mixed; boundary="=-oP8RJdvOHSaWUrc2/8hU"
Mime-Version: 1.0
Received-SPF: None (protection.outlook.com: cavium.com does not designate
	permitted sender hosts)
SpamDiagnosticOutput: 1:99
SpamDiagnosticMetadata: NSPM
X-MS-Exchange-CrossTenant-OriginalArrivalTime: 29 Aug 2017 21:16:29.8308
	(UTC)
X-MS-Exchange-CrossTenant-FromEntityHeader: Hosted
X-MS-Exchange-Transport-CrossTenantHeadersStamped: MWHPR07MB3549

Commit Message

Steve Ellcey Aug. 29, 2017, 9:16 p.m. UTC

  Since Szabolcs has expressed interest in getting some aarch64 ILP32
changes into glibc mainline (if they don't affect the kernel or glibc
ABI) I am resubmitting this patch for approval in advance of the main
ILP32 support patches.  I updated it with Richard Henderson's code (but
only doing it when the incoming argument is large).  I tested it on
aarch64 ILP32 and LP64 with no regressions.

Is this something that can be checked in now without waiting for the
kernel ILP32 support?

Steve Ellcey
sellcey@cavium.com


2017-08-29  Steve Ellcey  <sellcey@cavium.com>
	    Richard Henderson <rth@twiddle.net>

	* sysdeps/aarch64/fpu/s_llrint.c (OREG_SIZE): New macro.
	* sysdeps/aarch64/fpu/s_llround.c (OREG_SIZE): Likewise.
	* sysdeps/aarch64/fpu/s_llrintf.c (OREGS, IREGS): Remove.
	(IREG_SIZE, OREG_SIZE): New macros.
	* sysdeps/aarch64/fpu/s_llroundf.c (OREGS, IREGS): Remove.
	(IREG_SIZE, OREG_SIZE): New macros.
	* sysdeps/aarch64/fpu/s_lrintf.c (IREGS): Remove.
	(IREG_SIZE): New macro.
	* sysdeps/aarch64/fpu/s_lroundf.c (IREGS): Remove.
	(IREG_SIZE): New macro.
	* sysdeps/aarch64/fpu/s_lrint.c (get-rounding-mode.h, stdint.h):
	New includes.
	(IREG_SIZE, OREG_SIZE): Initialize if not already set.
	(OREGS, IREGS): Set based on IREG_SIZE and OREG_SIZE.
	(__CONCATX): Handle exceptions correctly on large values that may
	set FE_INVALID.
	* sysdeps/aarch64/fpu/s_lround.c (IREG_SIZE, OREG_SIZE):
	Initialize if not already set.
	(OREGS, IREGS): Set based on IREG_SIZE and OREG_SIZE.
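
Note (illustration only, not part of the patch): the ChangeLog entries above replace the per-file IREGS/OREGS strings with IREG_SIZE/OREG_SIZE values, from which s_lrint.c and s_lround.c derive the register prefixes. A minimal sketch of that mapping, assuming the values that s_llrintf.c sets (32-bit input, 64-bit output); conversion_template and check_template are hypothetical names used only here:

```c
#include <assert.h>
#include <string.h>

/* Illustrative values only: these match what s_llrintf.c defines in the
   patch (float input, long long int output).  */
#define IREG_SIZE 32
#define OREG_SIZE 64

/* Same selection logic as the patch: "s"/"d" name the 32/64-bit FP
   registers, "w"/"x" the 32/64-bit general registers.  */
#if IREG_SIZE == 32
# define IREGS "s"
#else
# define IREGS "d"
#endif

#if OREG_SIZE == 32
# define OREGS "w"
#else
# define OREGS "x"
#endif

/* Hypothetical helper: returns the asm template that the adjacent string
   literals concatenate to.  */
const char *
conversion_template (void)
{
  return "fcvtzs" "\t%" OREGS "0, %" IREGS "1";
}

/* Hypothetical self-check for the configuration chosen above.  */
void
check_template (void)
{
  assert (strcmp (conversion_template (), "fcvtzs\t%x0, %s1") == 0);
}
```

With OREG_SIZE set to 32 instead, as the patch does for long int under __ILP32__, the same template would name a "w" register.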

Comments

Szabolcs Nagy Aug. 31, 2017, 4:22 p.m. UTC | #1

On 29/08/17 22:16, Steve Ellcey wrote:
> Since Szabolcs has expressed interest in getting some aarch64 ILP32
> changes into glibc mainline (if they don't affect the kernel or glibc
> ABI) I am resubmitting this patch for approval in advance of the main
> ILP32 support patches.  I updated it with Richard Henderson's code (but
> only doing it when the incoming argument is large).  I tested it on
> aarch64 ILP32 and LP64 with no regressions.
> 
> Is this something that can be checked in now without waiting for the
> kernel ILP32 support?
> 

OK.

(we plan to change the math functions to use gcc builtins
in some places instead of asm which will simplify things,
but that can go in later on top of this.)

> Steve Ellcey
> sellcey@cavium.com
> 
> 
> 2017-08-29  Steve Ellcey  <sellcey@cavium.com>
> 	    Richard Henderson <rth@twiddle.net>
> 
> 	* sysdeps/aarch64/fpu/s_llrint.c (OREG_SIZE): New macro.
> 	* sysdeps/aarch64/fpu/s_llround.c (OREG_SIZE): Likewise.
> 	* sysdeps/aarch64/fpu/s_llrintf.c (OREGS, IREGS): Remove.
> 	(IREG_SIZE, OREG_SIZE): New macros.
> 	* sysdeps/aarch64/fpu/s_llroundf.c (OREGS, IREGS): Remove.
> 	(IREG_SIZE, OREG_SIZE): New macros.
> 	* sysdeps/aarch64/fpu/s_lrintf.c (IREGS): Remove.
> 	(IREG_SIZE): New macro.
> 	* sysdeps/aarch64/fpu/s_lroundf.c (IREGS): Remove.
> 	(IREG_SIZE): New macro.
> 	* sysdeps/aarch64/fpu/s_lrint.c (get-rounding-mode.h, stdint.h):
> 	New includes.
> 	(IREG_SIZE, OREG_SIZE): Initialize if not already set.
> 	(OREGS, IREGS): Set based on IREG_SIZE and OREG_SIZE.
> 	(__CONCATX): Handle exceptions correctly on large values that may
> 	set FE_INVALID.
> 	* sysdeps/aarch64/fpu/s_lround.c (IREG_SIZE, OREG_SIZE):
> 	Initialize if not already set.
>         (OREGS, IREGS): Set based on IREG_SIZE and OREG_SIZE.
>

Patch

diff --git a/sysdeps/aarch64/fpu/s_llrint.c b/sysdeps/aarch64/fpu/s_llrint.c
index c0d0d0e..57821c0 100644
--- a/sysdeps/aarch64/fpu/s_llrint.c
+++ b/sysdeps/aarch64/fpu/s_llrint.c
@@ -18,4 +18,5 @@ 
 
 #define FUNC llrint
 #define OTYPE long long int
+#define OREG_SIZE 64
 #include <s_lrint.c>
diff --git a/sysdeps/aarch64/fpu/s_llrintf.c b/sysdeps/aarch64/fpu/s_llrintf.c
index 67724c6..98ed4f8 100644
--- a/sysdeps/aarch64/fpu/s_llrintf.c
+++ b/sysdeps/aarch64/fpu/s_llrintf.c
@@ -18,6 +18,7 @@ 
 
 #define FUNC llrintf
 #define ITYPE float
-#define IREGS "s"
+#define IREG_SIZE 32
 #define OTYPE long long int
+#define OREG_SIZE 64
 #include <s_lrint.c>
diff --git a/sysdeps/aarch64/fpu/s_llround.c b/sysdeps/aarch64/fpu/s_llround.c
index ed4b192..ef7aedf 100644
--- a/sysdeps/aarch64/fpu/s_llround.c
+++ b/sysdeps/aarch64/fpu/s_llround.c
@@ -18,4 +18,5 @@ 
 
 #define FUNC llround
 #define OTYPE long long int
+#define OREG_SIZE 64
 #include <s_lround.c>
diff --git a/sysdeps/aarch64/fpu/s_llroundf.c b/sysdeps/aarch64/fpu/s_llroundf.c
index 360ce8b..294f0f4 100644
--- a/sysdeps/aarch64/fpu/s_llroundf.c
+++ b/sysdeps/aarch64/fpu/s_llroundf.c
@@ -18,6 +18,7 @@ 
 
 #define FUNC llroundf
 #define ITYPE float
-#define IREGS "s"
+#define IREG_SIZE 32
 #define OTYPE long long int
+#define OREG_SIZE 64
 #include <s_lround.c>
diff --git a/sysdeps/aarch64/fpu/s_lrint.c b/sysdeps/aarch64/fpu/s_lrint.c
index 8c61a03..6ef64e2 100644
--- a/sysdeps/aarch64/fpu/s_lrint.c
+++ b/sysdeps/aarch64/fpu/s_lrint.c
@@ -17,6 +17,8 @@ 
    <http://www.gnu.org/licenses/>.  */
 
 #include <math.h>
+#include <get-rounding-mode.h>
+#include <stdint.h>
 
 #ifndef FUNC
 # define FUNC lrint
@@ -24,18 +26,37 @@ 
 
 #ifndef ITYPE
 # define ITYPE double
-# define IREGS "d"
+# define IREG_SIZE 64
 #else
-# ifndef IREGS
-#  error IREGS not defined
+# ifndef IREG_SIZE
+#  error IREG_SIZE not defined
 # endif
 #endif
 
 #ifndef OTYPE
 # define OTYPE long int
+# ifdef __ILP32__
+#  define OREG_SIZE 32
+# else
+#  define OREG_SIZE 64
+# endif
+#else
+# ifndef OREG_SIZE
+#  error OREG_SIZE not defined
+# endif
 #endif
 
-#define OREGS "x"
+#if IREG_SIZE == 32
+# define IREGS "s"
+#else
+# define IREGS "d"
+#endif
+
+#if OREG_SIZE == 32
+# define OREGS "w"
+#else
+# define OREGS "x"
+#endif
 
 #define __CONCATX(a,b) __CONCAT(a,b)
 
@@ -44,6 +65,37 @@  __CONCATX(__,FUNC) (ITYPE x)
 {
   OTYPE result;
   ITYPE temp;
+
+#if IREG_SIZE == 64 && OREG_SIZE == 32
+  if (__builtin_fabs (x) > INT32_MAX)
+    {
+      /* Converting large values to a 32-bit int may cause the frintx/fcvtzs
+	 sequence to set both FE_INVALID and FE_INEXACT.  To avoid this,
+	 check the rounding mode and use a single conversion instruction
+	 with the appropriate rounding mode.  */
+
+      switch (get_rounding_mode ())
+	{
+	case FE_TONEAREST:
+	  asm volatile ("fcvtns" "\t%" OREGS "0, %" IREGS "1"
+			: "=r" (result) : "w" (x));
+	  break;
+	case FE_UPWARD:
+	  asm volatile ("fcvtps" "\t%" OREGS "0, %" IREGS "1"
+			: "=r" (result) : "w" (x));
+	  break;
+	case FE_DOWNWARD:
+	  asm volatile ("fcvtms" "\t%" OREGS "0, %" IREGS "1"
+			: "=r" (result) : "w" (x));
+	  break;
+	case FE_TOWARDZERO:
+	default:
+	  asm volatile ("fcvtzs" "\t%" OREGS "0, %" IREGS "1"
+			: "=r" (result) : "w" (x));
+	}
+      return result;
+    }
+#endif
   asm ( "frintx" "\t%" IREGS "1, %" IREGS "2\n\t"
         "fcvtzs" "\t%" OREGS "0, %" IREGS "1"
         : "=r" (result), "=w" (temp) : "w" (x) );
diff --git a/sysdeps/aarch64/fpu/s_lrintf.c b/sysdeps/aarch64/fpu/s_lrintf.c
index a995e4b..2e73271 100644
--- a/sysdeps/aarch64/fpu/s_lrintf.c
+++ b/sysdeps/aarch64/fpu/s_lrintf.c
@@ -18,5 +18,5 @@ 
 
 #define FUNC lrintf
 #define ITYPE float
-#define IREGS "s"
+#define IREG_SIZE 32
 #include <s_lrint.c>
diff --git a/sysdeps/aarch64/fpu/s_lround.c b/sysdeps/aarch64/fpu/s_lround.c
index 9be9e7f..1f77d82 100644
--- a/sysdeps/aarch64/fpu/s_lround.c
+++ b/sysdeps/aarch64/fpu/s_lround.c
@@ -24,18 +24,37 @@ 
 
 #ifndef ITYPE
 # define ITYPE double
-# define IREGS "d"
+# define IREG_SIZE 64
 #else
-# ifndef IREGS
-#  error IREGS not defined
+# ifndef IREG_SIZE
+#  error IREG_SIZE not defined
 # endif
 #endif
 
 #ifndef OTYPE
 # define OTYPE long int
+# ifdef __ILP32__
+#  define OREG_SIZE 32
+# else
+#  define OREG_SIZE 64
+# endif
+#else
+# ifndef OREG_SIZE
+#  error OREG_SIZE not defined
+# endif
+#endif
+
+#if IREG_SIZE == 32
+# define IREGS "s"
+#else
+# define IREGS "d"
 #endif
 
-#define OREGS "x"
+#if OREG_SIZE == 32
+# define OREGS "w"
+#else
+# define OREGS "x"
+#endif
 
 #define __CONCATX(a,b) __CONCAT(a,b)
 
diff --git a/sysdeps/aarch64/fpu/s_lroundf.c b/sysdeps/aarch64/fpu/s_lroundf.c
index 4a066d4..b30ddb6 100644
--- a/sysdeps/aarch64/fpu/s_lroundf.c
+++ b/sysdeps/aarch64/fpu/s_lroundf.c
@@ -18,5 +18,5 @@ 
 
 #define FUNC lroundf
 #define ITYPE float
-#define IREGS "s"
+#define IREG_SIZE 32
 #include <s_lround.c>
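
Note (illustration only, not part of the patch): the large-argument path added to s_lrint.c picks one conversion instruction per rounding mode instead of the frintx/fcvtzs pair. The sketch below restates that dispatch in portable C so it can be read and checked without an AArch64 toolchain. The helper names takes_large_path, convert_insn_for_mode and check_dispatch are hypothetical, and the mode is passed in as a parameter, whereas the patch obtains it from get_rounding_mode ().

```c
#include <assert.h>
#include <fenv.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical helper: true when a double argument would take the
   single-instruction path, i.e. the same magnitude test the patch uses
   when IREG_SIZE == 64 and OREG_SIZE == 32.  */
int
takes_large_path (double x)
{
  return __builtin_fabs (x) > INT32_MAX;
}

/* Hypothetical helper: the mnemonic the switch in the patch selects for
   a given rounding mode.  Unknown modes fall back to fcvtzs, as in the
   patch's default case.  */
const char *
convert_insn_for_mode (int mode)
{
  switch (mode)
    {
    case FE_TONEAREST:
      return "fcvtns";
    case FE_UPWARD:
      return "fcvtps";
    case FE_DOWNWARD:
      return "fcvtms";
    case FE_TOWARDZERO:
    default:
      return "fcvtzs";
    }
}

/* Hypothetical self-check of the two helpers above.  */
void
check_dispatch (void)
{
  assert (strcmp (convert_insn_for_mode (FE_DOWNWARD), "fcvtms") == 0);
  assert (!takes_large_path (2147483647.0));
  assert (takes_large_path (-2147483649.0));
}
```

Values whose magnitude is at most INT32_MAX never reach the switch and keep using the existing frintx/fcvtzs sequence.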