From patchwork Mon Oct  3 14:18:02 2022
Content-Type: text/plain; charset="utf-8"
MIME-Version: 1.0
Content-Transfer-Encoding: 7bit
X-Patchwork-Submitter: Adhemerval Zanella <adhemerval.zanella@linaro.org>
X-Patchwork-Id: 58301
Return-Path: <libc-alpha-bounces+patchwork=sourceware.org@sourceware.org>
X-Original-To: patchwork@sourceware.org
Delivered-To: patchwork@sourceware.org
Received: from server2.sourceware.org (localhost [IPv6:::1])
	by sourceware.org (Postfix) with ESMTP id A7DFC385AC1B
	for <patchwork@sourceware.org>; Mon,  3 Oct 2022 14:18:43 +0000 (GMT)
DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org A7DFC385AC1B
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=sourceware.org;
	s=default; t=1664806723;
	bh=tt2OaEGDeKEBdw5YWp0f49lps5B5VigWrL9Uan5NhbU=;
	h=To:Subject:Date:List-Id:List-Unsubscribe:List-Archive:List-Post:
	 List-Help:List-Subscribe:From:Reply-To:From;
	b=ArWga6vyYHcsN022fR9XluK1baoauQCb2mWUlB/tUTDwFCgxvycNnMazgu13+NYLY
	 yZAxOHsuB1FRGnYgKAKN5qyyiMessKvHOXh4c+r2tUPfrzLvcu81tfoW1sBPTxHU1/
	 haX+/JbN7MRrSL0sXbzNKol/lUSSoHrudDow8/ME=
X-Original-To: libc-alpha@sourceware.org
Delivered-To: libc-alpha@sourceware.org
Received: from mail-ua1-x92e.google.com (mail-ua1-x92e.google.com
 [IPv6:2607:f8b0:4864:20::92e])
 by sourceware.org (Postfix) with ESMTPS id D16273858D38
 for <libc-alpha@sourceware.org>; Mon,  3 Oct 2022 14:18:12 +0000 (GMT)
DMARC-Filter: OpenDMARC Filter v1.4.1 sourceware.org D16273858D38
Received: by mail-ua1-x92e.google.com with SMTP id i16so549856uak.1
 for <libc-alpha@sourceware.org>; Mon, 03 Oct 2022 07:18:12 -0700 (PDT)
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;
 d=1e100.net; s=20210112;
 h=content-transfer-encoding:mime-version:message-id:date:subject:to
 :from:x-gm-message-state:from:to:cc:subject:date;
 bh=tt2OaEGDeKEBdw5YWp0f49lps5B5VigWrL9Uan5NhbU=;
 b=jWujxUfN5ADQBhBmPogLrgTuBatcGibueWgQpOTbGOiKtPaTAURUezX4rR9JRfEcYv
 Zd13bosJDMvdOdsYAJZNS/XstaeHbjHHguttVa+E8Ec2vLPs87hXeWkNhL3Tb9o57Im0
 Gk4KYYwqF/f/snBfZuk1fBVBImp8B9ruQwEZf0zR4X1/Eu49OAsAJ4xb5zVgSJmYOKyX
 s5X+ZNwOr9vN5J0TxBIkZL8MfvHXjLlnvgeQ6QTKTr482KycrhhYRtVIrVlM/4SFu/XD
 efwjH+VBFvhZAkwiA+w0tiPsam+J4cYhR5NqU1DRcLqzTp/BJgBh1/rAsttZX2L+VGd/
 7J8g==
X-Gm-Message-State: ACrzQf0zL1kg6brOtj5Xs+odwJE1hqrPnzGsS66nxCEpLCYkFtxysv5C
 vFwSP0Efia/kwrZ+GsGofPK2oRaZ5JofzqmB
X-Google-Smtp-Source: 
 AMsMyM4RqPszcHs2YI536yENKk/uwMs7jdJ9hAioLm0RxStdpLVbKIVpEmfeUWcewwqErPH2ld9qQQ==
X-Received: by 2002:a9f:3767:0:b0:3d1:2de3:12a8 with SMTP id
 a36-20020a9f3767000000b003d12de312a8mr9985634uae.47.1664806691036;
 Mon, 03 Oct 2022 07:18:11 -0700 (PDT)
Received: from mandiga.. ([2804:1b3:a7c2:b87d:9b82:360a:4a69:3f62])
 by smtp.gmail.com with ESMTPSA id
 m83-20020a1fa356000000b003aa6e234181sm1802685vke.19.2022.10.03.07.18.09
 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256);
 Mon, 03 Oct 2022 07:18:10 -0700 (PDT)
To: libc-alpha@sourceware.org, Fangrui Song <maskray@google.com>,
 "H . J . Lu" <hjl.tools@gmail.com>,
 Noah Goldstein <goldstein.w.n@gmail.com>
Subject: [PATCH v2] x86: Remove .tfloat usage
Date: Mon,  3 Oct 2022 11:18:02 -0300
Message-Id: <20221003141802.281647-1-adhemerval.zanella@linaro.org>
X-Mailer: git-send-email 2.34.1
MIME-Version: 1.0
X-Spam-Status: No, score=-12.7 required=5.0 tests=BAYES_00, DKIM_SIGNED,
 DKIM_VALID, DKIM_VALID_AU, DKIM_VALID_EF, GIT_PATCH_0, KAM_LOTSOFHASH,
 RCVD_IN_DNSWL_NONE, SPF_HELO_NONE, SPF_PASS,
 TXREP autolearn=ham autolearn_force=no version=3.4.6
X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on
 server2.sourceware.org
X-BeenThere: libc-alpha@sourceware.org
X-Mailman-Version: 2.1.29
Precedence: list
List-Id: Libc-alpha mailing list <libc-alpha.sourceware.org>
List-Unsubscribe: <https://sourceware.org/mailman/options/libc-alpha>,
 <mailto:libc-alpha-request@sourceware.org?subject=unsubscribe>
List-Archive: <https://sourceware.org/pipermail/libc-alpha/>
List-Post: <mailto:libc-alpha@sourceware.org>
List-Help: <mailto:libc-alpha-request@sourceware.org?subject=help>
List-Subscribe: <https://sourceware.org/mailman/listinfo/libc-alpha>,
 <mailto:libc-alpha-request@sourceware.org?subject=subscribe>
X-Patchwork-Original-From: Adhemerval Zanella via Libc-alpha
 <libc-alpha@sourceware.org>
From: Adhemerval Zanella <adhemerval.zanella@linaro.org>
Reply-To: Adhemerval Zanella <adhemerval.zanella@linaro.org>
Errors-To: libc-alpha-bounces+patchwork=sourceware.org@sourceware.org
Sender: "Libc-alpha"
 <libc-alpha-bounces+patchwork=sourceware.org@sourceware.org>

This is what I intend to commit, I saw no regression or changes is 
generated code.
Reviewed-by: Fangrui Song <maskray@google.com>
---

Some compiler does not support it (such as clang integrated assembler)
neither gcc emits it.
---
 sysdeps/i386/fpu/e_atanh.S    |  3 ++-
 sysdeps/i386/fpu/e_atanhf.S   |  3 ++-
 sysdeps/i386/fpu/e_atanhl.S   |  3 ++-
 sysdeps/i386/fpu/s_asinhl.S   |  3 ++-
 sysdeps/i386/fpu/s_cbrtl.S    | 49 ++++++++++++++++++++++-------------
 sysdeps/i386/fpu/s_expm1.S    |  3 ++-
 sysdeps/i386/fpu/s_expm1f.S   |  3 ++-
 sysdeps/i386/fpu/s_log1pl.S   |  3 ++-
 sysdeps/x86_64/fpu/s_log1pl.S |  3 ++-
 9 files changed, 47 insertions(+), 26 deletions(-)

diff --git a/sysdeps/i386/fpu/e_atanh.S b/sysdeps/i386/fpu/e_atanh.S
index 6e4fef06b2..74d8f0e083 100644
--- a/sysdeps/i386/fpu/e_atanh.S
+++ b/sysdeps/i386/fpu/e_atanh.S
@@ -33,7 +33,8 @@ one:	.double 1.0
 limit:	.double 0.29
 	ASM_SIZE_DIRECTIVE(limit)
 	.type ln2_2,@object
-ln2_2:	.tfloat 0.3465735902799726547086160
+ln2_2:	.quad  0xb17217f7d1cf79ac /* 0.3465735902799726547086160  */
+	.short 0x3ffd
 	ASM_SIZE_DIRECTIVE(ln2_2)
 
 DEFINE_DBL_MIN
diff --git a/sysdeps/i386/fpu/e_atanhf.S b/sysdeps/i386/fpu/e_atanhf.S
index 146196eced..1803f55735 100644
--- a/sysdeps/i386/fpu/e_atanhf.S
+++ b/sysdeps/i386/fpu/e_atanhf.S
@@ -34,7 +34,8 @@ limit:	.double 0.29
 	ASM_SIZE_DIRECTIVE(limit)
 	.align ALIGNARG(4)
 	.type ln2_2,@object
-ln2_2:	.tfloat 0.3465735902799726547086160
+ln2_2:	.quad   0xb17217f7d1cf79ac  /* 0.3465735902799726547086160  */
+	.short  0x3ffd
 	ASM_SIZE_DIRECTIVE(ln2_2)
 
 DEFINE_FLT_MIN
diff --git a/sysdeps/i386/fpu/e_atanhl.S b/sysdeps/i386/fpu/e_atanhl.S
index 1f6eb7ce48..df3f1b8f84 100644
--- a/sysdeps/i386/fpu/e_atanhl.S
+++ b/sysdeps/i386/fpu/e_atanhl.S
@@ -39,7 +39,8 @@ limit:	.double 0.29
 	ASM_SIZE_DIRECTIVE(limit)
 	.align ALIGNARG(4)
 	.type ln2_2,@object
-ln2_2:	.tfloat 0.3465735902799726547086160
+ln2_2:	.quad   0xb17217f7d1cf79ac  /* 0.3465735902799726547086160  */
+	.short  0x3ffd
 	ASM_SIZE_DIRECTIVE(ln2_2)
 
 #ifdef PIC
diff --git a/sysdeps/i386/fpu/s_asinhl.S b/sysdeps/i386/fpu/s_asinhl.S
index bd442c6a09..f4f420d060 100644
--- a/sysdeps/i386/fpu/s_asinhl.S
+++ b/sysdeps/i386/fpu/s_asinhl.S
@@ -23,7 +23,8 @@
 
 	.align ALIGNARG(4)
 	.type huge,@object
-huge:	.tfloat 1e+4930
+huge:	.quad   0x89b634e7456ffa1d  /* 1e+4930  */
+	.short  0x7ff8
 	ASM_SIZE_DIRECTIVE(huge)
 	.align ALIGNARG(4)
 	/* Please note that we use double value for 1.0.  This number
diff --git a/sysdeps/i386/fpu/s_cbrtl.S b/sysdeps/i386/fpu/s_cbrtl.S
index 8802164706..23cc308e3c 100644
--- a/sysdeps/i386/fpu/s_cbrtl.S
+++ b/sysdeps/i386/fpu/s_cbrtl.S
@@ -23,55 +23,68 @@
 
         .align ALIGNARG(4)
         .type f8,@object
-f8:	.tfloat 0.161617097923756032
+f8:	.quad   0xa57ef3d83a542839  /* 0.161617097923756032  */
+	.short  0x3ffc
 	ASM_SIZE_DIRECTIVE(f8)
         .align ALIGNARG(4)
         .type f7,@object
-f7:	.tfloat -0.988553671195413709
+f7:	.quad   0xfd11da7820029014  /* -0.988553671195413709  */
+	.short  0xbffe
 	ASM_SIZE_DIRECTIVE(f7)
         .align ALIGNARG(4)
         .type f6,@object
-f6:	.tfloat 2.65298938441952296
+f6:	.quad   0xa9ca93fcade3b4ad  /* 2.65298938441952296  */
+	.short  0x4000
 	ASM_SIZE_DIRECTIVE(f6)
         .align ALIGNARG(4)
         .type f5,@object
-f5:	.tfloat -4.11151425200350531
+f5:	.quad   0x839186562c931c34  /* -4.11151425200350531  */
+	.short  0xc001
 	ASM_SIZE_DIRECTIVE(f5)
         .align ALIGNARG(4)
         .type f4,@object
-f4:	.tfloat 4.09559907378707839
+f4:	.quad   0x830f25c9ee304594  /* 4.09559907378707839  */
+	.short  0x4001
 	ASM_SIZE_DIRECTIVE(f4)
         .align ALIGNARG(4)
         .type f3,@object
-f3:	.tfloat -2.82414939754975962
+f3:	.quad   0xb4bedd1d5fa2f0c6  /* -2.82414939754975962  */
+	.short  0xc000
 	ASM_SIZE_DIRECTIVE(f3)
         .align ALIGNARG(4)
         .type f2,@object
-f2:	.tfloat 1.67595307700780102
+f2:	.quad   0xd685a163b08586e3  /* 1.67595307700780102  */
+	.short  0x3fff
 	ASM_SIZE_DIRECTIVE(f2)
         .align ALIGNARG(4)
         .type f1,@object
-f1:	.tfloat 0.338058687610520237
+f1:	.quad   0xad16073ed4ec3b45  /* 0.338058687610520237  */
+	.short  0x3ffd
 	ASM_SIZE_DIRECTIVE(f1)
 
-#define CBRT2		1.2599210498948731648
-#define ONE_CBRT2	0.793700525984099737355196796584
-#define SQR_CBRT2	1.5874010519681994748
-#define ONE_SQR_CBRT2	0.629960524947436582364439673883
-
 	/* We make the entries in the following table all 16 bytes
 	   wide to avoid having to implement a multiplication by 10.  */
 	.type factor,@object
         .align ALIGNARG(4)
-factor:	.tfloat ONE_SQR_CBRT2
+factor:	/* 1.0 / cbrt (2.0) ^ 2 / 0.629960524947436582364439673883  */
+	.quad 0xa14517cc6b945711
+	.short 0x3ffe
 	.byte 0, 0, 0, 0, 0, 0
-	.tfloat ONE_CBRT2
+	/* 1.0 / cbrt (2.0) / 0.793700525984099737355196796584  */
+	.quad 0xcb2ff529eb71e415
+	.short 0x3ffe
 	.byte 0, 0, 0, 0, 0, 0
-	.tfloat 1.0
+	/* 1.0L  */
+	.quad 0x8000000000000000
+	.short 0x3fff
 	.byte 0, 0, 0, 0, 0, 0
-	.tfloat CBRT2
+	/* cbrt (2.0) / 1.2599210498948731648  */
+	.quad 0xa14517cc6b945711
+	.short 0x3fff
 	.byte 0, 0, 0, 0, 0, 0
-	.tfloat SQR_CBRT2
+	/* cbrt (2.0) ^ 2 / 1.5874010519681994748  */
+	.quad 0xcb2ff529eb71e416
+	.short 0x3fff
 	ASM_SIZE_DIRECTIVE(factor)
 
         .type two64,@object
diff --git a/sysdeps/i386/fpu/s_expm1.S b/sysdeps/i386/fpu/s_expm1.S
index 7199d681ba..038ff72feb 100644
--- a/sysdeps/i386/fpu/s_expm1.S
+++ b/sysdeps/i386/fpu/s_expm1.S
@@ -33,7 +33,8 @@ minus1:	.double -1.0
 one:	.double 1.0
 	ASM_SIZE_DIRECTIVE(one)
 	.type l2e,@object
-l2e:	.tfloat 1.442695040888963407359924681002
+l2e:	.quad   0xb8aa3b295c17f0bc  /* 1.442695040888963407359924681002 */
+	.short  0x3fff
 	ASM_SIZE_DIRECTIVE(l2e)
 
 DEFINE_DBL_MIN
diff --git a/sysdeps/i386/fpu/s_expm1f.S b/sysdeps/i386/fpu/s_expm1f.S
index 04c37bda1b..59a2bb81ba 100644
--- a/sysdeps/i386/fpu/s_expm1f.S
+++ b/sysdeps/i386/fpu/s_expm1f.S
@@ -33,7 +33,8 @@ minus1:	.double -1.0
 one:	.double 1.0
 	ASM_SIZE_DIRECTIVE(one)
 	.type l2e,@object
-l2e:	.tfloat 1.442695040888963407359924681002
+l2e:	.quad  0xb8aa3b295c17f0bc  /* 1.442695040888963407359924681002  */
+	.short 0x3fff
 	ASM_SIZE_DIRECTIVE(l2e)
 
 DEFINE_FLT_MIN
diff --git a/sysdeps/i386/fpu/s_log1pl.S b/sysdeps/i386/fpu/s_log1pl.S
index f28349f7d2..86aa438f01 100644
--- a/sysdeps/i386/fpu/s_log1pl.S
+++ b/sysdeps/i386/fpu/s_log1pl.S
@@ -14,7 +14,8 @@ RCSID("$NetBSD: s_log1p.S,v 1.7 1995/05/09 00:10:58 jtc Exp $")
 		-1 + sqrt(2) / 2 <= x <= 1 - sqrt(2) / 2
 	   0.29 is a safe value.
 	*/
-limit:	.tfloat 0.29
+limit:	.quad   0x947ae147ae147ae1 /* 0.29  */
+	.short  0x3ffd
 	/* Please note:	 we use a double value here.  Since 1.0 has
 	   an exact representation this does not effect the accuracy
 	   but it helps to optimize the code.  */
diff --git a/sysdeps/x86_64/fpu/s_log1pl.S b/sysdeps/x86_64/fpu/s_log1pl.S
index 8219f6fbcc..187c65e668 100644
--- a/sysdeps/x86_64/fpu/s_log1pl.S
+++ b/sysdeps/x86_64/fpu/s_log1pl.S
@@ -14,7 +14,8 @@ RCSID("$NetBSD: s_log1p.S,v 1.7 1995/05/09 00:10:58 jtc Exp $")
 		-1 + sqrt(2) / 2 <= x <= 1 - sqrt(2) / 2
 	   0.29 is a safe value.
 	*/
-limit:	.tfloat 0.29
+limit:	.quad   0x947ae147ae147ae1	/* 0.29  */
+	.short	0x3ffd
 	/* Please note:	 we use a double value here.  Since 1.0 has
 	   an exact representation this does not effect the accuracy
 	   but it helps to optimize the code.  */