From patchwork Thu Mar 9 18:33:12 2023 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: "H.J. Lu" X-Patchwork-Id: 66182 Return-Path: X-Original-To: patchwork@sourceware.org Delivered-To: patchwork@sourceware.org Received: from server2.sourceware.org (localhost [IPv6:::1]) by sourceware.org (Postfix) with ESMTP id 148743858C5E for ; Thu, 9 Mar 2023 18:33:37 +0000 (GMT) DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org 148743858C5E DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=sourceware.org; s=default; t=1678386817; bh=HOU0TLcba1HHWpPeb01GrU2k3ZG3+ZAMmCaO/4ZI/uE=; h=To:Subject:Date:List-Id:List-Unsubscribe:List-Archive:List-Post: List-Help:List-Subscribe:From:Reply-To:From; b=pneX0TyvbeuXUKzZ3dm3QABXEalhOmqvg9YTXdjMNPC8sA0KXARCfsMZ6nxh6c1RS 7IUqW89pJhbFYCIbfmzbgMLoWMkU2JNVKoIsDvZcNLN6LLRTqLUeNPje+tMK5jTIcF RdNIpAfkjhuOaEv2HpQZThZ7H73aq0yA9kd/sS2k= X-Original-To: libc-alpha@sourceware.org Delivered-To: libc-alpha@sourceware.org Received: from mail-pl1-x636.google.com (mail-pl1-x636.google.com [IPv6:2607:f8b0:4864:20::636]) by sourceware.org (Postfix) with ESMTPS id 937243858D20 for ; Thu, 9 Mar 2023 18:33:15 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.4.2 sourceware.org 937243858D20 Received: by mail-pl1-x636.google.com with SMTP id ky4so2979606plb.3 for ; Thu, 09 Mar 2023 10:33:15 -0800 (PST) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; t=1678386794; h=content-transfer-encoding:mime-version:message-id:date:subject:to :from:x-gm-message-state:from:to:cc:subject:date:message-id:reply-to; bh=HOU0TLcba1HHWpPeb01GrU2k3ZG3+ZAMmCaO/4ZI/uE=; b=eDLu1pG41hOiyVR6jM9STbam7bEDJi3dgIMaq9mUy8VEuKyqsviU2gR6Tr/lBDQxvA HTBP2tm10AhmUgnyB056DCiCAjoqZy3p/r5FFRHVpwC+gl30LAGztjN62j4pDHzubk80 3yL9l5TWKReeRS/QquNdeL23QcJjyGAEeLLOQ06SWndCobI952V8tyzgb3gKedwv6zYl oBlli5dY56BZyNTkDBSJvrmE0nPJLILw/4Q47orB7xtt87sNqB8vwwH9W7+rMzeTK3xw 8FNFCZSur1MSiUoN0l+mf4lQusEZ75r8VyWSVhPSxJu10glznon0DW8BjcuhOHJRizYk uCCQ== X-Gm-Message-State: AO0yUKXuZxq3V8KrgOC20O30q36aIjC+Dp224WnkPnYOEJfB/U1EKefY C86cRWzKKiYuIZZUgBBS/c/8tT4eWCY= X-Google-Smtp-Source: AK7set+fGpiatHSOUqLhUubEVdnqQfRLNYhcAy65IaL5zrF/fYQweqPGW7qM5+udkpI0lIq+5QUX4g== X-Received: by 2002:a17:902:bd93:b0:19c:be03:d1a3 with SMTP id q19-20020a170902bd9300b0019cbe03d1a3mr19526832pls.40.1678386794003; Thu, 09 Mar 2023 10:33:14 -0800 (PST) Received: from gnu-cfl-3.localdomain ([172.59.161.113]) by smtp.gmail.com with ESMTPSA id g22-20020a1709029f9600b001991d6c6c64sm10448761plq.185.2023.03.09.10.33.13 for (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Thu, 09 Mar 2023 10:33:13 -0800 (PST) Received: from gnu-cfl-3.. (localhost [IPv6:::1]) by gnu-cfl-3.localdomain (Postfix) with ESMTP id 6E416740146 for ; Thu, 9 Mar 2023 10:33:12 -0800 (PST) To: libc-alpha@sourceware.org Subject: [PATCH] x86-64: Add x87 fmod and remainder [BZ #30179] Date: Thu, 9 Mar 2023 10:33:12 -0800 Message-Id: <20230309183312.205763-1-hjl.tools@gmail.com> X-Mailer: git-send-email 2.39.2 MIME-Version: 1.0 X-Spam-Status: No, score=-3025.6 required=5.0 tests=BAYES_00, DKIM_SIGNED, DKIM_VALID, DKIM_VALID_AU, DKIM_VALID_EF, FREEMAIL_FROM, GIT_PATCH_0, RCVD_IN_DNSWL_NONE, SPF_HELO_NONE, SPF_PASS, TXREP autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on server2.sourceware.org X-BeenThere: libc-alpha@sourceware.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: Libc-alpha mailing list List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-Patchwork-Original-From: "H.J. Lu via Libc-alpha" From: "H.J. Lu" Reply-To: "H.J. Lu" Errors-To: libc-alpha-bounces+patchwork=sourceware.org@sourceware.org Sender: "Libc-alpha" X87 (fprem/fprem1) implementations of fmod and remainder are much faster than generic fmod and remainder. Add e_fmod.S, e_fmodf.S, e_remainder.S and e_remainderf.S with fprem/fprem1. This fixes BZ #30179. --- sysdeps/x86_64/fpu/e_fmod.S | 22 ++++++++++++++++++++++ sysdeps/x86_64/fpu/e_fmodf.S | 22 ++++++++++++++++++++++ sysdeps/x86_64/fpu/e_remainder.S | 22 ++++++++++++++++++++++ sysdeps/x86_64/fpu/e_remainderf.S | 22 ++++++++++++++++++++++ 4 files changed, 88 insertions(+) create mode 100644 sysdeps/x86_64/fpu/e_fmod.S create mode 100644 sysdeps/x86_64/fpu/e_fmodf.S create mode 100644 sysdeps/x86_64/fpu/e_remainder.S create mode 100644 sysdeps/x86_64/fpu/e_remainderf.S diff --git a/sysdeps/x86_64/fpu/e_fmod.S b/sysdeps/x86_64/fpu/e_fmod.S new file mode 100644 index 0000000000..4bdc8a1ab0 --- /dev/null +++ b/sysdeps/x86_64/fpu/e_fmod.S @@ -0,0 +1,22 @@ +/* + * Public domain. + */ + +#include +#include + +ENTRY(__ieee754_fmod) + movsd %xmm0, -16(%rsp) + movsd %xmm1, -8(%rsp) + fldl -8(%rsp) + fldl -16(%rsp) +1: fprem + fstsw %ax + sahf + jp 1b + fstp %st(1) + fstpl -8(%rsp) + movsd -8(%rsp), %xmm0 + ret +END (__ieee754_fmod) +libm_alias_finite (__ieee754_fmod, __fmod) diff --git a/sysdeps/x86_64/fpu/e_fmodf.S b/sysdeps/x86_64/fpu/e_fmodf.S new file mode 100644 index 0000000000..6f76daff01 --- /dev/null +++ b/sysdeps/x86_64/fpu/e_fmodf.S @@ -0,0 +1,22 @@ +/* + * Public domain. + */ + +#include +#include + +ENTRY(__ieee754_fmodf) + movss %xmm0, -8(%rsp) + movss %xmm1, -4(%rsp) + flds -4(%rsp) + flds -8(%rsp) +1: fprem + fstsw %ax + sahf + jp 1b + fstp %st(1) + fstps -4(%rsp) + movss -4(%rsp), %xmm0 + ret +END (__ieee754_fmodf) +libm_alias_finite (__ieee754_fmodf, __fmodf) diff --git a/sysdeps/x86_64/fpu/e_remainder.S b/sysdeps/x86_64/fpu/e_remainder.S new file mode 100644 index 0000000000..be2184f25a --- /dev/null +++ b/sysdeps/x86_64/fpu/e_remainder.S @@ -0,0 +1,22 @@ +/* + * Public domain. + */ + +#include +#include + +ENTRY(__ieee754_remainder) + movsd %xmm0, -16(%rsp) + movsd %xmm1, -8(%rsp) + fldl -8(%rsp) + fldl -16(%rsp) +1: fprem1 + fstsw %ax + sahf + jp 1b + fstp %st(1) + fstpl -8(%rsp) + movsd -8(%rsp), %xmm0 + ret +END (__ieee754_remainder) +libm_alias_finite (__ieee754_remainder, __remainder) diff --git a/sysdeps/x86_64/fpu/e_remainderf.S b/sysdeps/x86_64/fpu/e_remainderf.S new file mode 100644 index 0000000000..42972d3f84 --- /dev/null +++ b/sysdeps/x86_64/fpu/e_remainderf.S @@ -0,0 +1,22 @@ +/* + * Public domain. + */ + +#include +#include + +ENTRY(__ieee754_remainderf) + movss %xmm0, -8(%rsp) + movss %xmm1, -4(%rsp) + flds -4(%rsp) + flds -8(%rsp) +1: fprem1 + fstsw %ax + sahf + jp 1b + fstp %st(1) + fstps -4(%rsp) + movss -4(%rsp), %xmm0 + ret +END (__ieee754_remainderf) +libm_alias_finite (__ieee754_remainderf, __remainderf)