[v2,1/2] PPC64: Add libmvec SIMD single-precision natural exponent function.

  Passes all tests.

Based on the ./sysdeps/ieee754/dbl-64/e_exp.c implementation,
and thus provides identical results.

Unlike other libmvec functions, this sets the underflow and overflow bits.
The caller can check these flags, and possibly re-run the calculations with
scalar expf to figure out what is causing the overflow or underflow.

Surprisingly, the special-case path performs as well as the normal path
(both of which are vectorized).
Running 20 times over 32MiB
vector: mean 432.263032 MiB/s (sd 0.486733)
scalar: mean 178.646197 MiB/s (sd 0.050013)

v2: NFC, fixed some style issues, note that this is based on another impl.

2019-05-11  Shawn Landden  <shawn@git.icu>

	[BZ #24209]
        * NEWS: Noted the addition of PPC64 vector expf function.
        * sysdeps/powerpc/bits/math-vector.h: Added entry for vector expf.
        * sysdeps/powerpc/powerpc64/fpu/Versions: Added vector expf entry.
        * sysdeps/powerpc/powerpc64/fpu/multiarch/Makefile:
        (libmvec-sysdep_routines, CFLAGS-vec_s_expf4_vsx.c):
        (CFLAGS-vec_s_exp2f_data.c): Added build of VSX SIMD expf
        function and tests. Added vec_math_errf.c to build.
        * sysdeps/powerpc/powerpc64/fpu/multiarch/math_config_flt.h: Modified for expf.
        * sysdeps/powerpc/powerpc64/fpu/multiarch/test-float-vlen4-wrappers.c:
        Added entry for vector expf.
        * sysdeps/powerpc/powerpc64/fpu/multiarch/vec_s_expf4_vsx.c: New file.
        * sysdeps/powerpc/powerpc64/fpu/multiarch/vec_s_exp2f_data.c: Likewise.
        * sysdeps/powerpc/powerpc64/fpu/multiarch/vec_math_errf.c: Likewise.
        * sysdeps/unix/sysv/linux/powerpc/powerpc64/libmvec.abilist: SIMD expf
        added.
---
 NEWS                                          |   1 +
 sysdeps/powerpc/bits/math-vector.h            |   2 +
 sysdeps/powerpc/powerpc64/fpu/Versions        |   1 +
 .../powerpc/powerpc64/fpu/multiarch/Makefile  |   6 +-
 .../powerpc64/fpu/multiarch/math_config_flt.h |  64 +++++--
 .../fpu/multiarch/test-float-vlen4-wrappers.c |   1 +
 .../powerpc64/fpu/multiarch/vec_math_errf.c   |  39 +++++
 .../fpu/multiarch/vec_s_exp2f_data.c          |  58 +++++++
 .../powerpc64/fpu/multiarch/vec_s_expf4_vsx.c | 159 ++++++++++++++++++
 .../linux/powerpc/powerpc64/libmvec.abilist   |   1 +
 10 files changed, 321 insertions(+), 11 deletions(-)
 create mode 100644 sysdeps/powerpc/powerpc64/fpu/multiarch/vec_math_errf.c
 create mode 100644 sysdeps/powerpc/powerpc64/fpu/multiarch/vec_s_exp2f_data.c
 create mode 100644 sysdeps/powerpc/powerpc64/fpu/multiarch/vec_s_expf4_vsx.c

Message ID	20190527174717.21619-1-shawn@git.icu
State	Committed
Delegated to:	Tulio Magno Quites Machado Filho
Headers	Mailing-List: contact libc-alpha-help@sourceware.org; run by ezmlm
	Precedence: bulk
	Sender: libc-alpha-owner@sourceware.org
	From: Shawn Landden <shawn@git.icu>
	To: libc-alpha@sourceware.org
	Cc: Tulio Magno Quites Machado Filho <tuliom@ascii.art.br>, Shawn Landden <shawn@git.icu>
	Subject: [v2 1/2] PPC64: Add libmvec SIMD single-precision natural exponent function.
	Date: Mon, 27 May 2019 12:47:16 -0500
	Message-Id: <20190527174717.21619-1-shawn@git.icu>
	In-Reply-To: <20190512032812.22021-1-shawn@git.icu>
	References: <20190512032812.22021-1-shawn@git.icu>
	MIME-Version: 1.0
	Content-Transfer-Encoding: 8bit
