[v4,03/18] x86-64: Add vector hypot/hypotf implementation to libmvec

  Implement vectorized hypot/hypotf containing SSE, AVX, AVX2 and
AVX512 versions for libmvec as per vector ABI.  It also contains
accuracy and ABI tests for vector hypot/hypotf with regenerated ulps.
---
 bits/libm-simd-decl-stubs.h                   |  11 +
 math/bits/mathcalls.h                         |   2 +-
 .../unix/sysv/linux/x86_64/libmvec.abilist    |   8 +
 sysdeps/x86/fpu/bits/math-vector.h            |   4 +
 .../x86/fpu/finclude/math-vector-fortran.h    |   4 +
 sysdeps/x86_64/fpu/Makeconfig                 |   1 +
 sysdeps/x86_64/fpu/Versions                   |   2 +
 sysdeps/x86_64/fpu/libm-test-ulps             |  20 ++
 .../fpu/multiarch/svml_d_hypot2_core-sse2.S   |  20 ++
 .../x86_64/fpu/multiarch/svml_d_hypot2_core.c |  28 ++
 .../fpu/multiarch/svml_d_hypot2_core_sse4.S   | 279 +++++++++++++++++
 .../fpu/multiarch/svml_d_hypot4_core-sse.S    |  20 ++
 .../x86_64/fpu/multiarch/svml_d_hypot4_core.c |  28 ++
 .../fpu/multiarch/svml_d_hypot4_core_avx2.S   | 289 ++++++++++++++++++
 .../fpu/multiarch/svml_d_hypot8_core-avx2.S   |  20 ++
 .../x86_64/fpu/multiarch/svml_d_hypot8_core.c |  28 ++
 .../fpu/multiarch/svml_d_hypot8_core_avx512.S | 235 ++++++++++++++
 .../fpu/multiarch/svml_s_hypotf16_core-avx2.S |  20 ++
 .../fpu/multiarch/svml_s_hypotf16_core.c      |  28 ++
 .../multiarch/svml_s_hypotf16_core_avx512.S   | 239 +++++++++++++++
 .../fpu/multiarch/svml_s_hypotf4_core-sse2.S  |  20 ++
 .../fpu/multiarch/svml_s_hypotf4_core.c       |  28 ++
 .../fpu/multiarch/svml_s_hypotf4_core_sse4.S  | 265 ++++++++++++++++
 .../fpu/multiarch/svml_s_hypotf8_core-sse.S   |  20 ++
 .../fpu/multiarch/svml_s_hypotf8_core.c       |  28 ++
 .../fpu/multiarch/svml_s_hypotf8_core_avx2.S  | 269 ++++++++++++++++
 sysdeps/x86_64/fpu/svml_d_hypot2_core.S       |  29 ++
 sysdeps/x86_64/fpu/svml_d_hypot4_core.S       |  29 ++
 sysdeps/x86_64/fpu/svml_d_hypot4_core_avx.S   |  25 ++
 sysdeps/x86_64/fpu/svml_d_hypot8_core.S       |  25 ++
 sysdeps/x86_64/fpu/svml_s_hypotf16_core.S     |  25 ++
 sysdeps/x86_64/fpu/svml_s_hypotf4_core.S      |  29 ++
 sysdeps/x86_64/fpu/svml_s_hypotf8_core.S      |  29 ++
 sysdeps/x86_64/fpu/svml_s_hypotf8_core_avx.S  |  25 ++
 .../fpu/test-double-libmvec-hypot-avx.c       |   1 +
 .../fpu/test-double-libmvec-hypot-avx2.c      |   1 +
 .../fpu/test-double-libmvec-hypot-avx512f.c   |   1 +
 .../x86_64/fpu/test-double-libmvec-hypot.c    |   3 +
 .../x86_64/fpu/test-double-vlen2-wrappers.c   |   1 +
 .../fpu/test-double-vlen4-avx2-wrappers.c     |   1 +
 .../x86_64/fpu/test-double-vlen4-wrappers.c   |   1 +
 .../x86_64/fpu/test-double-vlen8-wrappers.c   |   1 +
 .../fpu/test-float-libmvec-hypotf-avx.c       |   1 +
 .../fpu/test-float-libmvec-hypotf-avx2.c      |   1 +
 .../fpu/test-float-libmvec-hypotf-avx512f.c   |   1 +
 .../x86_64/fpu/test-float-libmvec-hypotf.c    |   3 +
 .../x86_64/fpu/test-float-vlen16-wrappers.c   |   1 +
 .../x86_64/fpu/test-float-vlen4-wrappers.c    |   1 +
 .../fpu/test-float-vlen8-avx2-wrappers.c      |   1 +
 .../x86_64/fpu/test-float-vlen8-wrappers.c    |   1 +
 50 files changed, 2151 insertions(+), 1 deletion(-)
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_d_hypot2_core-sse2.S
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_d_hypot2_core.c
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_d_hypot2_core_sse4.S
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_d_hypot4_core-sse.S
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_d_hypot4_core.c
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_d_hypot4_core_avx2.S
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_d_hypot8_core-avx2.S
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_d_hypot8_core.c
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_d_hypot8_core_avx512.S
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_s_hypotf16_core-avx2.S
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_s_hypotf16_core.c
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_s_hypotf16_core_avx512.S
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_s_hypotf4_core-sse2.S
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_s_hypotf4_core.c
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_s_hypotf4_core_sse4.S
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_s_hypotf8_core-sse.S
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_s_hypotf8_core.c
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_s_hypotf8_core_avx2.S
 create mode 100644 sysdeps/x86_64/fpu/svml_d_hypot2_core.S
 create mode 100644 sysdeps/x86_64/fpu/svml_d_hypot4_core.S
 create mode 100644 sysdeps/x86_64/fpu/svml_d_hypot4_core_avx.S
 create mode 100644 sysdeps/x86_64/fpu/svml_d_hypot8_core.S
 create mode 100644 sysdeps/x86_64/fpu/svml_s_hypotf16_core.S
 create mode 100644 sysdeps/x86_64/fpu/svml_s_hypotf4_core.S
 create mode 100644 sysdeps/x86_64/fpu/svml_s_hypotf8_core.S
 create mode 100644 sysdeps/x86_64/fpu/svml_s_hypotf8_core_avx.S
 create mode 100644 sysdeps/x86_64/fpu/test-double-libmvec-hypot-avx.c
 create mode 100644 sysdeps/x86_64/fpu/test-double-libmvec-hypot-avx2.c
 create mode 100644 sysdeps/x86_64/fpu/test-double-libmvec-hypot-avx512f.c
 create mode 100644 sysdeps/x86_64/fpu/test-double-libmvec-hypot.c
 create mode 100644 sysdeps/x86_64/fpu/test-float-libmvec-hypotf-avx.c
 create mode 100644 sysdeps/x86_64/fpu/test-float-libmvec-hypotf-avx2.c
 create mode 100644 sysdeps/x86_64/fpu/test-float-libmvec-hypotf-avx512f.c
 create mode 100644 sysdeps/x86_64/fpu/test-float-libmvec-hypotf.c

Message ID	20211228201130.737370-4-skpgkp2@gmail.com
State	Superseded
Headers	DMARC-Filter: OpenDMARC Filter v1.4.1 sourceware.org 974703858004 To: libc-alpha@sourceware.org Subject: [PATCH v4 03/18] x86-64: Add vector hypot/hypotf implementation to libmvec Date: Tue, 28 Dec 2021 12:11:15 -0800 Message-Id: <20211228201130.737370-4-skpgkp2@gmail.com> In-Reply-To: <20211228201130.737370-1-skpgkp2@gmail.com> References: <CAMe9rOrLdPcoaPxUA5oXqJPsfriPbbo=q4z7v5GQrKBq1LXARw@mail.gmail.com> <20211228201130.737370-1-skpgkp2@gmail.com> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Precedence: list From: Sunil K Pandey via Libc-alpha <libc-alpha@sourceware.org> Reply-To: Sunil K Pandey <skpgkp2@gmail.com> Cc: andrey.kolesov@intel.com, marius.cornea@intel.com Errors-To: libc-alpha-bounces+patchwork=sourceware.org@sourceware.org Sender: "Libc-alpha" <libc-alpha-bounces+patchwork=sourceware.org@sourceware.org>
Series	x86-64: Add vector math functions to libmvec \| [v4,00/18] x86-64: Add vector math functions to libmvec [v4,01/18] x86-64: Add vector atan/atanf implementation to libmvec [v4,02/18] x86-64: Add vector asin/asinf implementation to libmvec [v4,03/18] x86-64: Add vector hypot/hypotf implementation to libmvec [v4,04/18] x86-64: Add vector exp2/exp2f implementation to libmvec [v4,05/18] x86-64: Add vector exp10/exp10f implementation to libmvec [v4,06/18] x86-64: Add vector cosh/coshf implementation to libmvec [v4,07/18] x86-64: Add vector expm1/expm1f implementation to libmvec [v4,08/18] x86-64: Add vector sinh/sinhf implementation to libmvec [v4,09/18] x86-64: Add vector cbrt/cbrtf implementation to libmvec [v4,10/18] x86-64: Add vector atan2/atan2f implementation to libmvec [v4,11/18] x86-64: Add vector log10/log10f implementation to libmvec [v4,12/18] x86-64: Add vector log2/log2f implementation to libmvec [v4,13/18] x86-64: Add vector log1p/log1pf implementation to libmvec [v4,14/18] x86-64: Add vector atanh/atanhf implementation to libmvec [v4,15/18] x86-64: Add vector acosh/acoshf implementation to libmvec [v4,16/18] x86-64: Add vector erf/erff implementation to libmvec [v4,17/18] x86-64: Add vector tanh/tanhf implementation to libmvec [v4,18/18] x86-64: Add vector asinh/asinhf implementation to libmvec

[v4,03/18] x86-64: Add vector hypot/hypotf implementation to libmvec

Checks

Commit Message

Patch