[v3,01/18] x86-64: Add vector atan/atanf implementation to libmvec

Implement vectorized atan/atanf for libmvec, with SSE, AVX, AVX2 and
AVX512 versions, as per the vector ABI.  Also add accuracy and ABI
tests for vector atan/atanf, with regenerated ulps.
---
 bits/libm-simd-decl-stubs.h                   |  11 +
 math/bits/mathcalls.h                         |   2 +-
 .../unix/sysv/linux/x86_64/libmvec.abilist    |   8 +
 sysdeps/x86/fpu/bits/math-vector.h            |   4 +
 .../x86/fpu/finclude/math-vector-fortran.h    |   4 +
 sysdeps/x86_64/fpu/Makeconfig                 |   1 +
 sysdeps/x86_64/fpu/Versions                   |   2 +
 sysdeps/x86_64/fpu/libm-test-ulps             |  20 ++
 .../fpu/multiarch/svml_d_atan2_core-sse2.S    |  20 ++
 .../x86_64/fpu/multiarch/svml_d_atan2_core.c  |  27 ++
 .../fpu/multiarch/svml_d_atan2_core_sse4.S    | 245 ++++++++++++++++++
 .../fpu/multiarch/svml_d_atan4_core-sse.S     |  20 ++
 .../x86_64/fpu/multiarch/svml_d_atan4_core.c  |  27 ++
 .../fpu/multiarch/svml_d_atan4_core_avx2.S    | 225 ++++++++++++++++
 .../fpu/multiarch/svml_d_atan8_core-avx2.S    |  20 ++
 .../x86_64/fpu/multiarch/svml_d_atan8_core.c  |  27 ++
 .../fpu/multiarch/svml_d_atan8_core_avx512.S  | 213 +++++++++++++++
 .../fpu/multiarch/svml_s_atanf16_core-avx2.S  |  20 ++
 .../fpu/multiarch/svml_s_atanf16_core.c       |  28 ++
 .../multiarch/svml_s_atanf16_core_avx512.S    | 174 +++++++++++++
 .../fpu/multiarch/svml_s_atanf4_core-sse2.S   |  20 ++
 .../x86_64/fpu/multiarch/svml_s_atanf4_core.c |  28 ++
 .../fpu/multiarch/svml_s_atanf4_core_sse4.S   | 164 ++++++++++++
 .../fpu/multiarch/svml_s_atanf8_core-sse.S    |  20 ++
 .../x86_64/fpu/multiarch/svml_s_atanf8_core.c |  28 ++
 .../fpu/multiarch/svml_s_atanf8_core_avx2.S   | 148 +++++++++++
 sysdeps/x86_64/fpu/svml_d_atan2_core.S        |  29 +++
 sysdeps/x86_64/fpu/svml_d_atan4_core.S        |  29 +++
 sysdeps/x86_64/fpu/svml_d_atan4_core_avx.S    |  25 ++
 sysdeps/x86_64/fpu/svml_d_atan8_core.S        |  25 ++
 sysdeps/x86_64/fpu/svml_s_atanf16_core.S      |  25 ++
 sysdeps/x86_64/fpu/svml_s_atanf4_core.S       |  29 +++
 sysdeps/x86_64/fpu/svml_s_atanf8_core.S       |  29 +++
 sysdeps/x86_64/fpu/svml_s_atanf8_core_avx.S   |  25 ++
 .../x86_64/fpu/test-double-libmvec-atan-avx.c |   1 +
 .../fpu/test-double-libmvec-atan-avx2.c       |   1 +
 .../fpu/test-double-libmvec-atan-avx512f.c    |   1 +
 sysdeps/x86_64/fpu/test-double-libmvec-atan.c |   3 +
 .../x86_64/fpu/test-double-vlen2-wrappers.c   |   1 +
 .../fpu/test-double-vlen4-avx2-wrappers.c     |   1 +
 .../x86_64/fpu/test-double-vlen4-wrappers.c   |   1 +
 .../x86_64/fpu/test-double-vlen8-wrappers.c   |   1 +
 .../x86_64/fpu/test-float-libmvec-atanf-avx.c |   1 +
 .../fpu/test-float-libmvec-atanf-avx2.c       |   1 +
 .../fpu/test-float-libmvec-atanf-avx512f.c    |   1 +
 sysdeps/x86_64/fpu/test-float-libmvec-atanf.c |   3 +
 .../x86_64/fpu/test-float-vlen16-wrappers.c   |   1 +
 .../x86_64/fpu/test-float-vlen4-wrappers.c    |   1 +
 .../fpu/test-float-vlen8-avx2-wrappers.c      |   1 +
 .../x86_64/fpu/test-float-vlen8-wrappers.c    |   1 +
 50 files changed, 1741 insertions(+), 1 deletion(-)
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_d_atan2_core-sse2.S
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_d_atan2_core.c
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_d_atan2_core_sse4.S
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_d_atan4_core-sse.S
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_d_atan4_core.c
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_d_atan4_core_avx2.S
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_d_atan8_core-avx2.S
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_d_atan8_core.c
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_d_atan8_core_avx512.S
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_s_atanf16_core-avx2.S
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_s_atanf16_core.c
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_s_atanf16_core_avx512.S
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_s_atanf4_core-sse2.S
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_s_atanf4_core.c
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_s_atanf4_core_sse4.S
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_s_atanf8_core-sse.S
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_s_atanf8_core.c
 create mode 100644 sysdeps/x86_64/fpu/multiarch/svml_s_atanf8_core_avx2.S
 create mode 100644 sysdeps/x86_64/fpu/svml_d_atan2_core.S
 create mode 100644 sysdeps/x86_64/fpu/svml_d_atan4_core.S
 create mode 100644 sysdeps/x86_64/fpu/svml_d_atan4_core_avx.S
 create mode 100644 sysdeps/x86_64/fpu/svml_d_atan8_core.S
 create mode 100644 sysdeps/x86_64/fpu/svml_s_atanf16_core.S
 create mode 100644 sysdeps/x86_64/fpu/svml_s_atanf4_core.S
 create mode 100644 sysdeps/x86_64/fpu/svml_s_atanf8_core.S
 create mode 100644 sysdeps/x86_64/fpu/svml_s_atanf8_core_avx.S
 create mode 100644 sysdeps/x86_64/fpu/test-double-libmvec-atan-avx.c
 create mode 100644 sysdeps/x86_64/fpu/test-double-libmvec-atan-avx2.c
 create mode 100644 sysdeps/x86_64/fpu/test-double-libmvec-atan-avx512f.c
 create mode 100644 sysdeps/x86_64/fpu/test-double-libmvec-atan.c
 create mode 100644 sysdeps/x86_64/fpu/test-float-libmvec-atanf-avx.c
 create mode 100644 sysdeps/x86_64/fpu/test-float-libmvec-atanf-avx2.c
 create mode 100644 sysdeps/x86_64/fpu/test-float-libmvec-atanf-avx512f.c
 create mode 100644 sysdeps/x86_64/fpu/test-float-libmvec-atanf.c

Message ID	20211227150457.1680245-2-skpgkp2@gmail.com
State	Superseded
Headers
  DMARC-Filter: OpenDMARC Filter v1.4.1 sourceware.org 1BC0B3858418
  To: libc-alpha@sourceware.org
  Subject: [PATCH v3 01/18] x86-64: Add vector atan/atanf implementation to libmvec
  Date: Mon, 27 Dec 2021 07:04:40 -0800
  Message-Id: <20211227150457.1680245-2-skpgkp2@gmail.com>
  In-Reply-To: <20211227150457.1680245-1-skpgkp2@gmail.com>
  References: <20211227150457.1680245-1-skpgkp2@gmail.com>
  MIME-Version: 1.0
  Content-Transfer-Encoding: 8bit
  Precedence: list
  From: Sunil K Pandey via Libc-alpha <libc-alpha@sourceware.org>
  Reply-To: Sunil K Pandey <skpgkp2@gmail.com>
  Cc: andrey.kolesov@intel.com, marius.cornea@intel.com
  Errors-To: libc-alpha-bounces+patchwork=sourceware.org@sourceware.org
  Sender: "Libc-alpha" <libc-alpha-bounces+patchwork=sourceware.org@sourceware.org>
Series	x86-64: Add vector math functions to libmvec
  [v3,00/18] x86-64: Add vector math functions to libmvec
  [v3,01/18] x86-64: Add vector atan/atanf implementation to libmvec
  [v3,02/18] x86-64: Add vector asin/asinf implementation to libmvec
  [v3,03/18] x86-64: Add vector hypot/hypotf implementation to libmvec
  [v3,04/18] x86-64: Add vector exp2/exp2f implementation to libmvec
  [v3,05/18] x86-64: Add vector exp10/exp10f implementation to libmvec
  [v3,06/18] x86-64: Add vector cosh/coshf implementation to libmvec
  [v3,07/18] x86-64: Add vector expm1/expm1f implementation to libmvec
  [v3,08/18] x86-64: Add vector sinh/sinhf implementation to libmvec
  [v3,09/18] x86-64: Add vector cbrt/cbrtf implementation to libmvec
  [v3,10/18] x86-64: Add vector atan2/atan2f implementation to libmvec
  [v3,11/18] x86-64: Add vector log10/log10f implementation to libmvec
  [v3,12/18] x86-64: Add vector log2/log2f implementation to libmvec
  [v3,13/18] x86-64: Add vector log1p/log1pf implementation to libmvec
  [v3,14/18] x86-64: Add vector atanh/atanhf implementation to libmvec
  [v3,15/18] x86-64: Add vector acosh/acoshf implementation to libmvec
  [v3,16/18] x86-64: Add vector erf/erff implementation to libmvec
  [v3,17/18] x86-64: Add vector tanh/tanhf implementation to libmvec
  [v3,18/18] x86-64: Add vector asinh/asinhf implementation to libmvec
