[RFC,v3] powerpc: add libmvec implementations of log and logf

  When fed numbers in the range of 0 to 2^32-1 (as doubles) the vector log
is about 75% faster than scalar log.

When fed numbers in the range of 0 to 2^16-1 (as floats) the vector logf
is about 30% faster than scalar logf. This should probably be faster,
and did not spend much time in perf looking into this.[2]

logf requires Power 7
log requires Power 8

I have not completed a FSF copyright assignement, but would be happy to do so.
These are base on the routines here: https://github.com/ARM-software/optimized-routines
of which the ability to incorporate in glibc is a specific goal.
CCing its' maintainer Szabolcs Nagy.

[2] benchmark programs: https://github.com/shawnl/libmvec

v2: rebase on top of the appropiate branch
v3: get the function name right
---
 .../powerpc/powerpc64/fpu/multiarch/Makefile  |   8 +-
 .../multiarch/test-double-vlen2-wrappers.c    |   1 +
 .../fpu/multiarch/test-float-vlen4-wrappers.c |   1 +
 .../powerpc64/fpu/multiarch/vec_s_log2_vsx.c  | 173 +++++++++++++++
 .../powerpc64/fpu/multiarch/vec_s_log_data.c  | 208 ++++++++++++++++++
 .../powerpc64/fpu/multiarch/vec_s_log_data.h  |  36 +++
 .../powerpc64/fpu/multiarch/vec_s_logf4_vsx.c | 126 +++++++++++
 .../powerpc64/fpu/multiarch/vec_s_logf_data.c |  44 ++++
 .../powerpc64/fpu/multiarch/vec_s_logf_data.h |  32 +++
 9 files changed, 627 insertions(+), 2 deletions(-)
 create mode 100644 sysdeps/powerpc/powerpc64/fpu/multiarch/vec_s_log2_vsx.c
 create mode 100644 sysdeps/powerpc/powerpc64/fpu/multiarch/vec_s_log_data.c
 create mode 100644 sysdeps/powerpc/powerpc64/fpu/multiarch/vec_s_log_data.h
 create mode 100644 sysdeps/powerpc/powerpc64/fpu/multiarch/vec_s_logf4_vsx.c
 create mode 100644 sysdeps/powerpc/powerpc64/fpu/multiarch/vec_s_logf_data.c
 create mode 100644 sysdeps/powerpc/powerpc64/fpu/multiarch/vec_s_logf_data.h

[RFC,v3] powerpc: add libmvec implementations of log and logf

Commit Message

Comments

Patch