[RFC] powerpc: add libmvec implementations of log and logf

  The ABI spec[1] Is x86 specific. I removed the arch field,
because this can be handled using IFUNC (the vector width is
part of the name).

GCC will have to be taught about these.

When fed numbers in the range of 0 to 2^32 (as doubles) the vector log
is about 75% faster than scalar log.

When fed numbers in the range of 0 to 2^16 (as floats) the vector logf
is about 30% faster than scalar logf. This should probably be faster,
and did not spend much time in perf looking into this.[2]

logf requires Power 7
log requires Power 8
(according to gcc allowing them to be compiled with -mcpu=powerx)

I have not completed a FSF copyright assignement, but would be happy to do so.

[1] https://sourceware.org/glibc/wiki/libmvec?action=AttachFile&do=view&target=VectorABI.txt
[2] benchmark programs: https://github.com/shawnl/libmvec
---
 sysdeps/powerpc/fpu/Makefile                  |  16 +-
 sysdeps/powerpc/fpu/Versions                  |   7 +
 sysdeps/powerpc/fpu/svml_s_log2.c             | 376 ++++++++++++++++++
 sysdeps/powerpc/fpu/svml_s_logf4.c            | 163 ++++++++
 .../powerpc/fpu/test-double-vlen2-wrappers.c  |  24 ++
 .../powerpc/fpu/test-float-vlen4-wrappers.c   |  24 ++
 6 files changed, 609 insertions(+), 1 deletion(-)
 create mode 100644 sysdeps/powerpc/fpu/Versions
 create mode 100644 sysdeps/powerpc/fpu/svml_s_log2.c
 create mode 100644 sysdeps/powerpc/fpu/svml_s_logf4.c
 create mode 100644 sysdeps/powerpc/fpu/test-double-vlen2-wrappers.c
 create mode 100644 sysdeps/powerpc/fpu/test-float-vlen4-wrappers.c

[RFC] powerpc: add libmvec implementations of log and logf

Commit Message

Comments

Patch