v12 Improves __ieee754_exp() performance by 6-11% on aarch64/sparc/x86.

  New with this version:
Only e_exp.c and eexp.tbl are updated, plus revised
libm-test-ulps for aarch64/sparc/x86_64, since the removal of
slowexp() was accomplished by a prior patch.

Summary of patch rationale

These changes will be active for all platforms that don't provide
their own exp() routines. They will also be active for ieee754
versions of ccos, ccosh, cosh, csin, csinh, sinh, exp10, gamma, and
erf.

Typical performance gains are 6% on aarch64, 28% on Sparc s7 and 11%
on x86_64 based on the glibc_perf tests.

Glibc correctness tests for exp() and expf() were run. Within the test
suite, one input value caused a 1 ulp difference in the FE_TONEAREST
rounding mode. No differences in exp() were seen for the tested
values in the other rounding modes.
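
As an illustration of what a "1 ulp difference" means, here is a minimal
sketch of counting the ulp distance between two positive doubles, such as
an old and a new exp() result. The name ulp_distance() is hypothetical and
is not taken from the glibc test suite, which has its own ulp machinery.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical helper: number of representable doubles between a and b.
   Assumes both inputs are positive and finite (true for in-range exp()
   results), so the IEEE 754 bit patterns are ordered like the values.  */
static int64_t
ulp_distance (double a, double b)
{
  int64_t ia, ib;

  assert (a > 0.0 && b > 0.0);
  memcpy (&ia, &a, sizeof ia);   /* reinterpret bits without aliasing UB */
  memcpy (&ib, &b, sizeof ib);
  return ia > ib ? ia - ib : ib - ia;
}
```

Two results that round to adjacent doubles give a distance of 1; identical
results give 0.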

When tested over a range of 10 million input values, the new code
produces a 1 ulp error for approximately 1.6 of every 1000 values.
That rate was similar for all four rounding modes.

The patch uses a 64-entry scaling table, where the existing
code uses a 512-entry table.

Further optimization is possible in the handling of rounding
modes. Using get_rounding_mode() and libc_fesetround() instead of
SET_RESTORE_ROUND provides a measurable gain on Sparc.
Unfortunately, on x86, one works with the SSE fp unit rounding mode
while the other works with the x87 fp unit rounding mode.  Adding
libc_fegetround, libc_fegetroundf and libc_fegetroundl to match
libc_fesetround() should not be too large a task, but it is outside
the scope of this patch.
---
 sysdeps/aarch64/libm-test-ulps    |    2 +
 sysdeps/ieee754/dbl-64/e_exp.c    |  307 ++++++++++++++++++++-----------------
 sysdeps/ieee754/dbl-64/eexp.tbl   |  255 ++++++++++++++++++++++++++++++
 sysdeps/sparc/fpu/libm-test-ulps  |    2 +
 sysdeps/x86_64/fpu/libm-test-ulps |    2 +
 5 files changed, 424 insertions(+), 144 deletions(-)
 create mode 100644 sysdeps/ieee754/dbl-64/eexp.tbl
