x86: Use generic vector computations in s_sincosf.h

  On Wed, Dec 26, 2018 at 03:21:14PM +0530, Siddhesh Poyarekar wrote:
> On 17/12/18 4:27 AM, H.J. Lu wrote:
> > Here is the updated patch to use generic vector computations only for
> > x86.  Tested on i686 and x86-64.  OK for master branch?
> 
> I assume you'll add the original description to this so that there's a
> clearer git commit log.
> 
> > 
> > Thanks.
> > 
> > H.J.
> > --
> > Use generic vector computations in s_sincosf.h to support vectorized
> > s_sincosf.  Update __sincosf_table for vectorized s_sincosf.  On
> > Broadwell, bench-sincosf shows:
> > 
> >         Before         After      Improvement
> > max    160.273        114.198        40%
> > min    6.25           5.625          11%
> > mean   13.0325        10.6462        22%
> > 
> > Vectorized sincosf_poly shows
> > 
> >         Before         After      Improvement
> > max    138.653        114.198        21%
> > min    5.004          5.625          -11%
> > mean   11.5934        10.6462        9%
> 
> I assume Wilco's performance gain also remains with this patch given that
> the crux of the code hasn't changed.
> 
> Looks OK to me.
> 

This is the patch I am checking in.

Thanks.

H.J.
----
Add <sincosf_poly.h> and include it in s_sincosf.h to allow vectorized
sincosf_poly.  Add x86 sincosf_poly.h to vectorize sincosf_poly.  On
Broadwell, bench-sincosf shows:

       Before         After      Improvement
max    160.273        114.198        40%
min    6.25           5.625          11%
mean   13.0325        10.6462        22%

Vectorized sincosf_poly shows

       Before         After      Improvement
max    138.653        114.198        21%
min    5.004          5.625          -11%
mean   11.5934        10.6462        9%

Tested on x86-64 and i686 as well as with build-many-glibcs.py.

	* sysdeps/ieee754/flt-32/s_sincosf.h: Include <sincosf_poly.h>.
	(sincos_t, sincosf_poly, sinf_poly): Moved to ...
	* sysdeps/ieee754/flt-32/sincosf_poly.h: Here.  New file.
	* sysdeps/x86/fpu/s_sincosf_data.c: New file.
	* sysdeps/x86/fpu/sincosf_poly.h: Likewise.
	* sysdeps/x86_64/fpu/multiarch/s_sincosf-fma.c: Just include
	<sysdeps/ieee754/flt-32/s_sincosf.c>.
---
 sysdeps/ieee754/flt-32/s_sincosf.h           |  71 +----
 sysdeps/ieee754/flt-32/sincosf_poly.h        |  87 ++++++
 sysdeps/x86/fpu/s_sincosf_data.c             |  68 +++++
 sysdeps/x86/fpu/sincosf_poly.h               | 111 ++++++++
 sysdeps/x86_64/fpu/multiarch/s_sincosf-fma.c | 271 +------------------
 5 files changed, 268 insertions(+), 340 deletions(-)
 create mode 100644 sysdeps/ieee754/flt-32/sincosf_poly.h
 create mode 100644 sysdeps/x86/fpu/s_sincosf_data.c
 create mode 100644 sysdeps/x86/fpu/sincosf_poly.h

Message ID	20181226145526.GA4889@gmail.com
State	Committed
Commit	8700a7851bccce63335f937f930382de58b8a249
Headers	Mailing-List: contact libc-alpha-help@sourceware.org; run by ezmlm Precedence: bulk Sender: libc-alpha-owner@sourceware.org Date: Wed, 26 Dec 2018 06:55:26 -0800 From: "H.J. Lu" <hjl.tools@gmail.com> To: Siddhesh Poyarekar <siddhesh@gotplt.org> Cc: Carlos O'Donell <carlos@redhat.com>, Wilco Dijkstra <Wilco.Dijkstra@arm.com>, 'GNU C Library' <libc-alpha@sourceware.org>, nd <nd@arm.com> Subject: Re: [PATCH] x86: Use generic vector computations in s_sincosf.h Message-ID: <20181226145526.GA4889@gmail.com> References: <DB5PR08MB10309CAA6797BBA957E8BC6B83A10@DB5PR08MB1030.eurprd08.prod.outlook.com> <199249f8-a482-d424-c308-6dffddef838e@redhat.com> <20181216225702.GA8505@gmail.com> <9641865c-ca19-e429-c474-cd2e4eb3b410@gotplt.org> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <9641865c-ca19-e429-c474-cd2e4eb3b410@gotplt.org> User-Agent: Mutt/1.10.1 (2018-07-13)

x86: Use generic vector computations in s_sincosf.h

Commit Message

Patch