From patchwork Thu Jun 30 11:43:40 2016
Content-Type: text/plain; charset="utf-8"
MIME-Version: 1.0
Content-Transfer-Encoding: 7bit
X-Patchwork-Submitter: Andrew Senkevich <andrew.n.senkevich@gmail.com>
X-Patchwork-Id: 13505
Received: (qmail 129139 invoked by alias); 30 Jun 2016 11:44:24 -0000
Mailing-List: contact libc-alpha-help@sourceware.org; run by ezmlm
Precedence: bulk
List-Id: <libc-alpha.sourceware.org>
List-Unsubscribe: <mailto:libc-alpha-unsubscribe-##L=##H@sourceware.org>
List-Subscribe: <mailto:libc-alpha-subscribe@sourceware.org>
List-Archive: <http://sourceware.org/ml/libc-alpha/>
List-Post: <mailto:libc-alpha@sourceware.org>
List-Help: <mailto:libc-alpha-help@sourceware.org>,
	<http://sourceware.org/ml/#faqs>
Sender: libc-alpha-owner@sourceware.org
Delivered-To: mailing list libc-alpha@sourceware.org
Received: (qmail 126829 invoked by uid 89); 30 Jun 2016 11:44:22 -0000
Authentication-Results: sourceware.org; auth=none
X-Virus-Found: No
X-Spam-SWARE-Status: No, score=-1.9 required=5.0 tests=AWL, BAYES_00,
	FREEMAIL_FROM, RCVD_IN_DNSWL_LOW, SPF_PASS,
	URIBL_RED autolearn=ham version=3.3.2 spammy=21, 7, 21, 6,
	H*RU:209.85.213.67, Hx-spam-relays-external:209.85.213.67
X-HELO: mail-vk0-f67.google.com
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;
	d=1e100.net; s=20130820;
	h=x-gm-message-state:mime-version:in-reply-to:references:from:date
	:message-id:subject:to:cc;
	bh=z2wHGzu5TmrWB0fKjYWWHQtH6YvuINR3fu5NK/30GHI=;
	b=fJai5FyazRU5yu96i3ugV0VPEIVvgi14FO1VJmNqjKTaeiLFZIf1oxQ43SFeIWF+IM
	qKbQo9ZD9E7TuYbLhUgLcsaF0HNHw4MPRNPnVYGAjqBB9FH4SPBVu0bFVx1NfIFh/Zcm
	Ax9mBrwaqs7VhD218l8nijbAtm7J4akAWrQRe3pb+McOyk7IgY8U2wYfBlXMdKkmjbwv
	1qoP4zFUj1PApCyD9EanXxgjGrSJkA95Weso9d/yKTO/HUYVlbhlzoyfRpjXEobpZAsX
	o8VVVHT+vLHk6DmOi8hxhjWJSW6ZmNzadnqWZ0U/4zZuMIVF98ehPcSq7CyzrD/4eN+p
	dSRg==
X-Gm-Message-State: 
 ALyK8tLLsqGg5KSwQDJ7qPQlX0tU95NBcZUG+t+AZJloPthdgkXGjIweY1pa597TTefwY7hvRCfKBDtNbsy8IA==
X-Received: by 10.31.148.23 with SMTP id w23mr6587721vkd.0.1467287050076;
	Thu, 30 Jun 2016 04:44:10 -0700 (PDT)
MIME-Version: 1.0
In-Reply-To: <alpine.DEB.2.20.1606061404540.15183@digraph.polyomino.org.uk>
References: 
 <CAMXFM3tO0RGFjbrQaEV80rTCWmUy8LQjkZ7gRd9j+Q3YQ2+nAQ@mail.gmail.com>
	<alpine.DEB.2.20.1606022247540.9542@digraph.polyomino.org.uk>
	<CAMXFM3vREvMPjBU23ZGV6eyNs-exwHGG6C2WE6N1MHf=CqbkvA@mail.gmail.com>
	<alpine.DEB.2.20.1606061404540.15183@digraph.polyomino.org.uk>
From: Andrew Senkevich <andrew.n.senkevich@gmail.com>
Date: Thu, 30 Jun 2016 14:43:40 +0300
Message-ID: 
 <CAMXFM3uHWDzdEmjDk-jX14D55vMr0QbaqAKKGe5KuWVDyXcRmA@mail.gmail.com>
Subject: Re: [PATCH x86-64][BZ #20024] Fixed vector sincos/sincosf ABI
To: Joseph Myers <joseph@codesourcery.com>
Cc: libc-alpha <libc-alpha@sourceware.org>

2016-06-06 17:08 GMT+03:00 Joseph Myers <joseph@codesourcery.com>:
> On Mon, 6 Jun 2016, Andrew Senkevich wrote:
>
>> 2016-06-03 1:50 GMT+03:00 Joseph Myers <joseph@codesourcery.com>:
>> > On Tue, 31 May 2016, Andrew Senkevich wrote:
>> >
>> >> Hi,
>> >>
>> >> this patch fixes wrong vector sincos/sincosf ABI to have it compatible with
>> >> current vector function declaration.  According to current vector function
>> >> declaration vectorized sincos should have vector of pointers for second and
>> >> third parameters, so it is fixed with implementation as wrapper to version
>> >> having second and third parameters as pointers.
>> >> Is it Ok for trunk, 2.22 and 2.23 releases branches?
>> >
>> > Do you intend a followup for trunk only that exports the new functions
>> > with the intended ABI and makes the old ones into compat symbols?
>>
>> Is it suitable way to have both simd declarations for sincos in headers?
>
> (a) Would that work usefully, and cause both functions to be used
> depending on the code to be vectorized?
>
> (b) How useful are the existing functions, i.e. would real code be likely
> to use both functions?

It is hard to say about real code, but GCC 6.1 can vectorize both
cases depending on user code.

So we can have both versions and test them both with the following
patch (after main patch from this thread):


(The same change for sysdeps/x86_64/fpu/test-float-libmvec-sincosf.c)

Is this change Ok generally?
---
WBR,
Andrew

diff --git a/sysdeps/unix/sysv/linux/x86_64/libmvec.abilist
b/sysdeps/unix/sysv/linux/x86_64/libmvec.abilist
index 80d028a..e3e450c 100644
--- a/sysdeps/unix/sysv/linux/x86_64/libmvec.abilist
+++ b/sysdeps/unix/sysv/linux/x86_64/libmvec.abilist
@@ -47,3 +47,12 @@ GLIBC_2.22 _ZGVeN8v_log F
 GLIBC_2.22 _ZGVeN8v_sin F
 GLIBC_2.22 _ZGVeN8vv_pow F
 GLIBC_2.22 _ZGVeN8vvv_sincos F
+GLIBC_2.24 GLIBC_2.24 A
+GLIBC_2.24 _ZGVbN2vl8l8_sincos F
+GLIBC_2.24 _ZGVbN4vl4l4_sincosf F
+GLIBC_2.24 _ZGVcN4vl8l8_sincos F
+GLIBC_2.24 _ZGVcN8vl4l4_sincosf F
+GLIBC_2.24 _ZGVdN4vl8l8_sincos F
+GLIBC_2.24 _ZGVdN8vl4l4_sincosf F
+GLIBC_2.24 _ZGVeN16vl4l4_sincosf F
+GLIBC_2.24 _ZGVeN8vl8l8_sincos F
diff --git a/sysdeps/x86/fpu/bits/math-vector.h
b/sysdeps/x86/fpu/bits/math-vector.h
index ca43cf4..74a6bf8 100644
--- a/sysdeps/x86/fpu/bits/math-vector.h
+++ b/sysdeps/x86/fpu/bits/math-vector.h
@@ -43,9 +43,9 @@
 #  undef __DECL_SIMD_sinf
 #  define __DECL_SIMD_sinf __DECL_SIMD_x86_64
 #  undef __DECL_SIMD_sincos
-#  define __DECL_SIMD_sincos __DECL_SIMD_x86_64
+#  define __DECL_SIMD_sincos __DECL_SIMD_x86_64 _Pragma ("omp declare
simd notinbranch linear (__sinx, __cosx: 1)")
 #  undef __DECL_SIMD_sincosf
-#  define __DECL_SIMD_sincosf __DECL_SIMD_x86_64
+#  define __DECL_SIMD_sincosf __DECL_SIMD_x86_64 _Pragma ("omp
declare simd notinbranch linear (__sinx, __cosx: 1)")
 #  undef __DECL_SIMD_log
 #  define __DECL_SIMD_log __DECL_SIMD_x86_64
 #  undef __DECL_SIMD_logf
diff --git a/sysdeps/x86_64/fpu/Makefile b/sysdeps/x86_64/fpu/Makefile
index 25aef40..ee281c2 100644
--- a/sysdeps/x86_64/fpu/Makefile
+++ b/sysdeps/x86_64/fpu/Makefile
@@ -170,8 +170,8 @@ float-vlen8-arch-ext-cflags = -mavx
 float-vlen8-arch-ext2-cflags = -mavx2
 float-vlen16-arch-ext-cflags = -mavx512f

-libmvec-sincos-cflags = $(libm-test-fast-math-cflags) -fopenmp
-Wno-unknown-pragmas
-libmvec-alias-cflags = $(libmvec-sincos-cflags) -fno-inline
-ffloat-store -ffinite-math-only
+libmvec-sincos-cflags = $(libm-test-fast-math-cflags) -fopenmp
-fno-inline -Wno-unknown-pragmas
+libmvec-alias-cflags = $(libmvec-sincos-cflags) -ffloat-store
-ffinite-math-only

 CFLAGS-test-double-libmvec-alias-mod.c = $(libmvec-alias-cflags)
 CFLAGS-test-double-libmvec-alias-avx-mod.c =
$(double-vlen4-arch-ext-cflags) $(libmvec-alias-cflags) -DREQUIRE_AVX
diff --git a/sysdeps/x86_64/fpu/Versions b/sysdeps/x86_64/fpu/Versions
index 0813204..02df4b5 100644
--- a/sysdeps/x86_64/fpu/Versions
+++ b/sysdeps/x86_64/fpu/Versions
@@ -13,4 +13,8 @@ libmvec {
     _ZGVbN4vv_powf; _ZGVcN8vv_powf; _ZGVdN8vv_powf; _ZGVeN16vv_powf;
     _ZGVbN4vvv_sincosf; _ZGVcN8vvv_sincosf; _ZGVdN8vvv_sincosf;
_ZGVeN16vvv_sincosf;
   }
+  GLIBC_2.24 {
+    _ZGVbN2vl8l8_sincos; _ZGVcN4vl8l8_sincos; _ZGVdN4vl8l8_sincos;
_ZGVeN8vl8l8_sincos;
+    _ZGVbN4vl4l4_sincosf; _ZGVcN8vl4l4_sincosf; _ZGVdN8vl4l4_sincosf;
_ZGVeN16vl4l4_sincosf;
+  }
 }
diff --git a/sysdeps/x86_64/fpu/test-double-libmvec-sincos.c
b/sysdeps/x86_64/fpu/test-double-libmvec-sincos.c
index 80348a2..8fe106d 100644
--- a/sysdeps/x86_64/fpu/test-double-libmvec-sincos.c
+++ b/sysdeps/x86_64/fpu/test-double-libmvec-sincos.c
@@ -21,6 +21,7 @@

 #define N 1000
 double x[N], s[N], c[N];
+double x1[N], s1[N], c1[N];
 double* s_ptrs[N];
 double* c_ptrs[N];
 int arch_check = 1;
@@ -28,15 +29,13 @@ int arch_check = 1;
 static void
 init_arg (void)
 {
-  int i;
-
   CHECK_ARCH_EXT;

   arch_check = 0;

-  for(i = 0; i < N; i++)
+  for(int i = 0; i < N; i++)
   {
-    x[i] = i / 3;
+    x[i] = x1[i] = i / 3;
     s_ptrs[i] = &s[i];
     c_ptrs[i] = &c[i];
   }
@@ -45,16 +44,19 @@ init_arg (void)
 static int
 test_sincos_abi (void)
 {
-  int i;
-
-  init_arg ();
+#pragma omp simd
+  for(int i = 0; i < N; i++)
+    sincos (x[i], s_ptrs[i], c_ptrs[i]);

-  if (arch_check)
-    return 77;
+  return 0;
+}

+static int
+test_sincos_linear_abi (void)
+{
 #pragma omp simd
-  for(i = 0; i < N; i++)
-    sincos (x[i], s_ptrs[i], c_ptrs[i]);
+  for(int i = 0; i < N; i++)
+    sincos (x1[i], &s1[i], &c1[i]);

   return 0;
 }
@@ -62,7 +64,16 @@ test_sincos_abi (void)
 static int
 do_test (void)
 {
-    return test_sincos_abi ();
+  init_arg ();
+
+  if (arch_check)
+    return 77;
+
+  test_sincos_abi ();
+
+  test_sincos_linear_abi ();
+
+  return 0;
 }

 #define TEST_FUNCTION do_test ()