[PATCHv2] powerpc: Spinlock optimization and cleanup

Message ID 560C0DA6.5060409@linux.vnet.ibm.com
State Superseded
Delegated to: Tulio Magno Quites Machado Filho

Commit Message

Paul E. Murphy Sept. 30, 2015, 4:28 p.m. UTC
  Changes from V1:

* Use C macro for atomic_store_release as suggested in
comments.

* Run benchmarks to quantify the performance changes and
note them in the commit message.

---8<---
This patch optimizes the powerpc spinlock implementation by:

* The current algorithm spins on a lwzx, but only after issuing a lwarx
  first.  The optimization is to avoid that initial lwarx in the
  contended case (it is much more costly than a normal load).

* Use the correct EH hint bit on the larx for supported ISAs.  For lock
  acquisition, the thread that acquired the lock with a successful stwcx.
  does not want to give away write ownership of the cacheline.  The
  idea is to make the load reservation "sticky" about retaining write
  authority to the line.  That way, the store that must inevitably come
  to release the lock can succeed quickly and not contend with other
  threads issuing lwarx.  If another thread does a store to the line
  (false sharing), the winning thread must give up write authority to
  the line.  The proper value of EH for the larx for a lock acquisition
  is 1.

* Increase contended lock performance by up to 40%, with no measurable
  impact on uncontended locks on P8.

It also adds some cleanup: using the defined acquire-semantics
instruction macro and converting the function prototypes to default C
style.

Thanks to Adhemerval Zanella, who did most of the work.  I've run some
tests and addressed some minor feedback.
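
A rough C-level sketch of the locking flow described above (illustrative
only, not the committed code -- the real implementation stays in inline
asm so it can emit the lwarx EH hint and the branch+isync acquire
barrier explicitly):

static int
spin_lock_sketch (volatile int *lock)
{
  for (;;)
    {
      /* Outer loop: wait with plain loads (lwzx) while the lock is
         held, so waiters do not issue lwarx and steal write authority
         over the cacheline.  */
      while (*lock != 0)
        ;

      /* The lock was observed free: try to take it with an atomic
         read-modify-write (lwarx/stwcx. with acquire semantics).  */
      int expected = 0;
      if (__atomic_compare_exchange_n (lock, &expected, 1, /* weak */ 1,
                                       __ATOMIC_ACQUIRE, __ATOMIC_RELAXED))
        return 0;
    }
}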

2015-09-25  Paul E. Murphy  <murphyp@linux.vnet.ibm.com>

	* sysdeps/powerpc/nptl/pthread_spin_lock.c (pthread_spin_lock):
	Optimize first check for contention case and add lwarx hint.
	* sysdeps/powerpc/nptl/pthread_spin_trylock.c (pthread_spin_trylock):
	Use ANSI prototype.
	* sysdeps/unix/sysv/linux/powerpc/pthread_spin_unlock.c: Move to ...
	* sysdeps/powerpc/nptl/pthread_spin_unlock.c: ... here, and
	update to new atomic macros.
---
 sysdeps/powerpc/nptl/pthread_spin_lock.c           |   22 ++++++++-------
 sysdeps/powerpc/nptl/pthread_spin_trylock.c        |    3 +-
 sysdeps/powerpc/nptl/pthread_spin_unlock.c         |   27 +++++++++++++++++++
 .../unix/sysv/linux/powerpc/pthread_spin_unlock.c  |   28 --------------------
 4 files changed, 40 insertions(+), 40 deletions(-)
 create mode 100644 sysdeps/powerpc/nptl/pthread_spin_unlock.c
 delete mode 100644 sysdeps/unix/sysv/linux/powerpc/pthread_spin_unlock.c
  

Comments

Tulio Magno Quites Machado Filho Sept. 30, 2015, 8:31 p.m. UTC | #1
"Paul E. Murphy" <murphyp@linux.vnet.ibm.com> writes:

> Changes from V1:
>
> * Use C macro for atomic_store_release as suggested in
> comments.
>
> * Run benchmarks to quantize the performance changes and
> note in commit message.

Thanks for resurrecting this patch!

> This patch optimizes powerpc spinlock implementation by:
>
> * Current algorithm spin over a lwzx, but only after issuing a lwarx.
>   first.  The optimization for such case is to avoid the first lwarx
>   in contention case (which is much more costly than normal loads).
>
> * Use the correct EH hint bit on the larx for supported ISA.  For lock

lwarx

>   acquisition, the thread that acquired the lock with a successful stcx

stwcx

>   do not want to give away the write ownership on the cacheline.  The
>   idea is to make the load reservation "sticky" about retaining write
>   authority to the line.  That way, the store that must inevitably come
>   to release the lock can succeed quickly and not contend with other
>   threads issuing lwarx.  If another thread does a store to the line
>   (false sharing), the winning thread must give up write authority to

Is something missing between the previous line and the next?

>   The proper value of EH for the larx for a lock acquisition is 1.

Could you move part of this explanation to pthread_spin_lock.c as a comment,
please?
I'm referring only to the part where you explain the logic behind this
optimization.

> * Increase contented lock performance by up to 40%, and no measurable
>   impact on uncontended locks on P8.
>
> It also adds some cleanup to use the defined acquire semantic
> instructions and function prototype using default C style.
>
> Thanks to Adhemerval Zanella who did most of the work.  I've run some
> tests, and addressed some minor feedback.
>
> 2015-09-25  Paul E. Murphy  <murphyp@linux.vnet.ibm.com>

As this patch is based on the work of Adhemerval, it's important to mention
him in the ChangeLog.  Here's an example:
https://sourceware.org/git/?p=glibc.git;a=commitdiff;h=d3573f61a#patch1

> ---
>  sysdeps/powerpc/nptl/pthread_spin_lock.c           |   22 ++++++++-------
>  sysdeps/powerpc/nptl/pthread_spin_trylock.c        |    3 +-
>  sysdeps/powerpc/nptl/pthread_spin_unlock.c         |   27 +++++++++++++++++++
>  .../unix/sysv/linux/powerpc/pthread_spin_unlock.c  |   28 --------------------
>  4 files changed, 40 insertions(+), 40 deletions(-)
>  create mode 100644 sysdeps/powerpc/nptl/pthread_spin_unlock.c
>  delete mode 100644 sysdeps/unix/sysv/linux/powerpc/pthread_spin_unlock.c
>
> diff --git a/sysdeps/powerpc/nptl/pthread_spin_lock.c b/sysdeps/powerpc/nptl/pthread_spin_lock.c
> index cc081f8..8a39da9 100644
> --- a/sysdeps/powerpc/nptl/pthread_spin_lock.c
> +++ b/sysdeps/powerpc/nptl/pthread_spin_lock.c
> @@ -19,24 +19,26 @@
>  #include "pthreadP.h"
>
>  int
> -pthread_spin_lock (lock)
> -     pthread_spinlock_t *lock;
> +pthread_spin_lock (pthread_spinlock_t *lock)
>  {
>    unsigned int __tmp;
>
>    asm volatile (
> -       "1:	lwarx	%0,0,%1\n"
> +       "0:	lwzx    %0,0,%1\n"
> +       "	cmpwi   0,%0,0\n"
> +       "	bne	0b\n"
> +       "1:	lwarx	%0,0,%1" MUTEX_HINT_ACQ "\n"
>         "	cmpwi	0,%0,0\n"
>         "	bne-	2f\n"
>         "	stwcx.	%2,0,%1\n"
>         "	bne-	2f\n"
> -       "	isync\n"
> -       "	.subsection 1\n"
> -       "2:	lwzx	%0,0,%1\n"
> -       "	cmpwi	0,%0,0\n"
> -       "	bne	2b\n"
> -       "	b	1b\n"
> -       "	.previous"
> +                __ARCH_ACQ_INSTR "\n"
> +       "        .subsection 1\n"
> +       "2:	lwzx    %0,0,%1\n"
> +       "        cmpwi   0,%0,0\n"
> +       "        bne     2b\n"
> +       "        b       1b\n"
> +       "        .previous"

Some of the asm instructions are indented with spaces, while most of
them use tabs.
  
Szabolcs Nagy Oct. 1, 2015, 9:18 a.m. UTC | #2
On 30/09/15 17:28, Paul E. Murphy wrote:
>
> ---8<---
> This patch optimizes powerpc spinlock implementation by:
>
...

The glibc pthread spinlock semantics are weaker than what
POSIX requires; I'm wondering whether this is expected to stay
or whether glibc might want to switch to stronger semantics.

Is it worthwhile to add optimized asm with weak semantics
for other targets that currently use the generic C code?

(The issue is that, for correct pthread_spin_trylock behavior,
the lock should be seqcst instead of acquire and the unlock
should be release instead of barrier+store; otherwise trylock
can spuriously report a locked state.)
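
To make the concern concrete, here is a minimal sketch of the kind of
litmus test this is about (my own illustration, not code from the
thread).  Under sequentially consistent lock operations at least one of
the two trylocks below must succeed; with acquire locks and release (or
weaker) unlocks, both may return EBUSY in the same run:

#include <errno.h>
#include <pthread.h>
#include <stdio.h>

static pthread_spinlock_t l1, l2;
static int r1, r2;

static void *
t1 (void *arg)
{
  pthread_spin_lock (&l1);
  pthread_spin_unlock (&l1);
  r1 = pthread_spin_trylock (&l2);   /* Can this observe l2 as held...  */
  if (r1 == 0)
    pthread_spin_unlock (&l2);
  return NULL;
}

static void *
t2 (void *arg)
{
  pthread_spin_lock (&l2);
  pthread_spin_unlock (&l2);
  r2 = pthread_spin_trylock (&l1);   /* ... while this observes l1 as held?  */
  if (r2 == 0)
    pthread_spin_unlock (&l1);
  return NULL;
}

int
main (void)
{
  pthread_t a, b;
  pthread_spin_init (&l1, PTHREAD_PROCESS_PRIVATE);
  pthread_spin_init (&l2, PTHREAD_PROCESS_PRIVATE);
  pthread_create (&a, NULL, t1, NULL);
  pthread_create (&b, NULL, t2, NULL);
  pthread_join (a, NULL);
  pthread_join (b, NULL);
  /* Forbidden if the lock operations were sequentially consistent.  */
  if (r1 == EBUSY && r2 == EBUSY)
    puts ("both trylocks reported a locked state");
  return 0;
}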
  
Steven Munroe Oct. 1, 2015, 3:27 p.m. UTC | #3
On Thu, 2015-10-01 at 10:18 +0100, Szabolcs Nagy wrote:
> On 30/09/15 17:28, Paul E. Murphy wrote:
> >
> > ---8<---
> > This patch optimizes powerpc spinlock implementation by:
> >
> ...
> 
> The glibc pthread spinlock semantics is weaker than what
> posix requires, I'm wondering if this is expected to stay
> or glibc might want to switch to stronger semantics.
> 
Since when?  Include the text that requires this.

> is it worthwhile to add optimized asm with weak semantics
> for other targets that currently use the generic c code?
> 
> (the issue is that for correct pthread_spin_trylock behavior
> the lock should be seqcst instead of acquire and the unlock
> should be release instead of barrier+store otherwise trylock
> can spuriously report locked state).
> 

Paul's patch already changes pthread_spin_unlock to atomic_store_release,
which will generate lwsync/stw.

But I don't think anyone wants or needs pthread_spin_lock to be seqcst.

Also, as the acquire sequence used in Paul's patch is a full "import
barrier", it is sufficient for the critical region.

Read PowerISA-2.07B Book II, Appendix B "Programming Examples for Shared
Storage", Section B.2.1 "Lock Acquisition and Import Barriers".

It specifically says that a hwsync is not required if the acquire
import barrier is used, and this sequence will perform better.
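
Concretely, the acquisition sequence under discussion has roughly the
following shape (a paraphrase of the B.2.1 idiom with my annotations,
not a copy of the patch; the ",1" EH operand needs a new enough
ISA/assembler, which is what MUTEX_HINT_ACQ guards in the patch):

static inline int
try_acquire_sketch (volatile int *lock)
{
  int tmp;
  __asm__ __volatile__ (
    "1:	lwarx	%0,0,%1,1\n"  /* Load-and-reserve, EH=1 (lock acquire).  */
    "	cmpwi	0,%0,0\n"     /* Already held?  */
    "	bne-	2f\n"
    "	stwcx.	%2,0,%1\n"    /* Try to store 1 into the lock word.  */
    "	bne-	1b\n"         /* Reservation lost: retry.  */
    "	isync\n"              /* Branch + isync: acquire import barrier.  */
    "2:"
    : "=&r" (tmp)
    : "r" (lock), "r" (1)
    : "cr0", "memory");
  return tmp == 0;            /* Nonzero tmp means the lock was held.  */
}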
  
Torvald Riegel Oct. 1, 2015, 7:17 p.m. UTC | #4
On Thu, 2015-10-01 at 10:18 +0100, Szabolcs Nagy wrote:
> On 30/09/15 17:28, Paul E. Murphy wrote:
> >
> > ---8<---
> > This patch optimizes powerpc spinlock implementation by:
> >
> ...
> 
> The glibc pthread spinlock semantics is weaker than what
> posix requires, I'm wondering if this is expected to stay
> or glibc might want to switch to stronger semantics.

I think this should stay the way it is.  Thus, do what C++ and so also
C11 (http://www.open-std.org/jtc1/sc22/wg14/www/docs/summary.htm#dr_470)
specify.  Making this (it's all the mtx operations that succeed, not
just trylock) seqcst would decrease performance for the most common case
just to make arcane cases work (e.g., abusing POSIX synchronization
functions such as trylock or sem_getvalue as atomics).

Fixing that at the POSIX level would require POSIX to use a more
involved memory model (hopefully following the C11 model).  If anyone
feels like contributing to make this happen, please do so.

> is it worthwhile to add optimized asm with weak semantics
> for other targets that currently use the generic c code?

I think only nptl/pthread_spin_unlock.c should be changed; the other
generic functions use weaker memory orders already.  OTOH, this would
change existing behavior, so one can argue this is more risky than
keeping weaker-than-POSIX implementations unchanged.

> (the issue is that for correct pthread_spin_trylock behavior
> the lock should be seqcst instead of acquire and the unlock
> should be release instead of barrier+store otherwise trylock
> can spuriously report locked state).

Right now, unlock is a full barrier (ie, seqcst) plus store.  That is
stronger than a release store.  Also note that a failing POSIX
synchronization function is not supposed to synchronize memory.  So, a
failing trylock doesn't help a program unless it synchronizes through
some other way, in which case this other way will "provide" the
barriers.
  
Rich Felker Oct. 1, 2015, 7:27 p.m. UTC | #5
On Thu, Oct 01, 2015 at 09:17:06PM +0200, Torvald Riegel wrote:
> > (the issue is that for correct pthread_spin_trylock behavior
> > the lock should be seqcst instead of acquire and the unlock
> > should be release instead of barrier+store otherwise trylock
> > can spuriously report locked state).
> 
> Right now, unlock is a full barrier (ie, seqcst) plus store.  That is
> stronger than a release store.  Also note that a failing POSIX
> synchronization function is not supposed to synchronize memory.  So, a
> failing trylock doesn't help a program unless it synchronizes through
> some other way, in which case this other way will "provide" the
> barriers.

As always, the issue is not whether it synchronizes memory on failure,
but whether it can accurately determine whether to fail or not without
synchronizing memory. The latter is an implementation detail of
course, but in practice you can't make this determination without
synchronizing memory.

Rich
  
Torvald Riegel Oct. 1, 2015, 7:27 p.m. UTC | #6
On Wed, 2015-09-30 at 11:28 -0500, Paul E. Murphy wrote:
> Changes from V1:
> 
> * Use C macro for atomic_store_release as suggested in
> comments.
> 
> * Run benchmarks to quantize the performance changes and
> note in commit message.
> 
> ---8<---
> This patch optimizes powerpc spinlock implementation by:
> 
> * Current algorithm spin over a lwzx, but only after issuing a lwarx.
>   first.  The optimization for such case is to avoid the first lwarx
>   in contention case (which is much more costly than normal loads).
> 
> * Use the correct EH hint bit on the larx for supported ISA.  For lock
>   acquisition, the thread that acquired the lock with a successful stcx
>   do not want to give away the write ownership on the cacheline.  The
>   idea is to make the load reservation "sticky" about retaining write
>   authority to the line.  That way, the store that must inevitably come
>   to release the lock can succeed quickly and not contend with other
>   threads issuing lwarx.  If another thread does a store to the line
>   (false sharing), the winning thread must give up write authority to
>   The proper value of EH for the larx for a lock acquisition is 1.
> 
> * Increase contented lock performance by up to 40%, and no measurable
>   impact on uncontended locks on P8.

Could you add the tests you used to the glibc microbenchmarks (or
whatever works best for them)?  We do want to be able to track
performance, and having benchmarks is the first step towards that.

> It also adds some cleanup to use the defined acquire semantic
> instructions and function prototype using default C style.
> 
> Thanks to Adhemerval Zanella who did most of the work.  I've run some
> tests, and addressed some minor feedback.
> 
> 2015-09-25  Paul E. Murphy  <murphyp@linux.vnet.ibm.com>
> 
> 	* sysdeps/powerpc/nptl/pthread_spin_lock.c (pthread_spin_lock):
> 	Optimize first check for contention case and add lwarx hint.
> 	* sysdeps/powerpc/nptl/pthread_spin_trylock.c (pthread_spin_trylock):
> 	Use ANSI prototype.
> 	* sysdep/unix/sysv/linux/powerpc/pthread_spin_unlock.c: Move to ...
> 	* sysdeps/powerpc/nptl/pthread_spin_unlock.c: ... here, and
> 	update to new atomic macros.
> ---
>  sysdeps/powerpc/nptl/pthread_spin_lock.c           |   22 ++++++++-------
>  sysdeps/powerpc/nptl/pthread_spin_trylock.c        |    3 +-
>  sysdeps/powerpc/nptl/pthread_spin_unlock.c         |   27 +++++++++++++++++++
>  .../unix/sysv/linux/powerpc/pthread_spin_unlock.c  |   28 --------------------
>  4 files changed, 40 insertions(+), 40 deletions(-)
>  create mode 100644 sysdeps/powerpc/nptl/pthread_spin_unlock.c
>  delete mode 100644 sysdeps/unix/sysv/linux/powerpc/pthread_spin_unlock.c
> 
> diff --git a/sysdeps/powerpc/nptl/pthread_spin_lock.c b/sysdeps/powerpc/nptl/pthread_spin_lock.c
> index cc081f8..8a39da9 100644
> --- a/sysdeps/powerpc/nptl/pthread_spin_lock.c
> +++ b/sysdeps/powerpc/nptl/pthread_spin_lock.c
> @@ -19,24 +19,26 @@
>  #include "pthreadP.h"
>  
>  int
> -pthread_spin_lock (lock)
> -     pthread_spinlock_t *lock;
> +pthread_spin_lock (pthread_spinlock_t *lock)
>  {
>    unsigned int __tmp;
>  
>    asm volatile (
> -       "1:	lwarx	%0,0,%1\n"
> +       "0:	lwzx    %0,0,%1\n"
> +       "	cmpwi   0,%0,0\n"
> +       "	bne	0b\n"
> +       "1:	lwarx	%0,0,%1" MUTEX_HINT_ACQ "\n"
>         "	cmpwi	0,%0,0\n"
>         "	bne-	2f\n"
>         "	stwcx.	%2,0,%1\n"
>         "	bne-	2f\n"
> -       "	isync\n"
> -       "	.subsection 1\n"
> -       "2:	lwzx	%0,0,%1\n"
> -       "	cmpwi	0,%0,0\n"
> -       "	bne	2b\n"
> -       "	b	1b\n"
> -       "	.previous"
> +                __ARCH_ACQ_INSTR "\n"
> +       "        .subsection 1\n"
> +       "2:	lwzx    %0,0,%1\n"
> +       "        cmpwi   0,%0,0\n"
> +       "        bne     2b\n"
> +       "        b       1b\n"
> +       "        .previous"
>         : "=&r" (__tmp)
>         : "r" (lock), "r" (1)
>         : "cr0", "memory");

Is this essentially a test-and-test-and-set implementation of a lock,
with the MUTEX_HINT_ACQ hint additionally?  If so, have you considered
adding a variant of atomic_compare_exchange_weak_acquire or
atomic_exchange_acquire that sets the hint, and then writing a C
function using an atomic load in the outer test loop and the new
cmpxchg/xchg variant in the inner loop?  That would make it easier to
eventually merge this with the generic version of spinlocks.  Also,
once we get around to adding spinning and randomized exponential
backoff to the generic locks, powerpc would be able to benefit from
those generic changes too (perhaps with some additional tuning).
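
For reference, a rough sketch of that shape (my reading of the
suggestion, not code from the thread; atomic_compare_exchange_weak_acq_excl
is a hypothetical name for a glibc macro that would behave like
atomic_compare_exchange_weak_acquire but also set the larx EH=1 hint on
Power, and simply alias the plain acquire CAS elsewhere):

int
pthread_spin_lock (pthread_spinlock_t *lock)
{
  int expected;
  do
    {
      /* Outer test loop: spin with plain loads while the lock is
         observed to be held, touching the line read-only.  */
      while (atomic_load_relaxed (lock) != 0)
        ;
      expected = 0;
    }
  /* Inner step: acquire CAS carrying the exclusive-access hint.  */
  while (!atomic_compare_exchange_weak_acq_excl (lock, &expected, 1));

  return 0;
}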

> diff --git a/sysdeps/powerpc/nptl/pthread_spin_unlock.c b/sysdeps/powerpc/nptl/pthread_spin_unlock.c
> new file mode 100644
> index 0000000..f830ad2
> --- /dev/null
> +++ b/sysdeps/powerpc/nptl/pthread_spin_unlock.c
> @@ -0,0 +1,27 @@
> +/* pthread_spin_unlock -- unlock a spin lock.  PowerPC version.
> +   Copyright (C) 2007-2015 Free Software Foundation, Inc.
> +   This file is part of the GNU C Library.
> +
> +   The GNU C Library is free software; you can redistribute it and/or
> +   modify it under the terms of the GNU Lesser General Public
> +   License as published by the Free Software Foundation; either
> +   version 2.1 of the License, or (at your option) any later version.
> +
> +   The GNU C Library is distributed in the hope that it will be useful,
> +   but WITHOUT ANY WARRANTY; without even the implied warranty of
> +   MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
> +   Lesser General Public License for more details.
> +
> +   You should have received a copy of the GNU Lesser General Public
> +   License along with the GNU C Library; if not, see
> +   <http://www.gnu.org/licenses/>.  */
> +
> +#include "pthreadP.h"
> +#include <lowlevellock.h>

Do you really need to include lowlevellock.h?

> +int
> +pthread_spin_unlock (pthread_spinlock_t *lock)
> +{
> +  atomic_store_release (lock, 0);
> +  return 0;
> +}
  
Torvald Riegel Oct. 1, 2015, 7:35 p.m. UTC | #7
On Thu, 2015-10-01 at 15:27 -0400, Rich Felker wrote:
> On Thu, Oct 01, 2015 at 09:17:06PM +0200, Torvald Riegel wrote:
> > > (the issue is that for correct pthread_spin_trylock behavior
> > > the lock should be seqcst instead of acquire and the unlock
> > > should be release instead of barrier+store otherwise trylock
> > > can spuriously report locked state).
> > 
> > Right now, unlock is a full barrier (ie, seqcst) plus store.  That is
> > stronger than a release store.  Also note that a failing POSIX
> > synchronization function is not supposed to synchronize memory.  So, a
> > failing trylock doesn't help a program unless it synchronizes through
> > some other way, in which case this other way will "provide" the
> > barriers.
> 
> As always, the issue is not whether it synchronizes memory on failure,
> but whether it can accurately determine whether to fail or not without
> synchronizing memory. The latter is an implementation detail of
> course, but in practice you can't make this determination without
> synchronizing memory.

Sticking to our example here, you can determine this.  When you do a
memory_order_relaxed load as part of a trylock implementation, for
example, you "pick a position" in happens-before that always represents
a valid observation of state from this thread's perspective.  The
question is whether you make guarantees to your caller about your
observation and what it might mean for something else, or not.
  
Steven Munroe Oct. 1, 2015, 8:28 p.m. UTC | #8
On Thu, 2015-10-01 at 21:27 +0200, Torvald Riegel wrote:
> On Wed, 2015-09-30 at 11:28 -0500, Paul E. Murphy wrote:
> > Changes from V1:
> > 
> > * Use C macro for atomic_store_release as suggested in
> > comments.
> > 
> > * Run benchmarks to quantize the performance changes and
> > note in commit message.
> > 
> > ---8<---
> > This patch optimizes powerpc spinlock implementation by:
> > 
> > * Current algorithm spin over a lwzx, but only after issuing a lwarx.
> >   first.  The optimization for such case is to avoid the first lwarx
> >   in contention case (which is much more costly than normal loads).
> > 
> > * Use the correct EH hint bit on the larx for supported ISA.  For lock
> >   acquisition, the thread that acquired the lock with a successful stcx
> >   do not want to give away the write ownership on the cacheline.  The
> >   idea is to make the load reservation "sticky" about retaining write
> >   authority to the line.  That way, the store that must inevitably come
> >   to release the lock can succeed quickly and not contend with other
> >   threads issuing lwarx.  If another thread does a store to the line
> >   (false sharing), the winning thread must give up write authority to
> >   The proper value of EH for the larx for a lock acquisition is 1.
> > 
> > * Increase contented lock performance by up to 40%, and no measurable
> >   impact on uncontended locks on P8.
> 
> Could you add the tests you used to the glibc microbenchmarks (or
> whatever works best for them)?  We do want to be able to track
> performance, and having benchmarks is the first step towards that.
> 
> > It also adds some cleanup to use the defined acquire semantic
> > instructions and function prototype using default C style.
> > 
> > Thanks to Adhemerval Zanella who did most of the work.  I've run some
> > tests, and addressed some minor feedback.
> > 
> > 2015-09-25  Paul E. Murphy  <murphyp@linux.vnet.ibm.com>
> > 
> > 	* sysdeps/powerpc/nptl/pthread_spin_lock.c (pthread_spin_lock):
> > 	Optimize first check for contention case and add lwarx hint.
> > 	* sysdeps/powerpc/nptl/pthread_spin_trylock.c (pthread_spin_trylock):
> > 	Use ANSI prototype.
> > 	* sysdep/unix/sysv/linux/powerpc/pthread_spin_unlock.c: Move to ...
> > 	* sysdeps/powerpc/nptl/pthread_spin_unlock.c: ... here, and
> > 	update to new atomic macros.
> > ---
> >  sysdeps/powerpc/nptl/pthread_spin_lock.c           |   22 ++++++++-------
> >  sysdeps/powerpc/nptl/pthread_spin_trylock.c        |    3 +-
> >  sysdeps/powerpc/nptl/pthread_spin_unlock.c         |   27 +++++++++++++++++++
> >  .../unix/sysv/linux/powerpc/pthread_spin_unlock.c  |   28 --------------------
> >  4 files changed, 40 insertions(+), 40 deletions(-)
> >  create mode 100644 sysdeps/powerpc/nptl/pthread_spin_unlock.c
> >  delete mode 100644 sysdeps/unix/sysv/linux/powerpc/pthread_spin_unlock.c
> > 
> > diff --git a/sysdeps/powerpc/nptl/pthread_spin_lock.c b/sysdeps/powerpc/nptl/pthread_spin_lock.c
> > index cc081f8..8a39da9 100644
> > --- a/sysdeps/powerpc/nptl/pthread_spin_lock.c
> > +++ b/sysdeps/powerpc/nptl/pthread_spin_lock.c
> > @@ -19,24 +19,26 @@
> >  #include "pthreadP.h"
> >  
> >  int
> > -pthread_spin_lock (lock)
> > -     pthread_spinlock_t *lock;
> > +pthread_spin_lock (pthread_spinlock_t *lock)
> >  {
> >    unsigned int __tmp;
> >  
> >    asm volatile (
> > -       "1:	lwarx	%0,0,%1\n"
> > +       "0:	lwzx    %0,0,%1\n"
> > +       "	cmpwi   0,%0,0\n"
> > +       "	bne	0b\n"
> > +       "1:	lwarx	%0,0,%1" MUTEX_HINT_ACQ "\n"
> >         "	cmpwi	0,%0,0\n"
> >         "	bne-	2f\n"
> >         "	stwcx.	%2,0,%1\n"
> >         "	bne-	2f\n"
> > -       "	isync\n"
> > -       "	.subsection 1\n"
> > -       "2:	lwzx	%0,0,%1\n"
> > -       "	cmpwi	0,%0,0\n"
> > -       "	bne	2b\n"
> > -       "	b	1b\n"
> > -       "	.previous"
> > +                __ARCH_ACQ_INSTR "\n"
> > +       "        .subsection 1\n"
> > +       "2:	lwzx    %0,0,%1\n"
> > +       "        cmpwi   0,%0,0\n"
> > +       "        bne     2b\n"
> > +       "        b       1b\n"
> > +       "        .previous"
> >         : "=&r" (__tmp)
> >         : "r" (lock), "r" (1)
> >         : "cr0", "memory");
> 
> Is this essentially a test-and-test-and-set implementation of a lock,
> with the MUTEX_HINT_ACQ hint additionally?  If so, have you considered
> adding a variant of atomic_compare_exchange_weak_acquire or
> atomic_exchange_acquire that sets the hint, and then writing a C
> function using an atomic load in the outer test loop and the new
> cmpxchg/xchg variant in the inner loop?  That would make it easier to
> eventually merge this with the generic version of spinlocks.  Also,
> once we get around to adding spinning and randomized exponential
> backoff to the generic locks, powerpc would be able to benefit from
> those generic changes too (perhaps with some additional tuning).
> 
Paul is traveling today so I will answer.

It may be functionally possible to use a 

__atomic_compare_exchange_n ((mem), (expected), (desired),
	weak, __ATOMIC_ACQUIRE, __ATOMIC_RELAXED)

in this construct.

But the cmpxchg is a clumsy way to generate the PowerISA Acquire Import
Barrier which has a very specific form. 

Also it is not obvious how the MUTEX_HINT_ACQ applies to
__atomic_compare_exchange_n in general or even
__atomic_compare_exchange_n (*, *, *, *, __ATOMIC_ACQUIRE,
__ATOMIC_RELAXED).

In the PowerISA, MUTEX_HINT_ACQ means:

 The Load and Reserve instructions include an Exclusive
 Access hint (EH), which can be used to indicate
 that the instruction sequence being executed is implementing
 one of two types of algorithms:

 Atomic Update (EH=0)
  This hint indicates that the program is using a fetch and
  operate (e.g., fetch and add) or some similar algorithm
  and that all programs accessing the shared variable are
  likely to use a similar operation to access the shared
  variable for some time.
 Exclusive Access (EH=1)
  This hint indicates that the program is attempting to
  acquire a lock and if it succeeds, will perform another
  store to the lock variable (releasing the lock) before
  another program attempts to modify the lock variable.

It is not clear that C11 __atomic_compare_exchange when/if/ever implies
exclusive access in the meaning of the PowerISA.
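
For context, the hint reaches the asm through a macro along these lines
(a hedged sketch from memory of the powerpc sysdep headers; the exact
file and guard macro may differ):

/* When the target ISA/assembler accepts the EH operand, MUTEX_HINT_ACQ
   expands to ",1" and gets pasted onto the lwarx in the inline asm
   string; otherwise it expands to nothing and a plain lwarx is used.  */
#ifdef _ARCH_PWR6
# define MUTEX_HINT_ACQ ",1"   /* Exclusive Access (lock acquisition).  */
# define MUTEX_HINT_REL ",0"   /* Atomic update.  */
#else
# define MUTEX_HINT_ACQ
# define MUTEX_HINT_REL
#endif

/* Used as in the patch:  "1:	lwarx	%0,0,%1" MUTEX_HINT_ACQ "\n"  */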

Finally trying to combine atomic_compare_exchange_weak_acquire with the
relaxed load spin is likely to generate some awkward sequences with
unnecessary code.

I would prefer to proceed with this implementation and revisit when we
are further along with C11 integration.

> > diff --git a/sysdeps/powerpc/nptl/pthread_spin_unlock.c b/sysdeps/powerpc/nptl/pthread_spin_unlock.c
> > new file mode 100644
> > index 0000000..f830ad2
> > --- /dev/null
> > +++ b/sysdeps/powerpc/nptl/pthread_spin_unlock.c
> > @@ -0,0 +1,27 @@
> > +/* pthread_spin_unlock -- unlock a spin lock.  PowerPC version.
> > +   Copyright (C) 2007-2015 Free Software Foundation, Inc.
> > +   This file is part of the GNU C Library.
> > +
> > +   The GNU C Library is free software; you can redistribute it and/or
> > +   modify it under the terms of the GNU Lesser General Public
> > +   License as published by the Free Software Foundation; either
> > +   version 2.1 of the License, or (at your option) any later version.
> > +
> > +   The GNU C Library is distributed in the hope that it will be useful,
> > +   but WITHOUT ANY WARRANTY; without even the implied warranty of
> > +   MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
> > +   Lesser General Public License for more details.
> > +
> > +   You should have received a copy of the GNU Lesser General Public
> > +   License along with the GNU C Library; if not, see
> > +   <http://www.gnu.org/licenses/>.  */
> > +
> > +#include "pthreadP.h"
> > +#include <lowlevellock.h>
> 
> Do you really need to include lowlevellock.h?
> 
> > +int
> > +pthread_spin_unlock (pthread_spinlock_t *lock)
> > +{
> > +  atomic_store_release (lock, 0);
> > +  return 0;
> > +}
> 
>
  
Torvald Riegel Oct. 2, 2015, 11:38 a.m. UTC | #9
On Thu, 2015-10-01 at 15:28 -0500, Steven Munroe wrote:
> On Thu, 2015-10-01 at 21:27 +0200, Torvald Riegel wrote:
> > On Wed, 2015-09-30 at 11:28 -0500, Paul E. Murphy wrote:
> > > diff --git a/sysdeps/powerpc/nptl/pthread_spin_lock.c b/sysdeps/powerpc/nptl/pthread_spin_lock.c
> > > index cc081f8..8a39da9 100644
> > > --- a/sysdeps/powerpc/nptl/pthread_spin_lock.c
> > > +++ b/sysdeps/powerpc/nptl/pthread_spin_lock.c
> > > @@ -19,24 +19,26 @@
> > >  #include "pthreadP.h"
> > >  
> > >  int
> > > -pthread_spin_lock (lock)
> > > -     pthread_spinlock_t *lock;
> > > +pthread_spin_lock (pthread_spinlock_t *lock)
> > >  {
> > >    unsigned int __tmp;
> > >  
> > >    asm volatile (
> > > -       "1:	lwarx	%0,0,%1\n"
> > > +       "0:	lwzx    %0,0,%1\n"
> > > +       "	cmpwi   0,%0,0\n"
> > > +       "	bne	0b\n"
> > > +       "1:	lwarx	%0,0,%1" MUTEX_HINT_ACQ "\n"
> > >         "	cmpwi	0,%0,0\n"
> > >         "	bne-	2f\n"
> > >         "	stwcx.	%2,0,%1\n"
> > >         "	bne-	2f\n"
> > > -       "	isync\n"
> > > -       "	.subsection 1\n"
> > > -       "2:	lwzx	%0,0,%1\n"
> > > -       "	cmpwi	0,%0,0\n"
> > > -       "	bne	2b\n"
> > > -       "	b	1b\n"
> > > -       "	.previous"
> > > +                __ARCH_ACQ_INSTR "\n"
> > > +       "        .subsection 1\n"
> > > +       "2:	lwzx    %0,0,%1\n"
> > > +       "        cmpwi   0,%0,0\n"
> > > +       "        bne     2b\n"
> > > +       "        b       1b\n"
> > > +       "        .previous"
> > >         : "=&r" (__tmp)
> > >         : "r" (lock), "r" (1)
> > >         : "cr0", "memory");
> > 
> > Is this essentially a test-and-test-and-set implementation of a lock,
> > with the MUTEX_HINT_ACQ hint additionally?  If so, have you considered
> > adding a variant of atomic_compare_exchange_weak_acquire or
> > atomic_exchange_acquire that sets the hint, and then writing a C
> > function using an atomic load in the outer test loop and the new
> > cmpxchg/xchg variant in the inner loop?  That would make it easier to
> > eventually merge this with the generic version of spinlocks.  Also,
> > once we get around to adding spinning and randomized exponential
> > backoff to the generic locks, powerpc would be able to benefit from
> > those generic changes too (perhaps with some additional tuning).
> > 
> Paul is traveling today so I will answer.
> 
> It may be functionally possible to use a 
> 
> __atomic_compare_exchange_n ((mem), (expected), (desired),
> 	weak, __ATOMIC_ACQUIRE, __ATOMIC_RELAXED)
> 
> in this construct.

I agree, but that's not what I proposed.

> But the cmpxchg is a clumsy way to generate the PowerISA Acquire Import
> Barrier which has a very specific form. 
> 
> Also it is not obvious how the MUTEX_HINT_ACQ applies to
> __atomic_compare_exchange_n in general or even
> __atomic_compare_exchange_n (*, *, *, *, __ATOMIC_ACQUIRE,
> __ATOMIC_RELAXED).
> 
> In the PowerISA, MUTEX_HINT_ACQ means:
> 
>  The Load and Reserve instructions include an Exclusive
>  Access hint (EH), which can be used to indicate
>  that the instruction sequence being executed is implementing
>  one of two types of algorithms:
> 
>  Atomic Update (EH=0)
>   This hint indicates that the program is using a fetch and
>   operate (e.g., fetch and add) or some similar algorithm
>   and that all programs accessing the shared variable are
>   likely to use a similar operation to access the shared
>   variable for some time.
>  Exclusive Access (EH=1)
>   This hint indicates that the program is attempting to
>   acquire a lock and if it succeeds, will perform another
>   store to the lock variable (releasing the lock) before
>   another program attempts to modify the lock variable.
> 
> It is not clear that C11 __atomic_compare_exchange when/if/ever implies
> exclusive access in the meaning of the PowerISA.

Yes.  The hint is orthogonal to the C11 CAS semantics.

This is why I asked about adding a variant for *glibc's*
atomic_compare_exchange_weak_acquire, see include/atomic.h.  This would
likely be implemented with custom asm on Power, and would just defer to
atomic_compare_exchange_weak_acquire on all other archs.

Whether introducing that is better than just specializing the spinlocks
is something you will have to figure out.  But there are several
lock-like CASes throughout the code (e.g., pthread_once, so not just in
the mutexes or rwlock).  You probably don't want to have custom Power
variants of all those synchronization mechanisms.

> Finally trying to combine atomic_compare_exchange_weak_acquire with the
> relaxed load spin is likely to generate some awkward sequences with
> unnecessary code.

Perhaps, but that's determined by the compiler and how good it is at
optimizing atomics.  Whether it matters will have to be tracked by a
benchmark: If this unnecessary code matters and the compiler isn't doing
a good enough job yet, it should be visible in some benchmark; once it
is, the benchmark will tell us and we're not stuck with an "old
assumption" in our code.

> I would prefer to proceed with this implementation and revisit when we
> are further along with C11 integration.

I don't mind having intermediate steps.  But if you do something
Power-specific, and then somebody else improves the generic version, IMO
this other person has no obligation to spend any effort on updating your
Power-specific version too.
  
Paul E. Murphy Oct. 7, 2015, 7 p.m. UTC | #10
On 10/01/2015 02:27 PM, Torvald Riegel wrote:
> On Wed, 2015-09-30 at 11:28 -0500, Paul E. Murphy wrote:
>> Changes from V1:
>>
>> * Use C macro for atomic_store_release as suggested in
>> comments.
>>
>> * Run benchmarks to quantize the performance changes and
>> note in commit message.
>>
>> ---8<---
>> This patch optimizes powerpc spinlock implementation by:
>>
>> * Current algorithm spin over a lwzx, but only after issuing a lwarx.
>>   first.  The optimization for such case is to avoid the first lwarx
>>   in contention case (which is much more costly than normal loads).
>>
>> * Use the correct EH hint bit on the larx for supported ISA.  For lock
>>   acquisition, the thread that acquired the lock with a successful stcx
>>   do not want to give away the write ownership on the cacheline.  The
>>   idea is to make the load reservation "sticky" about retaining write
>>   authority to the line.  That way, the store that must inevitably come
>>   to release the lock can succeed quickly and not contend with other
>>   threads issuing lwarx.  If another thread does a store to the line
>>   (false sharing), the winning thread must give up write authority to
>>   The proper value of EH for the larx for a lock acquisition is 1.
>>
>> * Increase contented lock performance by up to 40%, and no measurable
>>   impact on uncontended locks on P8.
> 
> Could you add the tests you used to the glibc microbenchmarks (or
> whatever works best for them)?  We do want to be able to track
> performance, and having benchmarks is the first step towards that.

Would you object to that being a separate patch?

As I sketch out the work to integrate it into our benchtest, it seems
like it could be trivially extended to mutexes as well.

>> diff --git a/sysdeps/powerpc/nptl/pthread_spin_unlock.c b/sysdeps/powerpc/nptl/pthread_spin_unlock.c
>> new file mode 100644
>> index 0000000..f830ad2
>> --- /dev/null
>> +++ b/sysdeps/powerpc/nptl/pthread_spin_unlock.c
>> @@ -0,0 +1,27 @@
>> +/* pthread_spin_unlock -- unlock a spin lock.  PowerPC version.
>> +   Copyright (C) 2007-2015 Free Software Foundation, Inc.
>> +   This file is part of the GNU C Library.
>> +
>> +   The GNU C Library is free software; you can redistribute it and/or
>> +   modify it under the terms of the GNU Lesser General Public
>> +   License as published by the Free Software Foundation; either
>> +   version 2.1 of the License, or (at your option) any later version.
>> +
>> +   The GNU C Library is distributed in the hope that it will be useful,
>> +   but WITHOUT ANY WARRANTY; without even the implied warranty of
>> +   MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
>> +   Lesser General Public License for more details.
>> +
>> +   You should have received a copy of the GNU Lesser General Public
>> +   License along with the GNU C Library; if not, see
>> +   <http://www.gnu.org/licenses/>.  */
>> +
>> +#include "pthreadP.h"
>> +#include <lowlevellock.h>
> 
> Do you really need to include lowlevellock.h?

No, atomic.h is needed at minimum.

Thanks,
Paul
  
Torvald Riegel Oct. 8, 2015, 3:22 p.m. UTC | #11
On Wed, 2015-10-07 at 14:00 -0500, Paul E. Murphy wrote:
> 
> On 10/01/2015 02:27 PM, Torvald Riegel wrote:
> > On Wed, 2015-09-30 at 11:28 -0500, Paul E. Murphy wrote:
> >> Changes from V1:
> >>
> >> * Use C macro for atomic_store_release as suggested in
> >> comments.
> >>
> >> * Run benchmarks to quantize the performance changes and
> >> note in commit message.
> >>
> >> ---8<---
> >> This patch optimizes powerpc spinlock implementation by:
> >>
> >> * Current algorithm spin over a lwzx, but only after issuing a lwarx.
> >>   first.  The optimization for such case is to avoid the first lwarx
> >>   in contention case (which is much more costly than normal loads).
> >>
> >> * Use the correct EH hint bit on the larx for supported ISA.  For lock
> >>   acquisition, the thread that acquired the lock with a successful stcx
> >>   do not want to give away the write ownership on the cacheline.  The
> >>   idea is to make the load reservation "sticky" about retaining write
> >>   authority to the line.  That way, the store that must inevitably come
> >>   to release the lock can succeed quickly and not contend with other
> >>   threads issuing lwarx.  If another thread does a store to the line
> >>   (false sharing), the winning thread must give up write authority to
> >>   The proper value of EH for the larx for a lock acquisition is 1.
> >>
> >> * Increase contented lock performance by up to 40%, and no measurable
> >>   impact on uncontended locks on P8.
> > 
> > Could you add the tests you used to the glibc microbenchmarks (or
> > whatever works best for them)?  We do want to be able to track
> > performance, and having benchmarks is the first step towards that.
> 
> Would you object to that being a separate patch?

No.

> As I sketch out the work to integrate it into our benchtest, it seems
> like it could be trivially extended to mutexes aswell.

Agreed.
  

Patch

diff --git a/sysdeps/powerpc/nptl/pthread_spin_lock.c b/sysdeps/powerpc/nptl/pthread_spin_lock.c
index cc081f8..8a39da9 100644
--- a/sysdeps/powerpc/nptl/pthread_spin_lock.c
+++ b/sysdeps/powerpc/nptl/pthread_spin_lock.c
@@ -19,24 +19,26 @@ 
 #include "pthreadP.h"
 
 int
-pthread_spin_lock (lock)
-     pthread_spinlock_t *lock;
+pthread_spin_lock (pthread_spinlock_t *lock)
 {
   unsigned int __tmp;
 
   asm volatile (
-       "1:	lwarx	%0,0,%1\n"
+       "0:	lwzx    %0,0,%1\n"
+       "	cmpwi   0,%0,0\n"
+       "	bne	0b\n"
+       "1:	lwarx	%0,0,%1" MUTEX_HINT_ACQ "\n"
        "	cmpwi	0,%0,0\n"
        "	bne-	2f\n"
        "	stwcx.	%2,0,%1\n"
        "	bne-	2f\n"
-       "	isync\n"
-       "	.subsection 1\n"
-       "2:	lwzx	%0,0,%1\n"
-       "	cmpwi	0,%0,0\n"
-       "	bne	2b\n"
-       "	b	1b\n"
-       "	.previous"
+                __ARCH_ACQ_INSTR "\n"
+       "        .subsection 1\n"
+       "2:	lwzx    %0,0,%1\n"
+       "        cmpwi   0,%0,0\n"
+       "        bne     2b\n"
+       "        b       1b\n"
+       "        .previous"
        : "=&r" (__tmp)
        : "r" (lock), "r" (1)
        : "cr0", "memory");
diff --git a/sysdeps/powerpc/nptl/pthread_spin_trylock.c b/sysdeps/powerpc/nptl/pthread_spin_trylock.c
index 77a4615..c485aa4 100644
--- a/sysdeps/powerpc/nptl/pthread_spin_trylock.c
+++ b/sysdeps/powerpc/nptl/pthread_spin_trylock.c
@@ -20,8 +20,7 @@ 
 #include "pthreadP.h"
 
 int
-pthread_spin_trylock (lock)
-     pthread_spinlock_t *lock;
+pthread_spin_trylock (pthread_spinlock_t *lock)
 {
   unsigned int old;
   int err = EBUSY;
diff --git a/sysdeps/powerpc/nptl/pthread_spin_unlock.c b/sysdeps/powerpc/nptl/pthread_spin_unlock.c
new file mode 100644
index 0000000..f830ad2
--- /dev/null
+++ b/sysdeps/powerpc/nptl/pthread_spin_unlock.c
@@ -0,0 +1,27 @@ 
+/* pthread_spin_unlock -- unlock a spin lock.  PowerPC version.
+   Copyright (C) 2007-2015 Free Software Foundation, Inc.
+   This file is part of the GNU C Library.
+
+   The GNU C Library is free software; you can redistribute it and/or
+   modify it under the terms of the GNU Lesser General Public
+   License as published by the Free Software Foundation; either
+   version 2.1 of the License, or (at your option) any later version.
+
+   The GNU C Library is distributed in the hope that it will be useful,
+   but WITHOUT ANY WARRANTY; without even the implied warranty of
+   MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
+   Lesser General Public License for more details.
+
+   You should have received a copy of the GNU Lesser General Public
+   License along with the GNU C Library; if not, see
+   <http://www.gnu.org/licenses/>.  */
+
+#include "pthreadP.h"
+#include <lowlevellock.h>
+
+int
+pthread_spin_unlock (pthread_spinlock_t *lock)
+{
+  atomic_store_release (lock, 0);
+  return 0;
+}
diff --git a/sysdeps/unix/sysv/linux/powerpc/pthread_spin_unlock.c b/sysdeps/unix/sysv/linux/powerpc/pthread_spin_unlock.c
deleted file mode 100644
index 7af694f..0000000
--- a/sysdeps/unix/sysv/linux/powerpc/pthread_spin_unlock.c
+++ /dev/null
@@ -1,28 +0,0 @@ 
-/* pthread_spin_unlock -- unlock a spin lock.  PowerPC version.
-   Copyright (C) 2007-2015 Free Software Foundation, Inc.
-   This file is part of the GNU C Library.
-
-   The GNU C Library is free software; you can redistribute it and/or
-   modify it under the terms of the GNU Lesser General Public
-   License as published by the Free Software Foundation; either
-   version 2.1 of the License, or (at your option) any later version.
-
-   The GNU C Library is distributed in the hope that it will be useful,
-   but WITHOUT ANY WARRANTY; without even the implied warranty of
-   MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
-   Lesser General Public License for more details.
-
-   You should have received a copy of the GNU Lesser General Public
-   License along with the GNU C Library; if not, see
-   <http://www.gnu.org/licenses/>.  */
-
-#include "pthreadP.h"
-#include <lowlevellock.h>
-
-int
-pthread_spin_unlock (pthread_spinlock_t *lock)
-{
-  __asm __volatile (__ARCH_REL_INSTR ::: "memory");
-  *lock = 0;
-  return 0;
-}