pthread wastes memory with mlockall(MCL_FUTURE)
Commit Message
On 2015-09-18 15:45 -0400, Rich Felker wrote:
> On Fri, Sep 18, 2015 at 08:29:52PM +0100, Balazs Kezes wrote:
> > So here's what I think pthreads should do: First mmap with PROT_NONE
> > and only then should mprotect read/write the stack pages.
> >
> > Does that sound reasonable?
>
> Yes.
So here's the simplest patch I could come up with:
I've verified in my pthreads example that pthreads doesn't waste memory
with this patch applied. That's not really a nice fix, but this
allocate_stack() function looks too scary to me. :(
Comments
On Fri, Sep 18, 2015 at 09:11:01PM +0100, Balazs Kezes wrote:
> On 2015-09-18 15:45 -0400, Rich Felker wrote:
> > On Fri, Sep 18, 2015 at 08:29:52PM +0100, Balazs Kezes wrote:
> > > So here's what I think pthreads should do: First mmap with PROT_NONE
> > > and only then should mprotect read/write the stack pages.
> > >
> > > Does that sound reasonable?
> >
> > Yes.
>
> So here's the simplest patch I could come up with:
>
> diff --git a/nptl/allocatestack.c b/nptl/allocatestack.c
> index 753da61..c6065dc 100644
> --- a/nptl/allocatestack.c
> +++ b/nptl/allocatestack.c
> @@ -501,12 +501,21 @@ allocate_stack (const struct pthread_attr *attr, struct pthread **pdp,
> size += pagesize_m1 + 1;
> #endif
>
> - mem = mmap (NULL, size, prot,
> + /* Map with PROT_NONE first and only then mprotect the pages to avoid
> + the kernel unnecessarily reserving the pages in the case of
> + mlockall(MCL_FUTURE). */
> + mem = mmap (NULL, size, PROT_NONE,
> MAP_PRIVATE | MAP_ANONYMOUS | MAP_STACK, -1, 0);
>
> if (__glibc_unlikely (mem == MAP_FAILED))
> return errno;
>
> + if (__glibc_unlikely (mprotect (mem, size, prot) != 0))
> + {
> + munmap (mem, size);
> + return errno;
> + }
> +
> /* SIZE is guaranteed to be greater than zero.
> So we can never get a null pointer back from mmap. */
> assert (mem != NULL);
>
>
> I've verified in my pthreads example that pthreads doesn't waste memory
> with this patch applied. That's not really a nice fix, but this
> allocate_stack() function looks too scary to me. :(
If this works, I think it's only due to a kernel bug of failing to
apply the lock after mprotect. It's also going to be considerably
slower, I think. What I had in mind was switching around the existing
mmap/mprotect order, not adding an extra mprotect.
Rich