powerpc: strcasestr optimization

  On 06/01/2015 09:52 PM, Ondřej Bílka wrote:
> On Mon, Jun 01, 2015 at 09:36:47AM -0500, Steven Munroe wrote:
>> On Mon, 2015-06-01 at 14:28 +0200, Ondřej Bílka wrote:
>>> On Mon, Jun 01, 2015 at 04:11:28PM +0530, Rajalakshmi Srinivasaraghavan wrote:
>>>>
>>>> This patch optimizes strcasestr function for power >= 7 systems.
>>>> This patch uses optimized strlen and strnlen for calculating
>>>> string length and the average improvement of this optimization is ~40%.
>>>> This patch is tested on powerpc64 and powerpc64le.
>>>> Attached the benchresults with this new patch.
>>>>
>>> Thats not enough. As strcasestr that I submited is around three times
>>> slower your implementation would likely be regression over generic one.
>>>
>>> A problem here is that you use moronic algorithm. Fix algorithm first
>>> before trying to optimize it.
>>>
>>
>> This is not very helpful. You are demanding changes without clear
>> explanation and justification.
>>
>> What is wrong with Raja's algorithm? What is insufficient in the
>> benchmark data she has provided? And why do you think your specific
>> design applies to PowerISA and POWER7/POWER8 micro-architecture.
>>
>> What data do you have that justified this objection?
>
> I replied on strstr patch thread on why what she submitted is
> performance regression. So I will repeat arguments from other thread
> which still apply.
>
> First was problem with quadratic behaviour. She tried to fix it but it
> isn't a fix at all. Just benchmark
>
> strcasestr ("aaa...(4000 times)...aaa", "aaa...(2000 times)...aab")
>
> That call would take around hundred times than before which is
> unacceptable.

This is already handled in the patch.If the needle len is more than
2048, it calls default string/strcasestr.c
>
> If we ignore that red flag second problem was that benchmark she used is
> bogus. It test with periodic haystacks, needle is copy of first bytes of
> haystack with last byte set to something else.
Which benchmark are you referring as bogus? The benchtest result 
attached in the previous thread was created using 
benchtests/bench-strcasestr.c . Since your proposed benchtest changes 
were not yet committed, I have used default ones.
.
.
.

>
> Just use same patch like I send with ((unsigned) rand())%16 + 1 and you
> will see completely different numbers in benchmark.
>
Benchtest results attached with these changes.
.
.

>
> As I don't have powerpc access now apply my patches
>
> [PATCH v5] Generic string skeleton
> [PATCH v5 4*] Generic string search functions (strstr, strcasestr, memmem)
>
I have attached the benchtest result with the above patches applied
along with benchtests/bench-strcasestr.c changes.(similiar changed as 
proposed by you for benchtests/bench-strstr.c).
The result attached clearly shows improvement.
> and run (preferably fixed) benchmark with these. As gains that I see on
> x64 are bigger than ones gained by this assembly you will likely see
> that generic implementation is indeed better and it would be pointless
> to try review that only to remove it shortly after adding to improve
> performance.
>
>

Message ID	556D705D.20201@linux.vnet.ibm.com
State	Dropped
Delegated to:	Tulio Magno Quites Machado Filho
Headers	Received: (qmail 31968 invoked by alias); 2 Jun 2015 09:01:11 -0000 Mailing-List: contact libc-alpha-help@sourceware.org; run by ezmlm Precedence: bulk List-Id: <libc-alpha.sourceware.org> List-Unsubscribe: <mailto:libc-alpha-unsubscribe-##L=##H@sourceware.org> List-Subscribe: <mailto:libc-alpha-subscribe@sourceware.org> List-Archive: <http://sourceware.org/ml/libc-alpha/> List-Post: <mailto:libc-alpha@sourceware.org> List-Help: <mailto:libc-alpha-help@sourceware.org>, <http://sourceware.org/ml/#faqs> Sender: libc-alpha-owner@sourceware.org Delivered-To: mailing list libc-alpha@sourceware.org Received: (qmail 31941 invoked by uid 89); 2 Jun 2015 09:01:08 -0000 Authentication-Results: sourceware.org; auth=none X-Virus-Found: No X-Spam-SWARE-Status: No, score=2.2 required=5.0 tests=AWL, BAYES_60, KAM_LAZY_DOMAIN_SECURITY, T_RP_MATCHES_RCVD autolearn=no version=3.3.2 X-HELO: e28smtp07.in.ibm.com Message-ID: <556D705D.20201@linux.vnet.ibm.com> Date: Tue, 02 Jun 2015 14:29:09 +0530 From: Rajalakshmi Srinivasaraghavan <raji@linux.vnet.ibm.com> User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:31.0) Gecko/20100101 Thunderbird/31.7.0 MIME-Version: 1.0 To: =?UTF-8?B?T25kxZllaiBCw61sa2E=?= <neleai@seznam.cz>, Steven Munroe <munroesj@linux.vnet.ibm.com> CC: GNU C Library <libc-alpha@sourceware.org>, Steve Munroe <sjmunroe@us.ibm.com> Subject: Re: [PATCH] powerpc: strcasestr optimization References: <55687597.1060101@linux.vnet.ibm.com> <556C36D8.2070208@linux.vnet.ibm.com> <20150601122830.GA14649@domone> <1433169407.10235.5.camel@sjmunroe-ThinkPad-W500> <20150601162215.GA8955@domone> In-Reply-To: <20150601162215.GA8955@domone> Content-Type: multipart/mixed; boundary="------------080704070703060808000908" X-TM-AS-MML: disable X-Content-Scanned: Fidelis XPS MAILER x-cbid: 15060208-0025-0000-0000-0000052241AF

powerpc: strcasestr optimization

Commit Message

Patch