[v6,4/*] Generic string search functions (strstr, strcasestr, memmem)

  Hi,

I collected a profile trace of strstr calls and the results were
surprising. The biggest surprise is that in 26.5% of calls the needle
has size 1. Since gcc already transforms strstr(x,"c") into strchr, I
expected this case to be rare, but it is not, so I now optimize these
cases as likely.
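
To illustrate the idea, here is a minimal sketch of a fast path for
short needles. It is not the code from this patch; the name
strstr_short_needle is hypothetical and it simply falls back to the
existing strstr for longer needles.

```c
#include <string.h>

/* Hypothetical wrapper, not the implementation in this patch.
   Handles needles of size 0 and 1 up front, since the profile
   shows that size-1 needles are common.  */
static char *
strstr_short_needle (const char *haystack, const char *needle)
{
  /* Empty needle: matches at the start of the haystack.  */
  if (needle[0] == '\0')
    return (char *) haystack;

  /* Needle of size 1: a plain character search is enough.  */
  if (needle[1] == '\0')
    return strchr (haystack, needle[0]);

  /* Longer needles go to the general routine.  */
  return strstr (haystack, needle);
}
```

The two checks read at most two bytes of the needle, so they add very
little cost for calls that take the general path.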

Then, as I mentioned in the tile optimization thread, I had thought
that tolower would also ignore diacritics, which would make the search
diacritic-insensitive as well. That isn't the case, which allows me to
optimize strcasestr more.

I use the fact that the only two characters x for which
tolower(x) == tolower(c) are tolower(c) and toupper(c). This doesn't
have to hold for all encodings, so I add _NL_CTYPE_NONBIJECTIVE_CASE
to test that property.
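
A rough sketch of that property follows. It is only an illustration
for single-byte locales: the patch detects the property in
locale/programs/ld-ctype.c when the locale is compiled, while the
run-time check case_is_bijective and the helper strchr_nocase below
are hypothetical names that I made up for this example.

```c
#include <ctype.h>
#include <stddef.h>

/* Hypothetical run-time check of the property.  Returns 1 if, for
   every byte c, the only bytes x with tolower (x) == tolower (c)
   are tolower (c) and toupper (c), and 0 otherwise.  */
static int
case_is_bijective (void)
{
  for (int c = 0; c < 256; c++)
    for (int x = 0; x < 256; x++)
      if (tolower (x) == tolower (c)
          && x != tolower (c) && x != toupper (c))
        return 0;
  return 1;
}

/* Hypothetical helper: find the first byte of S that equals C when
   case is ignored.  Valid only when the property above holds, since
   it compares against just two candidate bytes instead of calling
   tolower on every byte of S.  */
static const char *
strchr_nocase (const char *s, int c)
{
  unsigned char lo = (unsigned char) tolower ((unsigned char) c);
  unsigned char up = (unsigned char) toupper ((unsigned char) c);

  for (; *s != '\0'; s++)
    {
      unsigned char x = (unsigned char) *s;
      if (x == lo || x == up)
        return s;
    }
  return NULL;
}
```

When the property does not hold, a search would have to fall back to
comparing tolower of each byte.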

On my computer I get the following average values, in bytes:

average size 24.06 needle size  4.31 comparisons  25.07 digraphs  0.10 trigraphs  0.08

calls   969516 succeed  40.7%

haystack aligned to 4 bytes  74.9% aligned to 8 bytes  71.7% aligned to 16 bytes  62.8%

needle aligned to 4 bytes  48.6% aligned to 8 bytes  29.7% aligned to 16 bytes  29.1%

needle found in n bytes: n <= 0:   6.2% n <= 1:   7.3% n <= 2:   9.7% n <= 3:  12.8% 
        n <= 4:  13.6% n <= 8:  16.4% n <= 16:  55.2% n <= 32: 72.3% n <= 64:  96.4%

needle size:   n <= 0:   1.7% n <= 1:   27.2% n <= 2:   60.3% n <= 3:  62.4%
  n <= 4:  64.2% n <= 8:  84.5% n <= 16:  98.8% n <= 32:  99.9% n <= 64: 100.0%

	* benchtests/bench-strcasestr.c: Remove simple_strcasestr.
	* string/test-strcasestr.c: Likewise.
	* locale/categories.def: Add _NL_CTYPE_NONBIJECTIVE_CASE.
	* locale/C-ctype.c: Likewise.
	* locale/langinfo.h (enum): Likewise.
	* locale/programs/ld-ctype.c: Detect _NL_CTYPE_NONBIJECTIVE_CASE.
	* string/memmem.c (struct): Use string skeleton.
	* string/strcasestr.c (struct): Likewise.
	* string/strstr.c (struct): Likewise.
	* sysdeps/generic/string_vector_search.h: New file.
