From patchwork Tue Jul 13 08:22:14 2021
X-Patchwork-Submitter: Naohiro Tamura
X-Patchwork-Id: 44332
From: Naohiro Tamura
To: libc-alpha@sourceware.org
Subject: [PATCH] benchtests: Add memset zero fill benchmark tests
Date: Tue, 13 Jul 2021 08:22:14 +0000
Message-Id: <20210713082214.307529-1-naohirot@fujitsu.com>
X-Mailer: git-send-email 2.17.1

memset takes 0 as its second parameter in most cases.  In the Linux
kernel source code, more than 95% of memset calls pass 0 as the second
parameter.
However, bench-memset.c and bench-memset-large.c are not suitable for
measuring zero fill performance.
This patch adds bench-memset-zerofill.c and
bench-memset-large-zerofill.c, which fix the second parameter to 0 so
that zero fill performance can be measured.
---
 benchtests/Makefile                      |   3 +-
 benchtests/bench-memset-large-zerofill.c | 125 ++++++++++++++++++
 benchtests/bench-memset-zerofill.c       | 156 +++++++++++++++++++++++
 3 files changed, 283 insertions(+), 1 deletion(-)
 create mode 100644 benchtests/bench-memset-large-zerofill.c
 create mode 100644 benchtests/bench-memset-zerofill.c

diff --git a/benchtests/Makefile b/benchtests/Makefile
index 1530939a8ce8..1261f7650fc7 100644
--- a/benchtests/Makefile
+++ b/benchtests/Makefile
@@ -53,7 +53,8 @@ string-benchset := memccpy memchr memcmp memcpy memmem memmove \
                    strncasecmp strncat strncmp strncpy strnlen strpbrk strrchr \
                    strspn strstr strcpy_chk stpcpy_chk memrchr strsep strtok \
                    strcoll memcpy-large memcpy-random memmove-large memset-large \
-                   memcpy-walk memset-walk memmove-walk
+                   memcpy-walk memset-walk memmove-walk memset-zerofill \
+                   memset-large-zerofill
 
 # Build and run locale-dependent benchmarks only if we're building natively.
 ifeq (no,$(cross-compiling))
diff --git a/benchtests/bench-memset-large-zerofill.c b/benchtests/bench-memset-large-zerofill.c
new file mode 100644
index 000000000000..d8eae9d9789f
--- /dev/null
+++ b/benchtests/bench-memset-large-zerofill.c
@@ -0,0 +1,125 @@
+/* Measure memset functions with large data sizes.
+   Copyright (C) 2016-2021 Free Software Foundation, Inc.
+   This file is part of the GNU C Library.
+
+   The GNU C Library is free software; you can redistribute it and/or
+   modify it under the terms of the GNU Lesser General Public
+   License as published by the Free Software Foundation; either
+   version 2.1 of the License, or (at your option) any later version.
+
+   The GNU C Library is distributed in the hope that it will be useful,
+   but WITHOUT ANY WARRANTY; without even the implied warranty of
+   MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
+   Lesser General Public License for more details.
+
+   You should have received a copy of the GNU Lesser General Public
+   License along with the GNU C Library; if not, see
+   <https://www.gnu.org/licenses/>.  */
+
+#define TEST_MAIN
+#define TEST_NAME "memset"
+#define START_SIZE (128 * 1024)
+#define MIN_PAGE_SIZE (getpagesize () + 64 * 1024 * 1024)
+#define TIMEOUT (20 * 60)
+#include "bench-string.h"
+
+#include <assert.h>
+#include "json-lib.h"
+
+void *generic_memset (void *, int, size_t);
+typedef void *(*proto_t) (void *, int, size_t);
+
+IMPL (MEMSET, 1)
+IMPL (generic_memset, 0)
+
+static void
+do_one_test (json_ctx_t *json_ctx, impl_t *impl, CHAR *s,
+             int c __attribute ((unused)), size_t n)
+{
+  size_t i, iters = 16;
+  timing_t start, stop, cur;
+
+  TIMING_NOW (start);
+  for (i = 0; i < iters; ++i)
+    {
+      CALL (impl, s, c, n);
+    }
+  TIMING_NOW (stop);
+
+  TIMING_DIFF (cur, start, stop);
+
+  json_element_double (json_ctx, (double) cur / (double) iters);
+}
+
+static void
+do_test (json_ctx_t *json_ctx, size_t align, int c, size_t len)
+{
+  align &= 63;
+  if ((align + len) * sizeof (CHAR) > page_size)
+    return;
+
+  json_element_object_begin (json_ctx);
+  json_attr_uint (json_ctx, "length", len);
+  json_attr_uint (json_ctx, "alignment", align);
+  json_attr_int (json_ctx, "char", c);
+  json_array_begin (json_ctx, "timings");
+
+  FOR_EACH_IMPL (impl, 0)
+    {
+      do_one_test (json_ctx, impl, (CHAR *) (buf1) + align, c, len);
+      alloc_bufs ();
+    }
+
+  json_array_end (json_ctx);
+  json_element_object_end (json_ctx);
+}
+
+int
+test_main (void)
+{
+  json_ctx_t json_ctx;
+  size_t i;
+  int c;
+
+  test_init ();
+
+  json_init (&json_ctx, 0, stdout);
+
+  json_document_begin (&json_ctx);
+  json_attr_string (&json_ctx, "timing_type", TIMING_TYPE);
+
+  json_attr_object_begin (&json_ctx, "functions");
+  json_attr_object_begin (&json_ctx, TEST_NAME);
+  json_attr_string (&json_ctx, "bench-variant", "large-zerofill");
+
+  json_array_begin (&json_ctx, "ifuncs");
+  FOR_EACH_IMPL (impl, 0)
+    json_element_string (&json_ctx, impl->name);
+  json_array_end (&json_ctx);
+
+  json_array_begin (&json_ctx, "results");
+
+  c = 0;
+  for (i = START_SIZE; i <= MIN_PAGE_SIZE; i <<= 1)
+    {
+      do_test (&json_ctx, 0, c, i);
+      do_test (&json_ctx, 3, c, i);
+    }
+
+  json_array_end (&json_ctx);
+  json_attr_object_end (&json_ctx);
+  json_attr_object_end (&json_ctx);
+  json_document_end (&json_ctx);
+
+  return ret;
+}
+
+#include <support/test-driver.c>
+
+#define libc_hidden_builtin_def(X)
+#define libc_hidden_def(X)
+#define libc_hidden_weak(X)
+#define weak_alias(X,Y)
+#undef MEMSET
+#define MEMSET generic_memset
+#include <string/memset.c>
diff --git a/benchtests/bench-memset-zerofill.c b/benchtests/bench-memset-zerofill.c
new file mode 100644
index 000000000000..ac20ae4c6537
--- /dev/null
+++ b/benchtests/bench-memset-zerofill.c
@@ -0,0 +1,156 @@
+/* Measure memset functions.
+   Copyright (C) 2013-2021 Free Software Foundation, Inc.
+   This file is part of the GNU C Library.
+
+   The GNU C Library is free software; you can redistribute it and/or
+   modify it under the terms of the GNU Lesser General Public
+   License as published by the Free Software Foundation; either
+   version 2.1 of the License, or (at your option) any later version.
+
+   The GNU C Library is distributed in the hope that it will be useful,
+   but WITHOUT ANY WARRANTY; without even the implied warranty of
+   MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
+   Lesser General Public License for more details.
+
+   You should have received a copy of the GNU Lesser General Public
+   License along with the GNU C Library; if not, see
+   <https://www.gnu.org/licenses/>.  */
+
+#define TEST_MAIN
+#ifndef WIDE
+# define TEST_NAME "memset"
+#else
+# define TEST_NAME "wmemset"
+# define generic_memset generic_wmemset
+#endif /* WIDE */
+#define MIN_PAGE_SIZE 131072
+#include "bench-string.h"
+
+#include "json-lib.h"
+
+#ifdef WIDE
+CHAR *generic_wmemset (CHAR *, CHAR, size_t);
+#else
+void *generic_memset (void *, int, size_t);
+#endif
+
+typedef void *(*proto_t) (void *, int, size_t);
+
+IMPL (MEMSET, 1)
+IMPL (generic_memset, 0)
+
+static void
+do_one_test (json_ctx_t *json_ctx, impl_t *impl, CHAR *s,
+             int c __attribute ((unused)), size_t n)
+{
+  size_t i, iters = INNER_LOOP_ITERS;
+  timing_t start, stop, cur;
+
+  TIMING_NOW (start);
+  for (i = 0; i < iters; ++i)
+    {
+      CALL (impl, s, c, n);
+    }
+  TIMING_NOW (stop);
+
+  TIMING_DIFF (cur, start, stop);
+
+  json_element_double (json_ctx, (double) cur / (double) iters);
+}
+
+static void
+do_test (json_ctx_t *json_ctx, size_t align, int c, size_t len)
+{
+  align &= 4095;
+  if ((align + len) * sizeof (CHAR) > page_size)
+    return;
+
+  json_element_object_begin (json_ctx);
+  json_attr_uint (json_ctx, "length", len);
+  json_attr_uint (json_ctx, "alignment", align);
+  json_attr_int (json_ctx, "char", c);
+  json_array_begin (json_ctx, "timings");
+
+  FOR_EACH_IMPL (impl, 0)
+    {
+      do_one_test (json_ctx, impl, (CHAR *) (buf1) + align, c, len);
+      alloc_bufs ();
+    }
+
+  json_array_end (json_ctx);
+  json_element_object_end (json_ctx);
+}
+
+int
+test_main (void)
+{
+  json_ctx_t json_ctx;
+  size_t i;
+  int c = 0;
+
+  test_init ();
+
+  json_init (&json_ctx, 0, stdout);
+
+  json_document_begin (&json_ctx);
+  json_attr_string (&json_ctx, "timing_type", TIMING_TYPE);
+
+  json_attr_object_begin (&json_ctx, "functions");
+  json_attr_object_begin (&json_ctx, TEST_NAME);
+  json_attr_string (&json_ctx, "bench-variant", "default-zerofill");
+
+  json_array_begin (&json_ctx, "ifuncs");
+  FOR_EACH_IMPL (impl, 0)
+    json_element_string (&json_ctx, impl->name);
+  json_array_end (&json_ctx);
+
+  json_array_begin (&json_ctx, "results");
+
+  c = 0;
+  for (i = 0; i < 18; ++i)
+    do_test (&json_ctx, 0, c, 1 << i);
+  for (i = 1; i < 64; ++i)
+    {
+      do_test (&json_ctx, i, c, i);
+      do_test (&json_ctx, 4096 - i, c, i);
+      do_test (&json_ctx, 4095, c, i);
+      if (i & (i - 1))
+        do_test (&json_ctx, 0, c, i);
+    }
+  for (i = 32; i < 512; i+=32)
+    {
+      do_test (&json_ctx, 0, c, i);
+      do_test (&json_ctx, i, c, i);
+    }
+  do_test (&json_ctx, 1, c, 14);
+  do_test (&json_ctx, 3, c, 1024);
+  do_test (&json_ctx, 4, c, 64);
+  do_test (&json_ctx, 2, c, 25);
+  for (i = 33; i <= 256; i += 4)
+    {
+      do_test (&json_ctx, 0, c, 32 * i);
+      do_test (&json_ctx, i, c, 32 * i);
+    }
+
+  json_array_end (&json_ctx);
+  json_attr_object_end (&json_ctx);
+  json_attr_object_end (&json_ctx);
+  json_document_end (&json_ctx);
+
+  return ret;
+}
+
+#include <support/test-driver.c>
+
+#define libc_hidden_builtin_def(X)
+#define libc_hidden_def(X)
+#define libc_hidden_weak(X)
+#define weak_alias(X,Y)
+#ifndef WIDE
+# undef MEMSET
+# define MEMSET generic_memset
+# include <string/memset.c>
+#else
+# define WMEMSET generic_wmemset
+# include <wcsmbs/wmemset.c>
+#endif
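
Note for readers who want to try the idea outside the glibc tree: the
sketch below is a rough standalone approximation of what do_one_test
measures, namely the mean time of repeated memset calls with a fixed
fill value.  It is not part of the patch.  The helper names now_ns and
time_fill are hypothetical, and clock_gettime stands in for the
TIMING_NOW/TIMING_DIFF macros from the benchtests harness, so absolute
numbers will not be comparable with the benchmark output.

```c
/* Standalone sketch; not part of the patch or of glibc's benchtests.  */
#define _POSIX_C_SOURCE 199309L
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>
#include <time.h>

/* Hypothetical helper: monotonic clock reading in nanoseconds.  */
static uint64_t
now_ns (void)
{
  struct timespec ts;
  clock_gettime (CLOCK_MONOTONIC, &ts);
  return (uint64_t) ts.tv_sec * 1000000000u + (uint64_t) ts.tv_nsec;
}

/* Hypothetical helper: mean time in nanoseconds of one
   memset (s, c, n) call, averaged over ITERS calls.  The caller
   owns the buffer and must make sure N bytes at S are writable.  */
static double
time_fill (unsigned char *s, int c, size_t n, size_t iters)
{
  /* Call through a volatile function pointer so the compiler cannot
     inline memset or drop the repeated calls.  */
  void *(*volatile fn) (void *, int, size_t) = memset;

  assert (iters > 0);

  uint64_t start = now_ns ();
  for (size_t i = 0; i < iters; ++i)
    fn (s, c, n);
  uint64_t stop = now_ns ();

  return (double) (stop - start) / (double) iters;
}
```

Comparing time_fill (buf, 0, len, 16) with the same call using a
nonzero fill value, on the same buffer and length, gives a first
impression of whether the zero case behaves differently on a given
machine.  The iteration count 16 mirrors the value used in
bench-memset-large-zerofill.c; for real measurements the benchtests
harness remains the better tool.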