From patchwork Tue Sep 16 11:52:37 2025
Content-Type: text/plain; charset="utf-8"
MIME-Version: 1.0
Content-Transfer-Encoding: 7bit
X-Patchwork-Submitter: "Li, Pan2"
X-Patchwork-Id: 58956
From: pan2.li@intel.com
To: gcc-patches@gcc.gnu.org
Cc: juzhe.zhong@rivai.ai, kito.cheng@gmail.com, jeffreyalaw@gmail.com,
 rdapp.gcc@gmail.com, ken.chen@intel.com, hongtao.liu@intel.com, Pan Li
Subject: [PATCH v2 0/4] RISC-V: Combine vec_duplicate + v{widen}u.vv to
 v{widen}u.vx on GR2VR cost
Date: Tue, 16 Sep 2025 19:52:37 +0800
Message-ID: <20250916115428.1176402-1-pan2.li@intel.com>
X-Mailer: git-send-email 2.43.0
List-Id: Gcc-patches mailing list

From: Pan Li

This patch series combines vec_duplicate + v{widen}u.vv into
v{widen}u.vx, depending on the cost value of GR2VR.  The late-combine
pass performs the combine if the cost of GR2VR is zero, and rejects it
if the cost is non-zero, like 1, 2, 15 in the tests.

The following insns, widening from uint32_t to uint64_t, are included:

* vwaddu.vx
* vwsubu.vx
* vwmulu.vx

From:
  |   ...
  |   vmv.v.x
  | L1:
  |   v{widen}u.vv
  |   J L1
  |   ...

To:
  |   ...
  | L1:
  |   v{widen}u.vx
  |   J L1
  |   ...

The following test suite passed for this patch series:

* The rv64gcv full regression test.

Pan Li (4):
  RISC-V: Combine vec_duplicate + vwaddu.vv to vwaddu.vx on GR2VR cost
  RISC-V: Add test for vec_duplicate + vwaddu.vv signed combine with
    GR2VR cost 0, 1 and 15
  RISC-V: Add test for vec_duplicate + vwsubu.vv signed combine with
    GR2VR cost 0, 1 and 15
  RISC-V: Add test for vec_duplicate + vwmulu.vv signed combine with
    GR2VR cost 0, 1 and 15

 gcc/config/riscv/autovec-opt.md               |  42 +++++
 gcc/config/riscv/iterators.md                 |   3 +
 gcc/config/riscv/vector-iterators.md          |  16 ++
 .../riscv/rvv/autovec/vx_vf/vx-1-u16.c        |   6 +
 .../riscv/rvv/autovec/vx_vf/vx-1-u32.c        |   6 +
 .../riscv/rvv/autovec/vx_vf/vx-1-u64.c        |   6 +
 .../riscv/rvv/autovec/vx_vf/vx-2-u16.c        |   6 +
 .../riscv/rvv/autovec/vx_vf/vx-2-u32.c        |   6 +
 .../riscv/rvv/autovec/vx_vf/vx-2-u64.c        |   6 +
 .../riscv/rvv/autovec/vx_vf/vx-3-u16.c        |   6 +
 .../riscv/rvv/autovec/vx_vf/vx-3-u32.c        |   6 +
 .../riscv/rvv/autovec/vx_vf/vx-3-u64.c        |   6 +
 .../rvv/autovec/vx_vf/vx_vwaddu-run-1-u64.c   |  18 ++
 .../rvv/autovec/vx_vf/vx_vwmulu-run-1-u64.c   |  18 ++
 .../rvv/autovec/vx_vf/vx_vwsubu-run-1-u64.c   |  18 ++
 .../riscv/rvv/autovec/vx_vf/vx_widen.h        |  36 ++++
 .../riscv/rvv/autovec/vx_vf/vx_widen_data.h   | 159 ++++++++++++++++++
 .../riscv/rvv/autovec/vx_vf/vx_widen_vx_run.h |  27 +++
 18 files changed, 391 insertions(+)
 create mode 100644 gcc/testsuite/gcc.target/riscv/rvv/autovec/vx_vf/vx_vwaddu-run-1-u64.c
 create mode 100644 gcc/testsuite/gcc.target/riscv/rvv/autovec/vx_vf/vx_vwmulu-run-1-u64.c
 create mode 100644 gcc/testsuite/gcc.target/riscv/rvv/autovec/vx_vf/vx_vwsubu-run-1-u64.c
 create mode 100644 gcc/testsuite/gcc.target/riscv/rvv/autovec/vx_vf/vx_widen.h
 create mode 100644 gcc/testsuite/gcc.target/riscv/rvv/autovec/vx_vf/vx_widen_data.h
 create mode 100644 gcc/testsuite/gcc.target/riscv/rvv/autovec/vx_vf/vx_widen_vx_run.h
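
For illustration only, here is a minimal C sketch of the kind of scalar
loops whose auto-vectorized form would involve a broadcast scalar
operand and a widening unsigned operation from uint32_t to uint64_t.
The function names are hypothetical and are not taken from vx_widen.h
in this series. Whether the compiler actually emits vwaddu.vx,
vwsubu.vx or vwmulu.vx for them depends on the target options and the
GR2VR cost, so this only shows the arithmetic being matched, not the
series' test code.

```c
#include <stdint.h>

/* Hypothetical helper: out[i] = zext(in[i]) + zext(x).
   This is the element-wise semantics of a widening unsigned add.  */
void
widen_add_u32 (uint64_t *out, const uint32_t *in, uint32_t x, unsigned n)
{
  for (unsigned i = 0; i < n; i++)
    out[i] = (uint64_t) in[i] + (uint64_t) x;
}

/* Hypothetical helper: out[i] = zext(in[i]) - zext(x).
   The result wraps modulo 2^64 when in[i] < x.  */
void
widen_sub_u32 (uint64_t *out, const uint32_t *in, uint32_t x, unsigned n)
{
  for (unsigned i = 0; i < n; i++)
    out[i] = (uint64_t) in[i] - (uint64_t) x;
}

/* Hypothetical helper: out[i] = zext(in[i]) * zext(x).
   The full 64-bit product of two 32-bit operands always fits.  */
void
widen_mul_u32 (uint64_t *out, const uint32_t *in, uint32_t x, unsigned n)
{
  for (unsigned i = 0; i < n; i++)
    out[i] = (uint64_t) in[i] * (uint64_t) x;
}
```

In each loop the scalar x is loop-invariant. That is the operand a
vectorizer would otherwise broadcast once with vmv.v.x before the loop,
and which the .vx form can take directly from a general-purpose
register.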