| Message ID | 20250916115428.1176402-1-pan2.li@intel.com |
|---|---|
| Headers |
Return-Path: <gcc-patches-bounces~patchwork=sourceware.org@gcc.gnu.org>
X-Original-To: patchwork@sourceware.org
Delivered-To: patchwork@sourceware.org
Received: from server2.sourceware.org (localhost [IPv6:::1])
by sourceware.org (Postfix) with ESMTP id 54F633857002
for <patchwork@sourceware.org>; Tue, 16 Sep 2025 11:55:28 +0000 (GMT)
DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org 54F633857002
Authentication-Results: sourceware.org;
dkim=pass (2048-bit key,
unprotected) header.d=intel.com header.i=@intel.com header.a=rsa-sha256
header.s=Intel header.b=KJOwflAA
X-Original-To: gcc-patches@gcc.gnu.org
Delivered-To: gcc-patches@gcc.gnu.org
Received: from mgamail.intel.com (mgamail.intel.com [192.198.163.10])
by sourceware.org (Postfix) with ESMTPS id A916B3858D39
for <gcc-patches@gcc.gnu.org>; Tue, 16 Sep 2025 11:53:51 +0000 (GMT)
DMARC-Filter: OpenDMARC Filter v1.4.2 sourceware.org A916B3858D39
Authentication-Results: sourceware.org;
dmarc=pass (p=none dis=none) header.from=intel.com
Authentication-Results: sourceware.org; spf=pass smtp.mailfrom=intel.com
ARC-Filter: OpenARC Filter v1.0.0 sourceware.org A916B3858D39
Authentication-Results: server2.sourceware.org;
arc=none smtp.remote-ip=192.198.163.10
ARC-Seal: i=1; a=rsa-sha256; d=sourceware.org; s=key; t=1758023632; cv=none;
b=Ay82oNzakI6CaPZjW3eBPO0XATF+YxmYnkUyjQcPDGet7+8qoQTykCWgVsc85N0/CuI9nP9Py7agHBdErQb28TkmGLK8PrDxaT1+i5nOTnn2x+DqRv3o0xnQm2n85j3uBx1PBxyjNNyQ+dbTfCx0PWU9p4QbEjnAGpLHiMdmKzg=
ARC-Message-Signature: i=1; a=rsa-sha256; d=sourceware.org; s=key;
t=1758023632; c=relaxed/simple;
bh=6oqPwJnX/NoMdwLv5Yy1qFw899AwRtNVd/c13N3d8vA=;
h=DKIM-Signature:From:To:Subject:Date:Message-ID:MIME-Version;
b=K0aT5k1quhFekiNfRKmQ9+nbdXGkuFOPlNhcZ5/xwCzcQDN7oA7nFUQ2tUHqqx8F7DXmeDvDT1qedjwddL3SWae/EvW/FZ4oIN0yUbI2x68jxWTt+lHEZqjCmPjb4OjXYxlzg9HP0z64fLTW1nkKO9LqpMB2EBKdAgSYJAclpf0=
ARC-Authentication-Results: i=1; server2.sourceware.org
DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org A916B3858D39
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple;
d=intel.com; i=@intel.com; q=dns/txt; s=Intel;
t=1758023632; x=1789559632;
h=from:to:cc:subject:date:message-id:mime-version:
content-transfer-encoding;
bh=6oqPwJnX/NoMdwLv5Yy1qFw899AwRtNVd/c13N3d8vA=;
b=KJOwflAAM0OYJTMJE0VosKtIdQdKVeZ5dJJx/Fy0McjZ8DBlpjfiCICa
bk22hLEI9l9alN4zWSy8kYm4bf6dq2h9wIL8f4Fwd7efPZ0BwWS7RkPBe
Q67EgmmG3HIKGr/X1DrDkMKj/nquUAx4L4od04iTT6m45rIo/Qp48H6/H
+ksKz+yoSyhnaqakZtIcLd2UjPcjb3Slg2tpfHhkZW/V5J1sPo1L1zYtu
vgC63t9Mc9lFRHlUUfCMcXyU7ccGXzSN0fjuac5+M2/6AcVqlCSyQg6AX
3Ez+ybwc4TE2D/7y3LJztQlXqnkE1ZYz3aSd6xM0jbOJHK7tfdqXUQ/t6 g==;
X-CSE-ConnectionGUID: jZQgLb9fQnSldsugjOF6JA==
X-CSE-MsgGUID: c7PaHvksTb+BS3IMHeoiEw==
X-IronPort-AV: E=McAfee;i="6800,10657,11554"; a="71667388"
X-IronPort-AV: E=Sophos;i="6.18,269,1751266800"; d="scan'208";a="71667388"
Received: from fmviesa005.fm.intel.com ([10.60.135.145])
by fmvoesa104.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384;
16 Sep 2025 04:53:50 -0700
X-CSE-ConnectionGUID: AAbvwfuxR8OmLg3YYOxxgA==
X-CSE-MsgGUID: vIfDW6JuSoCg80huJLS7yA==
X-ExtLoop1: 1
X-IronPort-AV: E=Sophos;i="6.18,269,1751266800"; d="scan'208";a="179303072"
Received: from panli.sh.intel.com ([10.239.159.63])
by fmviesa005.fm.intel.com with ESMTP; 16 Sep 2025 04:53:48 -0700
From: pan2.li@intel.com
To: gcc-patches@gcc.gnu.org
Cc: juzhe.zhong@rivai.ai, kito.cheng@gmail.com, jeffreyalaw@gmail.com,
rdapp.gcc@gmail.com, ken.chen@intel.com, hongtao.liu@intel.com,
Pan Li <pan2.li@intel.com>
Subject: [PATCH v2 0/4] RISC-V: Combine vec_duplicate + v{widen}u.vv to
v{widen}u.vx on GR2VR cost
Date: Tue, 16 Sep 2025 19:52:37 +0800
Message-ID: <20250916115428.1176402-1-pan2.li@intel.com>
X-Mailer: git-send-email 2.43.0
MIME-Version: 1.0
Content-Transfer-Encoding: 8bit
X-Spam-Status: No, score=-5.4 required=5.0 tests=BAYES_00, DKIMWL_WL_HIGH,
DKIM_SIGNED, DKIM_VALID, DKIM_VALID_AU, DKIM_VALID_EF, KAM_SHORT,
RCVD_IN_VALIDITY_RPBL_BLOCKED, RCVD_IN_VALIDITY_SAFE_BLOCKED, SPF_HELO_NONE,
SPF_NONE, TXREP autolearn=ham autolearn_force=no version=3.4.6
X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on
server2.sourceware.org
X-BeenThere: gcc-patches@gcc.gnu.org
X-Mailman-Version: 2.1.30
Precedence: list
List-Id: Gcc-patches mailing list <gcc-patches.gcc.gnu.org>
List-Unsubscribe: <https://gcc.gnu.org/mailman/options/gcc-patches>,
<mailto:gcc-patches-request@gcc.gnu.org?subject=unsubscribe>
List-Archive: <https://gcc.gnu.org/pipermail/gcc-patches/>
List-Post: <mailto:gcc-patches@gcc.gnu.org>
List-Help: <mailto:gcc-patches-request@gcc.gnu.org?subject=help>
List-Subscribe: <https://gcc.gnu.org/mailman/listinfo/gcc-patches>,
<mailto:gcc-patches-request@gcc.gnu.org?subject=subscribe>
Errors-To: gcc-patches-bounces~patchwork=sourceware.org@gcc.gnu.org
|
| Series |
RISC-V: Combine vec_duplicate + v{widen}u.vv to v{widen}u.vx on GR2VR cost
|
|
Message
Li, Pan2
Sept. 16, 2025, 11:52 a.m. UTC
From: Pan Li <pan2.li@intel.com>
This patch would like to introduce the combine of vec_dup + v{widen}u.vv
into v{widen}u.vx on the cost value of GR2VR. The late-combine will take
place if the cost of GR2VRlike 1, 2, 15 in test.
The below insn from uint32_t to uint64_t are included.
* vwaddu.vx
* vwsubu.vx
* vwmulu.vx
From:
| ...
| vmv.v.x
| L1:
| v{widen}u.vv
| J L1
| ...
To:
| ...
| L1:
| v{widen}u.vx
| J L1
| ...
The below test suites are passed for this patch series.
* The rv64gcv fully regression test.
Pan Li (4):
RISC-V: Combine vec_duplicate + vwaddu.vv to vwaddu.vx on GR2VR cost
RISC-V: Add test for vec_duplicate + vwaddu.vv signed combine with GR2VR cost 0, 1 and 15
RISC-V: Add test for vec_duplicate + vwsubu.vv signed combine with GR2VR cost 0, 1 and 15
RISC-V: Add test for vec_duplicate + vwmulu.vv signed combine with GR2VR cost 0, 1 and 15
gcc/config/riscv/autovec-opt.md | 42 +++++
gcc/config/riscv/iterators.md | 3 +
gcc/config/riscv/vector-iterators.md | 16 ++
.../riscv/rvv/autovec/vx_vf/vx-1-u16.c | 6 +
.../riscv/rvv/autovec/vx_vf/vx-1-u32.c | 6 +
.../riscv/rvv/autovec/vx_vf/vx-1-u64.c | 6 +
.../riscv/rvv/autovec/vx_vf/vx-2-u16.c | 6 +
.../riscv/rvv/autovec/vx_vf/vx-2-u32.c | 6 +
.../riscv/rvv/autovec/vx_vf/vx-2-u64.c | 6 +
.../riscv/rvv/autovec/vx_vf/vx-3-u16.c | 6 +
.../riscv/rvv/autovec/vx_vf/vx-3-u32.c | 6 +
.../riscv/rvv/autovec/vx_vf/vx-3-u64.c | 6 +
.../rvv/autovec/vx_vf/vx_vwaddu-run-1-u64.c | 18 ++
.../rvv/autovec/vx_vf/vx_vwmulu-run-1-u64.c | 18 ++
.../rvv/autovec/vx_vf/vx_vwsubu-run-1-u64.c | 18 ++
.../riscv/rvv/autovec/vx_vf/vx_widen.h | 36 ++++
.../riscv/rvv/autovec/vx_vf/vx_widen_data.h | 159 ++++++++++++++++++
.../riscv/rvv/autovec/vx_vf/vx_widen_vx_run.h | 27 +++
18 files changed, 391 insertions(+)
create mode 100644 gcc/testsuite/gcc.target/riscv/rvv/autovec/vx_vf/vx_vwaddu-run-1-u64.c
create mode 100644 gcc/testsuite/gcc.target/riscv/rvv/autovec/vx_vf/vx_vwmulu-run-1-u64.c
create mode 100644 gcc/testsuite/gcc.target/riscv/rvv/autovec/vx_vf/vx_vwsubu-run-1-u64.c
create mode 100644 gcc/testsuite/gcc.target/riscv/rvv/autovec/vx_vf/vx_widen.h
create mode 100644 gcc/testsuite/gcc.target/riscv/rvv/autovec/vx_vf/vx_widen_data.h
create mode 100644 gcc/testsuite/gcc.target/riscv/rvv/autovec/vx_vf/vx_widen_vx_run.h
Comments
> This patch would like to introduce the combine of vec_dup + v{widen}u.vv > into v{widen}u.vx on the cost value of GR2VR. The late-combine will take > place if the cost of GR2VRlike 1, 2, 15 in test. This series LGTM, thanks.