[00/11] x86: NOP emission adjustments

Message ID	7ce54bc2-fef2-d2e4-21fd-202fdead0c20@suse.com
Headers	DMARC-Filter: OpenDMARC Filter v1.4.2 sourceware.org 9DF2838582BD Message-ID: <7ce54bc2-fef2-d2e4-21fd-202fdead0c20@suse.com> Date: Wed, 27 Sep 2023 17:46:01 +0200 User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:102.0) Gecko/20100101 Thunderbird/102.15.1 Content-Language: en-US To: Binutils <binutils@sourceware.org> Cc: "H.J. Lu" <hjl.tools@gmail.com> Subject: [PATCH 00/11] x86: NOP emission adjustments Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit MIME-Version: 1.0 Precedence: list From: Jan Beulich via Binutils <binutils@sourceware.org> Reply-To: Jan Beulich <jbeulich@suse.com> Errors-To: binutils-bounces+patchwork=sourceware.org@sourceware.org Sender: "Binutils" <binutils-bounces+patchwork=sourceware.org@sourceware.org>
Series	x86: NOP emission adjustments \| [00/11] x86: NOP emission adjustments [01/11] x86: record flag_code in tc_frag_data [02/11] x86: i386_generate_nops() may not derive decisions from global variables [03/11] x86: don't use 32-bit LEA as NOP surrogate in 64-bit code [04/11] x86: don't use operand size override with NOP in 16-bit code [05/11] x86: respect ".arch nonop" when selecting which NOPs to emit [06/11] x86: i686 != PentiumPro [07/11] x86: don't record full i386_cpu_flags in struct i386_tc_frag_data [08/11] x86: add a few more NOP patterns [09/11] x86: fold a few of the "alternative" NOP patterns [10/11] x86: fold NOP testcase expectations where possible [11/11] gas: make .nops output visible in listing

Message ID

7ce54bc2-fef2-d2e4-21fd-202fdead0c20@suse.com

Headers

DMARC-Filter: OpenDMARC Filter v1.4.2 sourceware.org 9DF2838582BD
Message-ID: <7ce54bc2-fef2-d2e4-21fd-202fdead0c20@suse.com>
Date: Wed, 27 Sep 2023 17:46:01 +0200
User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:102.0) Gecko/20100101
 Thunderbird/102.15.1
Content-Language: en-US
To: Binutils <binutils@sourceware.org>
Cc: "H.J. Lu" <hjl.tools@gmail.com>
Subject: [PATCH 00/11] x86: NOP emission adjustments
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 7bit
MIME-Version: 1.0
X-MS-Exchange-AntiSpam-MessageData-ChunkCount: 1
X-MS-Exchange-AntiSpam-MessageData-0: =?utf-8?q?+nv0KLQHKfqTktuBhvPwzl5QY9BB?=
	=?utf-8?q?z4qS05fSZ3PxOp0xKgy1F5ZmYVmSOyQqrtbB5ivmWDj+IAIccHRO/fZWaEB3aV0KG?=
	=?utf-8?q?vS2vQDM1CjJFUmS3vNnVSDdG0Zl+9dIK4ENWQOf33b1a2GbJsPWtmObKrPC5QEC1c?=
	=?utf-8?q?90UhMb8X3BCEe/nyiQNCh1F79IHUJ4Ke/bN4f8ETkAKMLhS1TxOEhJjQI5FbqOZu7?=
	=?utf-8?q?/QOceSIbYh6pKb7TjwltNOfpsqS8lBP8DOMtfJBlANBdkitzs5aPaPudP7k+Vx7+2?=
	=?utf-8?q?btt6u8Iw8w5fxQUT+Dg28LuTaqoQC3i64TsgdhcRTwUnKRonyWMLpKeOwjgtn0VhH?=
	=?utf-8?q?QVlgHvXNJKrTixDfcrNLITHmX9G6d1lS+824pxiwOrj+8kfLeZViPao5lWRQtrxSI?=
	=?utf-8?q?XqQn6IxfTbHf31xdpE/shcxyVg3jq22gqFf/SnVSs+ri54GU2Gn2NruhygZmbrmR/?=
	=?utf-8?q?vIJNYmZnIP2sTwsIh+O/XaegAhcrkVGNUXz1KvRC3szZ+fH89eF3a5oVJ1hJI0Da7?=
	=?utf-8?q?YQm+FtC8x6/Hre+6b601j6mOGvdhq7w7uLecl4KZJ8HAdvZVX6f6NDdKqi47scvEw?=
	=?utf-8?q?ZUtka/4Sk86SCTpqlLs0lZ6FGAOqGN1lvjkhBt4GRComFamxmo3IQoh86mQuLt8WH?=
	=?utf-8?q?TWmWvqp5SbKbWJIA9f7ZNfKTo0nSv0mnUX3RE99rWn/sYC2PkvNL6A3E0PEcKNXMx?=
	=?utf-8?q?AfJyUjvLibVt//W8bk7zfniWwH02QscjoDww8jDSKhXIAYt+1pYWDi0S8IoqNnrDm?=
	=?utf-8?q?CLIFC1Mv/eRuR6CLX57CIKx+fK4IFBzZQtY4Uoe3Q7RlBh5xe+/r6rhIxRHwszwfq?=
	=?utf-8?q?i3yl3NQJKZQ1NOar89UcznXs+AjY4A9f8DrfC2uOTtKBdMrbafaOOlBhWr3MHZwRg?=
	=?utf-8?q?SPz3qxLD/lpBtEyQdYkwcEOMlnwtCZN5Zo8PNmhKTQtqhd1aJDxKcU/g50a9z8N6R?=
	=?utf-8?q?PUW/njZm11LXr5jiq1ONdXu1xsCBbHXi4lRDjS7qmcgKAExTsYwD/dQB8nfe7RHaO?=
	=?utf-8?q?UFk7HuVl/oLGwm+MCj0MrZ3IrclJv+4wjCdgJXioVhBjz+NFU0LwuEISkSQNv+WKl?=
	=?utf-8?q?KsYoHLIVGKhcbjqgTqEuQ3wNhj3OnWJBh3tkZCtUxYOsa0LTri69NF5BWFAQ/cl8+?=
	=?utf-8?q?Mt6eOmKWqxoe90xINwvlL+XJGckrqe1LBxaIB5AKlEQKLkr0Yy/v8MVuZNBBtCUoD?=
	=?utf-8?q?mmOsdhH2rpywLkoxZmBbtKCeONX6sTGSePCxWoc62S1XgRjNn3LBHAANxpecAq4bB?=
	=?utf-8?q?txttI03gssKRE395klsTF0JwIX7v7vsdhj+u0CfdU+QI1KBl/IsgV4BRrQU0ExoYF?=
	=?utf-8?q?CoV6IngTplqmb+GcYFfW3Z6df476e5T5iqLa+Y2YJ1w4QpWWBa5YdBsI54RZgxu+K?=
	=?utf-8?q?qB6OHKLjKzgk8lT9lET6rU4DZKYYbAJhD1Pp2hJlHFp/whQbrh0Ygy6NOSlaU5ht1?=
	=?utf-8?q?k4SCyPq/mMj3EMLs1v/8foKEFB5Cbve1Dy7pLRc4b0SVereYX6jTHynuvf8HnWQYQ?=
	=?utf-8?q?Cd9g7jpWdtYx?=
X-OriginatorOrg: suse.com
X-MS-Exchange-CrossTenant-Network-Message-Id: 
 9918ef7f-37c7-4005-d528-08dbbf70db07
X-MS-Exchange-CrossTenant-AuthSource: AS8PR04MB8788.eurprd04.prod.outlook.com
X-MS-Exchange-CrossTenant-AuthAs: Internal
X-MS-Exchange-CrossTenant-OriginalArrivalTime: 27 Sep 2023 15:46:03.6476 (UTC)
X-MS-Exchange-CrossTenant-FromEntityHeader: Hosted
X-MS-Exchange-CrossTenant-Id: f7a17af6-1c5c-4a36-aa8b-f5be247aa4ba
X-MS-Exchange-CrossTenant-MailboxType: HOSTED
X-MS-Exchange-CrossTenant-UserPrincipalName: 
 ubKwnKPtXjx1ObMBBiENaJdRV+j7o0K24e3wBSSDa8UHXv2AGvHblnvComnmeWa9JsvETjbpLnjr9wkJh+Fb0g==
X-MS-Exchange-Transport-CrossTenantHeadersStamped: AM8PR04MB7812
X-Spam-Status: No, score=-3026.9 required=5.0 tests=BAYES_00, DKIM_SIGNED,
 DKIM_VALID, DKIM_VALID_AU, DKIM_VALID_EF, RCVD_IN_DNSWL_NONE,
 RCVD_IN_MSPIKE_H2, SPF_HELO_PASS, SPF_PASS,
 TXREP autolearn=ham autolearn_force=no version=3.4.6
X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on
 server2.sourceware.org
X-BeenThere: binutils@sourceware.org
X-Mailman-Version: 2.1.30
Precedence: list
List-Id: Binutils mailing list <binutils.sourceware.org>
List-Unsubscribe: <https://sourceware.org/mailman/options/binutils>,
 <mailto:binutils-request@sourceware.org?subject=unsubscribe>
List-Archive: <https://sourceware.org/pipermail/binutils/>
List-Post: <mailto:binutils@sourceware.org>
List-Help: <mailto:binutils-request@sourceware.org?subject=help>
List-Subscribe: <https://sourceware.org/mailman/listinfo/binutils>,
 <mailto:binutils-request@sourceware.org?subject=subscribe>
From: Jan Beulich via Binutils <binutils@sourceware.org>
Reply-To: Jan Beulich <jbeulich@suse.com>
Errors-To: binutils-bounces+patchwork=sourceware.org@sourceware.org
Sender: "Binutils" <binutils-bounces+patchwork=sourceware.org@sourceware.org>

Series

x86: NOP emission adjustments |

Message

Jan Beulich Sept. 27, 2023, 3:46 p.m. UTC

  I've noticed a number of issues and inefficiencies.

01: x86: record flag_code in tc_frag_data
02: x86: i386_generate_nops() may not derive decisions from global variables
03: x86: don't use 32-bit LEA as NOP surrogate in 64-bit code
04: x86: don't use operand size override with NOP in 16-bit code
05: x86: respect ".arch nonop" when selecting which NOPs to emit
06: x86: i686 != PentiumPro
07: x86: don't record full i386_cpu_flags in struct i386_tc_frag_data
08: x86: add a few more NOP patterns
09: x86: fold a few of the "alternative" NOP patterns
10: x86: fold NOP testcase expecations where possible
11: gas: make .nops output visible in listing

Jan

Comments

Jan Beulich Sept. 27, 2023, 3:59 p.m. UTC | #1

On 27.09.2023 17:46, Jan Beulich via Binutils wrote:
> I've noticed a number of issues and inefficiencies.
> 
> 01: x86: record flag_code in tc_frag_data
> 02: x86: i386_generate_nops() may not derive decisions from global variables
> 03: x86: don't use 32-bit LEA as NOP surrogate in 64-bit code
> 04: x86: don't use operand size override with NOP in 16-bit code
> 05: x86: respect ".arch nonop" when selecting which NOPs to emit
> 06: x86: i686 != PentiumPro
> 07: x86: don't record full i386_cpu_flags in struct i386_tc_frag_data
> 08: x86: add a few more NOP patterns
> 09: x86: fold a few of the "alternative" NOP patterns
> 10: x86: fold NOP testcase expecations where possible
> 11: gas: make .nops output visible in listing

I shall have mentioned one further observation: When we use LEA as NOP-
surrogate, we always use %{,e,r}si as destination. I was suspecting this
might not be optimal when these actually end up executing, and indeed on
one of the three systems I checked (a Skylake) there was a reliably
measurable difference between that and alternating the destination
registers used. Question is whether that's enough of a concern, when
generally we expect people to build 64-bit code and not use .arch .nonop.

Jan