[v4] Teach GDB to generate sparse core files (PR corefiles/31494)

Message ID f4215076-fbd8-4946-939e-1cb898f01136@palves.net
State New
Series [v4] Teach GDB to generate sparse core files (PR corefiles/31494)

Checks

Context Check Description
linaro-tcwg-bot/tcwg_gdb_build--master-aarch64 success Testing passed
linaro-tcwg-bot/tcwg_gdb_build--master-arm success Testing passed
linaro-tcwg-bot/tcwg_gdb_check--master-aarch64 success Testing passed
linaro-tcwg-bot/tcwg_gdb_check--master-arm success Testing passed

Commit Message

Pedro Alves March 21, 2024, 11:14 p.m. UTC
  On 2024-03-21 21:27, Lancelot SIX wrote:

> I have a couple of small comments below.

Thanks!

> 
> On 18/03/2024 17:43, Pedro Alves wrote:
>> diff --git a/gdb/gcore.c b/gdb/gcore.c

>> +
>> +/* Wrapper around bfd_set_section_contents that avoids writing
>> +   all-zero blocks to disk, so we create a sparse core file.
>> +   SKIP_ALIGN is a recursion helper -- if true, we'll skip aligning
>> +   the file position to SPARSE_BLOCK_SIZE.  */
>> +
>> +static bool
>> +sparse_bfd_set_section_contents (bfd *obfd, asection *osec,
>> +                                const gdb_byte *data,
>> +                                size_t sec_offset,
>> +                                size_t size,
>> +                                bool skip_align = false)
>> +{
>> +  /* Note, we don't have to have special handling for the case of the
>> +     last memory region ending with zeros, because our caller always
>> +     writes out the note section after the memory/load sections.  If
>> +     it didn't, we'd have to seek+write the last byte to make the file
>> +     size correct.  (Or add an ftruncate abstraction to bfd and call
>> +     that.)  */
>> +
>> +  if (!skip_align)
>> +    {
>> +      /* Align the all-zero block search with SPARSE_BLOCK_SIZE, to
>> +        better align with filesystem blocks.  If we find we're
>> +        misaligned, then write/skip the bytes needed to make us
>> +        aligned.  We do that with (one level) recursion.  */
>> +
>> +      /* We need to know the section's file offset on disk.  We can
>> +        only look at it after the bfd's 'output_has_begun' flag has
>> +        been set, as bfd hasn't computed the file offsets
>> +        otherwise.  */
>> +      if (!obfd->output_has_begun)
>> +       {
>> +         gdb_byte dummy = 0;
>> +
>> +         /* A write forces BFD to compute the bfd's section file
>> +            positions.  Zero size works for that too.  */
>> +         if (!bfd_set_section_contents (obfd, osec, &dummy, 0, 0))
>> +           return false;
>> +
>> +         gdb_assert (obfd->output_has_begun);
>> +       }
>> +
>> +      /* How much we need to write/skip in order to find the next
>> +        SPARSE_BLOCK_SIZE filepos-aligned block.  */
>> +      size_t align_remainder
>> +       = (SPARSE_BLOCK_SIZE
>> +          - (osec->filepos + sec_offset) % SPARSE_BLOCK_SIZE);
>> +
>> +      /* How much we'll actually write in the recursion call.  */
>> +      size_t align_write_size = std::min (size, align_remainder);
>> +
>> +      if (align_write_size != 0)
> 
> I think at this point align_write_size can be SPARSE_BLOCK_SIZE (i.e. sec_offset lands at a SPARSE_BLOCK_SIZE boundary in the underlying filesystem).  If that's the case, and data+sec_offset starts with block of 0s, you'll write it to disk needlessly.  

Good find.

> Not a big deal, but I'd go for:
> 
>     if (align_write_size % SPARSE_BLOCK_SIZE != 0)
> 

I wrote something different, just because I dislike repeating the modulo operation.
It forced me to come up with better variable names, but I think the result is clearer.

Also, align_write_size can only be zero if we have nothing to write.  But if we have nothing
to write, this whole aligning logic isn't needed either.  So I added an early size == 0 check,
which then removed the need for this indentation level here.

>> +       {
>> +         /* Recurse, skipping the alignment code.  */
>> +         if (!sparse_bfd_set_section_contents (obfd, osec, data,
>> +                                               sec_offset,
>> +                                               align_write_size, true))
>> +           return false;
>> +
>> +         /* Skip over what we've written, and proceed with
>> +            assumes-aligned logic.  */
>> +         data += align_write_size;
>> +         sec_offset += align_write_size;
>> +         size -= align_write_size;
>> +       }
>> +    }
>> +
>> +  size_t data_offset = 0;
> 
> Just because that got me to think while reading, having the first part of the procedure update data/sec_offset/size and the second part of the procedure update data_offset seems a bit inconsistent.
> 
> I would probably move the declaration of data_offset at the very begining of the procedure update it consistently:
> 
>     size_t data_offset = 0;
>     if (!skip_align)
>       {
>         […]
>         if (align_write_size % SPARSE_BLOCK_SIZE != 0)
>           {
>             […]
>             data_offset += align_write_size;
>           }
>        }
>      while (data_offset < size)
>        […]

I did this, and then while stepping through the code to confirm it all works correctly, I noticed that it
leads to code that is a little harder to debug, as data_offset is no longer neatly aligned to 0x1000
while stepping through the "assume-aligned" main algorithm.  But that may just be me, and I do see your
point, so I kept the change.

Here's the updated patch.  I also applied John's suggestion to use a continue.

---- 8< ----
From 07d61478c8d02f593d8ab8bc0270eb0a90d535dd Mon Sep 17 00:00:00 2001
From: Pedro Alves <pedro@palves.net>
Date: Thu, 21 Mar 2024 23:07:46 +0000
Subject: [PATCH v4] Teach GDB to generate sparse core files (PR
 corefiles/31494)

This commit teaches GDB's gcore command to generate sparse core files
(if supported by the filesystem).

To create a sparse file, all you have to do is skip writing zeros to
the file, instead lseek'ing-ahead over them.

The sparse logic is applied when writing the memory sections, as
that's where the bulk of the data and the zeros are.

The commit also tweaks gdb.base/bigcore.exp to make it exercise
gdb-generated cores in addition to kernel-generated cores.  We
couldn't do that before, because GDB's gcore on that test's program
would generate a multi-GB non-sparse core (16GB on my system).

After this commit, gdb.base/bigcore.exp generates, when testing with
GDB's gcore, a much smaller core file, roughly in line with what the
kernel produces:

 real sizes:

 $ du -h testsuite/outputs/gdb.base/bigcore/bigcore.corefile.*
 2.2M    testsuite/outputs/gdb.base/bigcore/bigcore.corefile.gdb
 2.0M    testsuite/outputs/gdb.base/bigcore/bigcore.corefile.kernel

 apparent sizes:

 $ du -h --apparent-size testsuite/outputs/gdb.base/bigcore/bigcore.corefile.*
 16G     testsuite/outputs/gdb.base/bigcore/bigcore.corefile.gdb
 16G     testsuite/outputs/gdb.base/bigcore/bigcore.corefile.kernel

Time to generate the core also goes down significantly.  On my machine, I get:

  when writing to an SSD, from 21.0s, down to 8.0s
  when writing to an HDD, from 31.0s, down to 8.5s

The changes to gdb.base/bigcore.exp are smaller than they look at
first sight.  It's basically mostly refactoring -- moving most of the
code to a new procedure which takes as argument who should dump the
core, and then calling the procedure twice.  I purposely did not
modernize any of the refactored code in this patch.

Bug: https://sourceware.org/bugzilla/show_bug.cgi?id=31494
Reviewed-By: Lancelot Six <lancelot.six@amd.com>
Reviewed-By: Eli Zaretskii <eliz@gnu.org>
Reviewed-By: John Baldwin <jhb@FreeBSD.org>
Change-Id: I2554a6a4a72d8c199ce31f176e0ead0c0c76cff1
---
 gdb/NEWS                           |   4 +
 gdb/doc/gdb.texinfo                |   3 +
 gdb/gcore.c                        | 186 +++++++++++++++++++++-
 gdb/testsuite/gdb.base/bigcore.exp | 238 ++++++++++++++++-------------
 4 files changed, 323 insertions(+), 108 deletions(-)


base-commit: 9bec569fda7c76849cf3eb0e4a525f627d25f980
  

Comments

Lancelot SIX March 22, 2024, 10:26 a.m. UTC | #1
> Bug: https://sourceware.org/bugzilla/show_bug.cgi?id=31494
> Reviewed-By: Lancelot Six <lancelot.six@amd.com>
> Reviewed-By: Eli Zaretskii <eliz@gnu.org>
> Reviewed-By: John Baldwin <jhb@FreeBSD.org>
> Change-Id: I2554a6a4a72d8c199ce31f176e0ead0c0c76cff1
> ---
>   gdb/NEWS                           |   4 +
>   gdb/doc/gdb.texinfo                |   3 +
>   gdb/gcore.c                        | 186 +++++++++++++++++++++-
>   gdb/testsuite/gdb.base/bigcore.exp | 238 ++++++++++++++++-------------
>   4 files changed, 323 insertions(+), 108 deletions(-)
> 


I just had a quick look at the changes, and that looks good to me.

Thanks for doing this!

Lancelot.
  
Pedro Alves March 22, 2024, 12:33 p.m. UTC | #2
On 2024-03-22 10:26, Lancelot SIX wrote:

> I just had a quick look at the changes, and that looks good to me.
> 
> Thanks for doing this!
> 

Thanks!  I've merged this now.
  

Patch

diff --git a/gdb/NEWS b/gdb/NEWS
index d8ac0bb06a7..d1d25e4c24d 100644
--- a/gdb/NEWS
+++ b/gdb/NEWS
@@ -23,6 +23,10 @@  disassemble
   command will now give an error.  Previously the 'b' flag would
   always override the 'r' flag.
 
+gcore
+generate-core-file
+  GDB now generates sparse core files, on systems that support it.
+
 maintenance info line-table
   Add an EPILOGUE-BEGIN column to the output of the command.  It indicates
   if the line is considered the start of the epilgoue, and thus a point at
diff --git a/gdb/doc/gdb.texinfo b/gdb/doc/gdb.texinfo
index f093ee269e2..9224829bd93 100644
--- a/gdb/doc/gdb.texinfo
+++ b/gdb/doc/gdb.texinfo
@@ -13867,6 +13867,9 @@  Produce a core dump of the inferior process.  The optional argument
 specified, the file name defaults to @file{core.@var{pid}}, where
 @var{pid} is the inferior process ID.
 
+If supported by the filesystem where the core is written to,
+@value{GDBN} generates a sparse core dump file.
+
 Note that this command is implemented only for some systems (as of
 this writing, @sc{gnu}/Linux, FreeBSD, Solaris, and S390).
 
diff --git a/gdb/gcore.c b/gdb/gcore.c
index 7c12aa3a777..3d3973bfaba 100644
--- a/gdb/gcore.c
+++ b/gdb/gcore.c
@@ -39,10 +39,21 @@ 
 #include "gdbsupport/byte-vector.h"
 #include "gdbsupport/scope-exit.h"
 
+/* To generate sparse cores, we look at the data to write in chunks of
+   this size when considering whether to skip the write.  Only if we
+   have a full block of this size with all zeros do we skip writing
+   it.  A simpler algorithm that would try to skip all zeros would
+   result in potentially many more write/lseek syscalls, as normal
+   data is typically sprinkled with many small holes of zeros.  Also,
+   it's much more efficient to memcmp a block of data against an
+   all-zero buffer than to check each and every data byte against zero
+   one by one.  */
+#define SPARSE_BLOCK_SIZE 0x1000
+
 /* The largest amount of memory to read from the target at once.  We
    must throttle it to limit the amount of memory used by GDB during
    generate-core-file for programs with large resident data.  */
-#define MAX_COPY_BYTES (1024 * 1024)
+#define MAX_COPY_BYTES (256 * SPARSE_BLOCK_SIZE)
 
 static const char *default_gcore_target (void);
 static enum bfd_architecture default_gcore_arch (void);
@@ -98,7 +109,12 @@  write_gcore_file_1 (bfd *obfd)
   bfd_set_section_alignment (note_sec, 0);
   bfd_set_section_size (note_sec, note_size);
 
-  /* Now create the memory/load sections.  */
+  /* Now create the memory/load sections.  Note
+     gcore_memory_sections's sparse logic is assuming that we'll
+     always write something afterwards, which we do: just below, we
+     write the note section.  So there's no need for an ftruncate-like
+     call to grow the file to the right size if the last memory
+     sections were zeros and we skipped writing them.  */
   if (gcore_memory_sections (obfd) == 0)
     error (_("gcore: failed to get corefile memory sections from target."));
 
@@ -567,6 +583,167 @@  objfile_find_memory_regions (struct target_ops *self,
   return 0;
 }
 
+/* Check if we have a block full of zeros at DATA within the [DATA,
+   DATA+SIZE) buffer.  Returns the size of the all-zero block found.
+   Returns at most the minimum between SIZE and SPARSE_BLOCK_SIZE.  */
+
+static size_t
+get_all_zero_block_size (const gdb_byte *data, size_t size)
+{
+  size = std::min (size, (size_t) SPARSE_BLOCK_SIZE);
+
+  /* A memcmp of a whole block is much faster than a simple for loop.
+     This makes a big difference, as with a for loop, this code would
+     dominate the performance and result in doubling the time to
+     generate a core, at the time of writing.  With an optimized
+     memcmp, this doesn't even show up in the perf trace.  */
+  static const gdb_byte all_zero_block[SPARSE_BLOCK_SIZE] = {};
+  if (memcmp (data, all_zero_block, size) == 0)
+    return size;
+  return 0;
+}
+
+/* Basically a named-elements pair, used as return type of
+   find_next_all_zero_block.  */
+
+struct offset_and_size
+{
+  size_t offset;
+  size_t size;
+};
+
+/* Find the next all-zero block at DATA+OFFSET within the [DATA,
+   DATA+SIZE) buffer.  Returns the offset and the size of the all-zero
+   block if found, or zero if not found.  */
+
+static offset_and_size
+find_next_all_zero_block (const gdb_byte *data, size_t offset, size_t size)
+{
+  for (; offset < size; offset += SPARSE_BLOCK_SIZE)
+    {
+      size_t zero_block_size
+	= get_all_zero_block_size (data + offset, size - offset);
+      if (zero_block_size != 0)
+	return {offset, zero_block_size};
+    }
+  return {0, 0};
+}
+
+/* Wrapper around bfd_set_section_contents that avoids writing
+   all-zero blocks to disk, so we create a sparse core file.
+   SKIP_ALIGN is a recursion helper -- if true, we'll skip aligning
+   the file position to SPARSE_BLOCK_SIZE.  */
+
+static bool
+sparse_bfd_set_section_contents (bfd *obfd, asection *osec,
+				 const gdb_byte *data,
+				 size_t sec_offset,
+				 size_t size,
+				 bool skip_align = false)
+{
+  /* Note, we don't have to have special handling for the case of the
+     last memory region ending with zeros, because our caller always
+     writes out the note section after the memory/load sections.  If
+     it didn't, we'd have to seek+write the last byte to make the file
+     size correct.  (Or add an ftruncate abstraction to bfd and call
+     that.)  */
+
+  if (size == 0)
+    return true;
+
+  size_t data_offset = 0;
+
+  if (!skip_align)
+    {
+      /* Align the all-zero block search with SPARSE_BLOCK_SIZE, to
+	 better align with filesystem blocks.  If we find we're
+	 misaligned, then write/skip the bytes needed to make us
+	 aligned.  We do that with (one level) recursion.  */
+
+      /* We need to know the section's file offset on disk.  We can
+	 only look at it after the bfd's 'output_has_begun' flag has
+	 been set, as bfd hasn't computed the file offsets
+	 otherwise.  */
+      if (!obfd->output_has_begun)
+	{
+	  gdb_byte dummy = 0;
+
+	  /* A write forces BFD to compute the bfd's section file
+	     positions.  Zero size works for that too.  */
+	  if (!bfd_set_section_contents (obfd, osec, &dummy, 0, 0))
+	    return false;
+
+	  gdb_assert (obfd->output_has_begun);
+	}
+
+      /* How much after the last aligned offset are we writing at.  */
+      size_t aligned_offset_remainder
+	= (osec->filepos + sec_offset) % SPARSE_BLOCK_SIZE;
+
+      /* Do we need to align?  */
+      if (aligned_offset_remainder != 0)
+	{
+	  /* How much we need to advance in order to find the next
+	     SPARSE_BLOCK_SIZE filepos-aligned block.  */
+	  size_t distance_to_next_aligned
+	    = SPARSE_BLOCK_SIZE - aligned_offset_remainder;
+
+	  /* How much we'll actually write in the recursion call.  The
+	     caller may want us to write fewer bytes than
+	     DISTANCE_TO_NEXT_ALIGNED.  */
+	  size_t align_write_size = std::min (size, distance_to_next_aligned);
+
+	  /* Recurse, skipping the alignment code.  */
+	  if (!sparse_bfd_set_section_contents (obfd, osec, data,
+						sec_offset,
+						align_write_size, true))
+	    return false;
+
+	  /* Skip over what we've written, and proceed with
+	     assumes-aligned logic.  */
+	  data_offset += align_write_size;
+	}
+    }
+
+  while (data_offset < size)
+    {
+      size_t all_zero_block_size
+	= get_all_zero_block_size (data + data_offset, size - data_offset);
+      if (all_zero_block_size != 0)
+	{
+	  /* Skip writing all-zero blocks.  */
+	  data_offset += all_zero_block_size;
+	  continue;
+	}
+
+      /* We have some non-zero data to write to file.  Find the next
+	 all-zero block within the data, and only write up to it.  */
+
+      offset_and_size next_all_zero_block
+	= find_next_all_zero_block (data,
+				    data_offset + SPARSE_BLOCK_SIZE,
+				    size);
+      size_t next_data_offset = (next_all_zero_block.offset == 0
+				 ? size
+				 : next_all_zero_block.offset);
+
+      if (!bfd_set_section_contents (obfd, osec, data + data_offset,
+				     sec_offset + data_offset,
+				     next_data_offset - data_offset))
+	return false;
+
+      data_offset = next_data_offset;
+
+      /* If we already know we have an all-zero block at the next
+	 offset, we can skip calling get_all_zero_block_size for
+	 it again.  */
+      if (next_all_zero_block.offset != 0)
+	data_offset += next_all_zero_block.size;
+    }
+
+  return true;
+}
+
 static void
 gcore_copy_callback (bfd *obfd, asection *osec)
 {
@@ -599,8 +776,9 @@  gcore_copy_callback (bfd *obfd, asection *osec)
 			     bfd_section_vma (osec)));
 	  break;
 	}
-      if (!bfd_set_section_contents (obfd, osec, memhunk.data (),
-				     offset, size))
+
+      if (!sparse_bfd_set_section_contents (obfd, osec, memhunk.data (),
+					    offset, size))
 	{
 	  warning (_("Failed to write corefile contents (%s)."),
 		   bfd_errmsg (bfd_get_error ()));
diff --git a/gdb/testsuite/gdb.base/bigcore.exp b/gdb/testsuite/gdb.base/bigcore.exp
index 3f9ae48abf2..6c64d402502 100644
--- a/gdb/testsuite/gdb.base/bigcore.exp
+++ b/gdb/testsuite/gdb.base/bigcore.exp
@@ -43,23 +43,6 @@  if  { [gdb_compile "${srcdir}/${subdir}/${srcfile}" "${binfile}" executable {deb
      return -1
 }
 
-# Run GDB on the bigcore program up-to where it will dump core.
-
-clean_restart ${binfile}
-gdb_test_no_output "set print sevenbit-strings"
-gdb_test_no_output "set width 0"
-
-# Get the core into the output directory.
-set_inferior_cwd_to_output_dir
-
-if {![runto_main]} {
-    return 0
-}
-set print_core_line [gdb_get_line_number "Dump core"]
-gdb_test "tbreak $print_core_line"
-gdb_test continue ".*print_string.*"
-gdb_test next ".*0 = 0.*"
-
 # Traverse part of bigcore's linked list of memory chunks (forward or
 # backward), saving each chunk's address.
 
@@ -92,92 +75,11 @@  proc extract_heap { dir } {
     }
     return $heap
 }
-set next_heap [extract_heap next]
-set prev_heap [extract_heap prev]
-
-# Save the total allocated size within GDB so that we can check
-# the core size later.
-gdb_test_no_output "set \$bytes_allocated = bytes_allocated" "save heap size"
-
-# Now create a core dump
-
-# Rename the core file to "TESTFILE.corefile" rather than just "core",
-# to avoid problems with sys admin types that like to regularly prune
-# all files named "core" from the system.
-
-# Some systems append "core" to the name of the program; others append
-# the name of the program to "core"; still others (like Linux, as of
-# May 2003) create cores named "core.PID".
-
-# Save the process ID.  Some systems dump the core into core.PID.
-set inferior_pid [get_inferior_pid]
-
-# Dump core using SIGABRT
-set oldtimeout $timeout
-set timeout 600
-gdb_test "signal SIGABRT" "Program terminated with signal SIGABRT, .*"
-set timeout $oldtimeout
-
-# Find the corefile.
-set file [find_core_file $inferior_pid]
-if { $file != "" } {
-    remote_exec build "mv $file $corefile"
-} else {
-    untested "can't generate a core file"
-    return 0
-}
 
-# Check that the corefile is plausibly large enough.  We're trying to
-# detect the case where the operating system has truncated the file
-# just before signed wraparound.  TCL, unfortunately, has a similar
-# problem - so use catch.  It can handle the "bad" size but not
-# necessarily the "good" one.  And we must use GDB for the comparison,
-# similarly.
-
-if {[catch {file size $corefile} core_size] == 0} {
-    set core_ok 0
-    gdb_test_multiple "print \$bytes_allocated < $core_size" "check core size" {
-	-re " = 1\r\n$gdb_prompt $" {
-	    pass "check core size"
-	    set core_ok 1
-	}
-	-re " = 0\r\n$gdb_prompt $" {
-	    pass "check core size"
-	    set core_ok 0
-	}
-    }
-} { 
-    # Probably failed due to the TCL build having problems with very
-    # large values.  Since GDB uses a 64-bit off_t (when possible) it
-    # shouldn't have this problem.  Assume that things are going to
-    # work.  Without this assumption the test is skiped on systems
-    # (such as i386 GNU/Linux with patched kernel) which do pass.
-    pass "check core size"
-    set core_ok 1
-}
-if {! $core_ok} {
-    untested "check core size (system does not support large corefiles)"
-    return 0
-}
-
-# Now load up that core file
-
-set test "load corefile"
-gdb_test_multiple "core $corefile" "$test" {
-    -re "A program is being debugged already.  Kill it. .y or n. " {
-	send_gdb "y\n"
-	exp_continue
-    }
-    -re "Core was generated by.*$gdb_prompt $" {
-	pass "$test"
-    }
-}
-
-# Finally, re-traverse bigcore's linked list, checking each chunk's
-# address against the executable.  Don't use gdb_test_multiple as want
-# only one pass/fail.  Don't use exp_continue as the regular
-# expression involving $heap needs to be re-evaluated for each new
-# response.
+# Re-traverse bigcore's linked list, checking each chunk's address
+# against the executable.  Don't use gdb_test_multiple as want only
+# one pass/fail.  Don't use exp_continue as the regular expression
+# involving $heap needs to be re-evaluated for each new response.
 
 proc check_heap { dir heap } {
     global gdb_prompt
@@ -208,5 +110,133 @@  proc check_heap { dir heap } {
     }
 }
 
-check_heap next $next_heap
-check_heap prev $prev_heap
+# The bulk of the testcase.  DUMPER indicates who is supposed to dump
+# the core.  It can be either "kernel", or "gdb".
+proc test {dumper} {
+    global binfile timeout corefile gdb_prompt
+
+    # Run GDB on the bigcore program up-to where it will dump core.
+
+    clean_restart ${binfile}
+    gdb_test_no_output "set print sevenbit-strings"
+    gdb_test_no_output "set width 0"
+
+    # Get the core into the output directory.
+    set_inferior_cwd_to_output_dir
+
+    if {![runto_main]} {
+	return 0
+    }
+    set print_core_line [gdb_get_line_number "Dump core"]
+    gdb_test "tbreak $print_core_line"
+    gdb_test continue ".*print_string.*"
+    gdb_test next ".*0 = 0.*"
+
+    set next_heap [extract_heap next]
+    set prev_heap [extract_heap prev]
+
+    # Save the total allocated size within GDB so that we can check
+    # the core size later.
+    gdb_test_no_output "set \$bytes_allocated = bytes_allocated" \
+	"save heap size"
+
+    # Now create a core dump.
+
+    if {$dumper == "kernel"} {
+	# Rename the core file to "TESTFILE.corefile.$dumper" rather
+	# than just "core", to avoid problems with sys admin types
+	# that like to regularly prune all files named "core" from the
+	# system.
+
+	# Some systems append "core" to the name of the program;
+	# others append the name of the program to "core"; still
+	# others (like Linux, as of May 2003) create cores named
+	# "core.PID".
+
+	# Save the process ID.  Some systems dump the core into
+	# core.PID.
+	set inferior_pid [get_inferior_pid]
+
+	# Dump core using SIGABRT.
+	set oldtimeout $timeout
+	set timeout 600
+	gdb_test "signal SIGABRT" "Program terminated with signal SIGABRT, .*"
+	set timeout $oldtimeout
+
+	# Find the corefile.
+	set file [find_core_file $inferior_pid]
+	if { $file != "" } {
+	    remote_exec build "mv $file $corefile.$dumper"
+	} else {
+	    untested "can't generate a core file"
+	    return 0
+	}
+    } elseif {$dumper == "gdb"} {
+	gdb_gcore_cmd "$corefile.$dumper" "gcore corefile"
+    } else {
+	error "unhandled dumper: $dumper"
+    }
+
+    # Check that the corefile is plausibly large enough.  We're trying
+    # to detect the case where the operating system has truncated the
+    # file just before signed wraparound.  TCL, unfortunately, has a
+    # similar problem - so use catch.  It can handle the "bad" size
+    # but not necessarily the "good" one.  And we must use GDB for the
+    # comparison, similarly.
+
+    if {[catch {file size $corefile.$dumper} core_size] == 0} {
+	set core_ok 0
+	gdb_test_multiple "print \$bytes_allocated < $core_size" \
+	    "check core size" {
+	    -re " = 1\r\n$gdb_prompt $" {
+		pass "check core size"
+		set core_ok 1
+	    }
+	    -re " = 0\r\n$gdb_prompt $" {
+		pass "check core size"
+		set core_ok 0
+	    }
+	}
+    } {
+	# Probably failed due to the TCL build having problems with
+	# very large values.  Since GDB uses a 64-bit off_t (when
+	# possible) it shouldn't have this problem.  Assume that
+	# things are going to work.  Without this assumption the test
+	# is skiped on systems (such as i386 GNU/Linux with patched
+	# kernel) which do pass.
+	pass "check core size"
+	set core_ok 1
+    }
+    if {! $core_ok} {
+	untested "check core size (system does not support large corefiles)"
+	return 0
+    }
+
+    # Now load up that core file.
+
+    set test "load corefile"
+    gdb_test_multiple "core $corefile.$dumper" "$test" {
+	-re "A program is being debugged already.  Kill it. .y or n. " {
+	    send_gdb "y\n"
+	    exp_continue
+	}
+	-re "Core was generated by.*$gdb_prompt $" {
+	    pass "$test"
+	}
+    }
+
+    # Finally, re-traverse bigcore's linked list, checking each
+    # chunk's address against the executable.
+
+    check_heap next $next_heap
+    check_heap prev $prev_heap
+}
+
+foreach_with_prefix dumper {kernel gdb} {
+    # GDB's gcore is too slow when testing with the extended-gdbserver
+    # board, since it requires reading all the inferior memory.
+    if {$dumper == "gdb" && [target_info gdb_protocol] != ""} {
+	continue
+    }
+    test $dumper
+}