[3/5] openmp, nvptx: ompx_unified_shared_mem_alloc

From: Andrew Stubbs <ams@codesourcery.com>

  From: Andrew Stubbs <ams@codesourcery.com>

This adds support for using Cuda Managed Memory with omp_alloc.  It will be
used as the underpinnings for "requires unified_shared_memory" in a later
patch.

There are two new predefined allocators, ompx_unified_shared_mem_alloc and
ompx_host_mem_alloc, plus corresponding memory spaces, which can be used to
allocate memory in the "managed" space and explicitly on the host (it is
intended that "malloc" will be intercepted by the compiler).

The nvptx plugin is modified to make the necessary Cuda calls, and libgomp
is modified to switch to shared-memory mode for USM allocated mappings.

libgomp/ChangeLog:

	* allocator.c (omp_max_predefined_alloc): Update.
	(omp_aligned_alloc): Don't fallback ompx_host_mem_alloc.
	(omp_aligned_calloc): Likewise.
	(omp_realloc): Likewise.
	* config/linux/allocator.c (linux_memspace_alloc): Handle USM.
	(linux_memspace_calloc): Handle USM.
	(linux_memspace_free): Handle USM.
	(linux_memspace_realloc): Handle USM.
	* config/nvptx/allocator.c (nvptx_memspace_alloc): Reject
	ompx_host_mem_alloc.
	(nvptx_memspace_calloc): Likewise.
	(nvptx_memspace_realloc): Likewise.
	* libgomp-plugin.h (GOMP_OFFLOAD_usm_alloc): New prototype.
	(GOMP_OFFLOAD_usm_free): New prototype.
	(GOMP_OFFLOAD_is_usm_ptr): New prototype.
	* libgomp.h (gomp_usm_alloc): New prototype.
	(gomp_usm_free): New prototype.
	(gomp_is_usm_ptr): New prototype.
	(struct gomp_device_descr): Add USM functions.
	* omp.h.in (omp_memspace_handle_t): Add ompx_unified_shared_mem_space
	and ompx_host_mem_space.
	(omp_allocator_handle_t): Add ompx_unified_shared_mem_alloc and
	ompx_host_mem_alloc.
	* omp_lib.f90.in: Likewise.
	* plugin/plugin-nvptx.c (nvptx_alloc): Add "usm" parameter.
	Call cuMemAllocManaged as appropriate.
	(GOMP_OFFLOAD_alloc): Move internals to ...
	(GOMP_OFFLOAD_alloc_1): ... this, and add usm parameter.
	(GOMP_OFFLOAD_usm_alloc): New function.
	(GOMP_OFFLOAD_usm_free): New function.
	(GOMP_OFFLOAD_is_usm_ptr): New function.
	* target.c (gomp_map_vars_internal): Add USM support.
	(gomp_usm_alloc): New function.
	(gomp_usm_free): New function.
	(gomp_load_plugin_for_device): New function.
	* testsuite/libgomp.c/usm-1.c: New test.
	* testsuite/libgomp.c/usm-2.c: New test.
	* testsuite/libgomp.c/usm-3.c: New test.
	* testsuite/libgomp.c/usm-4.c: New test.
	* testsuite/libgomp.c/usm-5.c: New test.
---
 libgomp/allocator.c                 | 13 ++++--
 libgomp/config/linux/allocator.c    | 48 ++++++++++++--------
 libgomp/config/nvptx/allocator.c    |  6 +++
 libgomp/libgomp-plugin.h            |  3 ++
 libgomp/libgomp.h                   |  6 +++
 libgomp/omp.h.in                    |  4 ++
 libgomp/omp_lib.f90.in              |  8 ++++
 libgomp/plugin/plugin-nvptx.c       | 45 ++++++++++++++++---
 libgomp/target.c                    | 70 +++++++++++++++++++++++++++++
 libgomp/testsuite/libgomp.c/usm-1.c | 24 ++++++++++
 libgomp/testsuite/libgomp.c/usm-2.c | 32 +++++++++++++
 libgomp/testsuite/libgomp.c/usm-3.c | 35 +++++++++++++++
 libgomp/testsuite/libgomp.c/usm-4.c | 36 +++++++++++++++
 libgomp/testsuite/libgomp.c/usm-5.c | 28 ++++++++++++
 14 files changed, 330 insertions(+), 28 deletions(-)
 create mode 100644 libgomp/testsuite/libgomp.c/usm-1.c
 create mode 100644 libgomp/testsuite/libgomp.c/usm-2.c
 create mode 100644 libgomp/testsuite/libgomp.c/usm-3.c
 create mode 100644 libgomp/testsuite/libgomp.c/usm-4.c
 create mode 100644 libgomp/testsuite/libgomp.c/usm-5.c

Message ID	20220308113059.688551-4-abidh@codesourcery.com
State	New
Headers	DMARC-Filter: OpenDMARC Filter v1.4.1 sourceware.org A1AD0385C416 IronPort-SDR: JZZHiNP3GpYlzWLIMpdljhrJDj4fjyewPy8zBspVPpQY/hKqa2U0Z/PIDLpeQla8T3YWjk74+4 w79hCE53jJV/kpapt9GEOiDYNDnRNuL5pdMBeVr23OD7T/1paQwkoAQpv1xxnHJL0+8I3swGCu 7nINlPYlWZpD3XbMIHfJQEe447A/83LwF0gVzTR5YzC9AabEyRQkNO39ueUYWZpzifjfpeRP2l 2B+xHuC+BDdn7aWS0YaQ/CMPxrc9ox9UqAXEZM0VFx+B5KuQG/KVGkZNVdLI0qkqN3jSu08tk1 LRM= From: Hafiz Abid Qadeer <abidh@codesourcery.com> To: <gcc-patches@gcc.gnu.org>, <fortran@gcc.gnu.org> Subject: [PATCH 3/5] openmp, nvptx: ompx_unified_shared_mem_alloc Date: Tue, 8 Mar 2022 11:30:57 +0000 Message-ID: <20220308113059.688551-4-abidh@codesourcery.com> In-Reply-To: <20220308113059.688551-1-abidh@codesourcery.com> References: <20220308113059.688551-1-abidh@codesourcery.com> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Content-Type: text/plain Precedence: list Cc: jakub@redhat.com, ams@codesourcery.com, joseph@codesourcery.com Errors-To: gcc-patches-bounces+patchwork=sourceware.org@gcc.gnu.org Sender: "Gcc-patches" <gcc-patches-bounces+patchwork=sourceware.org@gcc.gnu.org>
Series	openmp: Handle pinned and unified shared memory. \| [0/5] openmp: Handle pinned and unified shared memory. [1/5] openmp: Add -foffload-memory [2/5] openmp: allow requires unified_shared_memory [3/5] openmp, nvptx: ompx_unified_shared_mem_alloc [4/5] openmp: Use libgomp memory allocation functions with unified shared memory. [5/5] openmp: -foffload-memory=pinned

[3/5] openmp, nvptx: ompx_unified_shared_mem_alloc

Commit Message

Comments

Patch