From patchwork Wed Apr 5 16:21:43 2023
Content-Type: text/plain; charset="utf-8"
MIME-Version: 1.0
Content-Transfer-Encoding: 7bit
X-Patchwork-Submitter: "H.J. Lu"
X-Patchwork-Id: 67417
From: "H.J. Lu"
To: libc-alpha@sourceware.org
Subject: [PATCH 18/19] : Add AMX-COMPLEX support
Date: Wed, 5 Apr 2023 09:21:43 -0700
Message-Id: <20230405162144.984598-19-hjl.tools@gmail.com>
In-Reply-To: <20230405162144.984598-1-hjl.tools@gmail.com>
References: <20230405162144.984598-1-hjl.tools@gmail.com>
List-Id: Libc-alpha mailing list

Add AMX-COMPLEX support to .

Reviewed-by: Noah Goldstein
---
 manual/platform.texi               | 3 +++
 sysdeps/x86/bits/platform/x86.h    | 1 +
 sysdeps/x86/cpu-features.c         | 2 ++
 sysdeps/x86/include/cpu-features.h | 3 +++
 sysdeps/x86/tst-get-cpu-features.c | 2 ++
 5 files changed, 11 insertions(+)

diff --git a/manual/platform.texi b/manual/platform.texi
index 1e120993d7..e7448ffc1a 100644
--- a/manual/platform.texi
+++ b/manual/platform.texi
@@ -197,6 +197,9 @@ The supported processor features are:
 @item
 @code{AMX_BF16} -- Tile computational operations on bfloat16 numbers.
 
+@item
+@code{AMX_COMPLEX} -- Tile computational operations on complex FP16 numbers.
+
 @item
 @code{AMX_INT8} -- Tile computational operations on 8-bit numbers.
 
diff --git a/sysdeps/x86/bits/platform/x86.h b/sysdeps/x86/bits/platform/x86.h
index d8ba33bd42..96eb4c070d 100644
--- a/sysdeps/x86/bits/platform/x86.h
+++ b/sysdeps/x86/bits/platform/x86.h
@@ -310,6 +310,7 @@ enum
 
   x86_cpu_AVX_VNNI_INT8 = x86_cpu_index_7_ecx_1_edx + 4,
   x86_cpu_AVX_NE_CONVERT = x86_cpu_index_7_ecx_1_edx + 5,
+  x86_cpu_AMX_COMPLEX = x86_cpu_index_7_ecx_1_edx + 8,
 
   x86_cpu_index_19_ebx
     = (CPUID_INDEX_19 * 8 * 4 * sizeof (unsigned int)
diff --git a/sysdeps/x86/cpu-features.c b/sysdeps/x86/cpu-features.c
index dfd1b85dce..c2bea6a32d 100644
--- a/sysdeps/x86/cpu-features.c
+++ b/sysdeps/x86/cpu-features.c
@@ -221,6 +221,8 @@ update_active (struct cpu_features *cpu_features)
           CPU_FEATURE_SET_ACTIVE (cpu_features, AMX_INT8);
           /* Determine if AMX_FP16 is usable. */
           CPU_FEATURE_SET_ACTIVE (cpu_features, AMX_FP16);
+          /* Determine if AMX_COMPLEX is usable. */
+          CPU_FEATURE_SET_ACTIVE (cpu_features, AMX_COMPLEX);
         }
 
       /* These features are usable only when OSXSAVE is enabled. */
diff --git a/sysdeps/x86/include/cpu-features.h b/sysdeps/x86/include/cpu-features.h
index 673cf8ca92..f14c1078d5 100644
--- a/sysdeps/x86/include/cpu-features.h
+++ b/sysdeps/x86/include/cpu-features.h
@@ -317,6 +317,7 @@ enum
 /* EDX. */
 #define bit_cpu_AVX_VNNI_INT8 (1u << 4)
 #define bit_cpu_AVX_NE_CONVERT (1u << 5)
+#define bit_cpu_AMX_COMPLEX (1u << 8)
 
 /* CPUID_INDEX_19. */
 
@@ -558,6 +559,7 @@ enum
 #define index_cpu_LAM CPUID_INDEX_7_ECX_1
 #define index_cpu_AVX_VNNI_INT8 CPUID_INDEX_7_ECX_1
 #define index_cpu_AVX_NE_CONVERT CPUID_INDEX_7_ECX_1
+#define index_cpu_AMX_COMPLEX CPUID_INDEX_7_ECX_1
 
 /* CPUID_INDEX_19. */
 
@@ -801,6 +803,7 @@ enum
 /* EDX. */
 #define reg_AVX_VNNI_INT8 edx
 #define reg_AVX_NE_CONVERT edx
+#define reg_AMX_COMPLEX edx
 
 /* CPUID_INDEX_19. */
 
diff --git a/sysdeps/x86/tst-get-cpu-features.c b/sysdeps/x86/tst-get-cpu-features.c
index bb1b67fd1c..87fe27340f 100644
--- a/sysdeps/x86/tst-get-cpu-features.c
+++ b/sysdeps/x86/tst-get-cpu-features.c
@@ -217,6 +217,7 @@ do_test (void)
   CHECK_CPU_FEATURE_PRESENT (MSRLIST);
   CHECK_CPU_FEATURE_PRESENT (AVX_VNNI_INT8);
   CHECK_CPU_FEATURE_PRESENT (AVX_NE_CONVERT);
+  CHECK_CPU_FEATURE_PRESENT (AMX_COMPLEX);
   CHECK_CPU_FEATURE_PRESENT (AESKLE);
   CHECK_CPU_FEATURE_PRESENT (WIDE_KL);
   CHECK_CPU_FEATURE_PRESENT (PTWRITE);
@@ -386,6 +387,7 @@ do_test (void)
   CHECK_CPU_FEATURE_ACTIVE (AVX_IFMA);
   CHECK_CPU_FEATURE_ACTIVE (AVX_VNNI_INT8);
   CHECK_CPU_FEATURE_ACTIVE (AVX_NE_CONVERT);
+  CHECK_CPU_FEATURE_ACTIVE (AMX_COMPLEX);
   CHECK_CPU_FEATURE_ACTIVE (AESKLE);
   CHECK_CPU_FEATURE_ACTIVE (WIDE_KL);
   CHECK_CPU_FEATURE_ACTIVE (PTWRITE);