[device-libs]: Compile failure on amd/device-libs/ockl/src/cluster.cl with LLVM 21 on Gentoo #1517

@mattmcadoo

Problem Description

OS:
NAME='Gentoo'
VERSION='2.18'
CPU:
model name : AMD Ryzen Threadripper PRO 3955WX 16-Cores
GPU:
Name: AMD Ryzen Threadripper PRO 3955WX 16-Cores
Marketing Name: AMD Ryzen Threadripper PRO 3955WX 16-Cores
Name: gfx1012
Marketing Name: AMD Radeon Pro W5500
Name: amdgcn-amd-amdhsa--gfx1012:xnack-
Name: amdgcn-amd-amdhsa--gfx10-1-generic:xnack-

FAILED: [code=1] ockl/cluster.bc /var/tmp/portage/dev-libs/rocm-device-libs-7.2.0/work/llvm-project-rocm-7.2.0/amd/device-libs_build/ockl/cluster.bc
cd /var/tmp/portage/dev-libs/rocm-device-libs-7.2.0/work/llvm-project-rocm-7.2.0/amd/device-libs_build/ockl && /usr/lib/llvm/21/bin/clang-21 -I/var/tmp/portage/dev-libs/rocm-device-libs-7.2.0/work/llvm-project-rocm-7.2.0/amd/device-libs/ockl/../irif/inc -I/var/tmp/portage/dev-libs/rocm-device-libs-7.2.0/work/llvm-project-rocm-7.2.0/amd/device-libs/ockl/../oclc/inc -I/var/tmp/portage/dev-libs/rocm-device-libs-7.2.0/work/llvm-project-rocm-7.2.0/amd/device-libs/ockl/inc -fcolor-diagnostics -Werror -Wno-error=atomic-alignment -x cl -Xclang -cl-std=CL2.0 -target amdgcn-amd-amdhsa -fvisibility=hidden -fomit-frame-pointer -Xclang -finclude-default-header -Xclang -fexperimental-strict-floating-point -Xclang -fdenormal-fp-math=dynamic -nogpulib -cl-no-stdinc -Xclang -mcode-object-version=none -emit-llvm -c /var/tmp/portage/dev-libs/rocm-device-libs-7.2.0/work/llvm-project-rocm-7.2.0/amd/device-libs/ockl/src/cluster.cl -o /var/tmp/portage/dev-libs/rocm-device-libs-7.2.0/work/llvm-project-rocm-7.2.0/amd/device-libs_build/ockl/cluster.bc -MD -MF /var/tmp/portage/dev-libs/rocm-device-libs-7.2.0/work/llvm-project-rocm-7.2.0/amd/device-libs_build/ockl/cluster.cl.d && /usr/bin/cmake -E cmake_transform_depfile Ninja gccdepfile /var/tmp/portage/dev-libs/rocm-device-libs-7.2.0/work/llvm-project-rocm-7.2.0/amd/device-libs /var/tmp/portage/dev-libs/rocm-device-libs-7.2.0/work/llvm-project-rocm-7.2.0/amd/device-libs/ockl /var/tmp/portage/dev-libs/rocm-device-libs-7.2.0/work/llvm-project-rocm-7.2.0/amd/device-libs_build /var/tmp/portage/dev-libs/rocm-device-libs-7.2.0/work/llvm-project-rocm-7.2.0/amd/device-libs_build/ockl /var/tmp/portage/dev-libs/rocm-device-libs-7.2.0/work/llvm-project-rocm-7.2.0/amd/device-libs_build/ockl/cluster.cl.d /var/tmp/portage/dev-libs/rocm-device-libs-7.2.0/work/llvm-project-rocm-7.2.0/amd/device-libs_build/CMakeFiles/d/9cb277e447a0840f639afb4d76af9594233b317d09f80da5e7f68083027135ce.d
/var/tmp/portage/dev-libs/rocm-device-libs-7.2.0/work/llvm-project-rocm-7.2.0/amd/device-libs/ockl/src/cluster.cl:16:16: error: use of undeclared identifier '__builtin_amdgcn_cluster_workgroup_max_id_x'
   16 |         return __builtin_amdgcn_cluster_workgroup_max_id_x() + 1;
      |                ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/var/tmp/portage/dev-libs/rocm-device-libs-7.2.0/work/llvm-project-rocm-7.2.0/amd/device-libs/ockl/src/cluster.cl:18:16: error: use of undeclared identifier '__builtin_amdgcn_cluster_workgroup_max_id_y'
   18 |         return __builtin_amdgcn_cluster_workgroup_max_id_y() + 1;
      |                ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/var/tmp/portage/dev-libs/rocm-device-libs-7.2.0/work/llvm-project-rocm-7.2.0/amd/device-libs/ockl/src/cluster.cl:20:16: error: use of undeclared identifier '__builtin_amdgcn_cluster_workgroup_max_id_z'
   20 |         return __builtin_amdgcn_cluster_workgroup_max_id_z() + 1;
      |                ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/var/tmp/portage/dev-libs/rocm-device-libs-7.2.0/work/llvm-project-rocm-7.2.0/amd/device-libs/ockl/src/cluster.cl:31:16: error: use of undeclared identifier '__builtin_amdgcn_cluster_workgroup_id_x'
   31 |         return __builtin_amdgcn_cluster_workgroup_id_x();
      |                ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/var/tmp/portage/dev-libs/rocm-device-libs-7.2.0/work/llvm-project-rocm-7.2.0/amd/device-libs/ockl/src/cluster.cl:33:16: error: use of undeclared identifier '__builtin_amdgcn_cluster_workgroup_id_y'
   33 |         return __builtin_amdgcn_cluster_workgroup_id_y();
      |                ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/var/tmp/portage/dev-libs/rocm-device-libs-7.2.0/work/llvm-project-rocm-7.2.0/amd/device-libs/ockl/src/cluster.cl:35:16: error: use of undeclared identifier '__builtin_amdgcn_cluster_workgroup_id_z'
   35 |         return __builtin_amdgcn_cluster_workgroup_id_z();
      |                ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/var/tmp/portage/dev-libs/rocm-device-libs-7.2.0/work/llvm-project-rocm-7.2.0/amd/device-libs/ockl/src/cluster.cl:44:12: error: use of undeclared identifier '__builtin_amdgcn_cluster_workgroup_max_flat_id'
   44 |     return __builtin_amdgcn_cluster_workgroup_max_flat_id() + 1;
      |            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
7 errors generated.

Operating System

NAME='Gentoo' VERSION='2.18'

CPU

model name : AMD Ryzen Threadripper PRO 3955WX 16-Cores

GPU

Name: gfx1012
Marketing Name: AMD Radeon Pro W5500
Name: amdgcn-amd-amdhsa--gfx1012:xnack-
Name: amdgcn-amd-amdhsa--gfx10-1-generic:xnack-

ROCm Version

7.2.0

ROCm Component

No response

Steps to Reproduce

No response

(Optional for Linux users) Output of /opt/rocm/bin/rocminfo --support

# rocminfo --support
=====================
HSA System Attributes
=====================
Runtime Version:         1.1
Runtime Ext Version:     1.14
System Timestamp Freq.:  1000.000000MHz
Sig. Max Wait Duration:  18446744073709551615 (0xFFFFFFFFFFFFFFFF) (timestamp count)
Machine Model:           LARGE
System Endianness:       LITTLE
Mwaitx:                  DISABLED
XNACK enabled:           NO
DMAbuf Support:          YES
VMM Support:             YES

==========
HSA Agents
==========
*******
Agent 1
*******
  Name:                    AMD Ryzen Threadripper PRO 3955WX 16-Cores
  Uuid:                    CPU-XX
  Marketing Name:          AMD Ryzen Threadripper PRO 3955WX 16-Cores
  Vendor Name:             CPU
  Feature:                 None specified
  Profile:                 FULL_PROFILE
  Float Round Mode:        NEAR
  Max Queue Number:        0(0x0)
  Queue Min Size:          0(0x0)
  Queue Max Size:          0(0x0)
  Queue Type:              MULTI
  Node:                    0
  Device Type:             CPU
  Cache Info:
    L1:                      32768(0x8000) KB
  Chip ID:                 0(0x0)
  ASIC Revision:           0(0x0)
  Cacheline Size:          64(0x40)
  Max Clock Freq. (MHz):   4405
  BDFID:                   0
  Internal Node ID:        0
  Compute Unit:            32
  SIMDs per CU:            0
  Shader Engines:          0
  Shader Arrs. per Eng.:   0
  WatchPts on Addr. Ranges:1
  Memory Properties:
  Features:                None
  Pool Info:
    Pool 1
      Segment:                 GLOBAL; FLAGS: FINE GRAINED
      Size:                    395800364(0x17976f2c) KB
      Allocatable:             TRUE
      Alloc Granule:           4KB
      Alloc Recommended Granule:4KB
      Alloc Alignment:         4KB
      Accessible by all:       TRUE
    Pool 2
      Segment:                 GLOBAL; FLAGS: EXTENDED FINE GRAINED
      Size:                    395800364(0x17976f2c) KB
      Allocatable:             TRUE
      Alloc Granule:           4KB
      Alloc Recommended Granule:4KB
      Alloc Alignment:         4KB
      Accessible by all:       TRUE
    Pool 3
      Segment:                 GLOBAL; FLAGS: KERNARG, FINE GRAINED
      Size:                    395800364(0x17976f2c) KB
      Allocatable:             TRUE
      Alloc Granule:           4KB
      Alloc Recommended Granule:4KB
      Alloc Alignment:         4KB
      Accessible by all:       TRUE
    Pool 4
      Segment:                 GLOBAL; FLAGS: COARSE GRAINED
      Size:                    395800364(0x17976f2c) KB
      Allocatable:             TRUE
      Alloc Granule:           4KB
      Alloc Recommended Granule:4KB
      Alloc Alignment:         4KB
      Accessible by all:       TRUE
  ISA Info:
*******
Agent 2
*******
  Name:                    gfx1012
  Uuid:                    GPU-XX
  Marketing Name:          AMD Radeon Pro W5500
  Vendor Name:             AMD
  Feature:                 KERNEL_DISPATCH
  Profile:                 BASE_PROFILE
  Float Round Mode:        NEAR
  Max Queue Number:        128(0x80)
  Queue Min Size:          64(0x40)
  Queue Max Size:          131072(0x20000)
  Queue Type:              MULTI
  Node:                    1
  Device Type:             GPU
  Cache Info:
    L1:                      16(0x10) KB
    L2:                      2048(0x800) KB
  Chip ID:                 29505(0x7341)
  ASIC Revision:           1(0x1)
  Cacheline Size:          128(0x80)
  Max Clock Freq. (MHz):   1900
  BDFID:                   25344
  Internal Node ID:        1
  Compute Unit:            22
  SIMDs per CU:            2
  Shader Engines:          1
  Shader Arrs. per Eng.:   2
  WatchPts on Addr. Ranges:4
  Coherent Host Access:    FALSE
  Memory Properties:
  Features:                KERNEL_DISPATCH
  Fast F16 Operation:      TRUE
  Wavefront Size:          32(0x20)
  Workgroup Max Size:      1024(0x400)
  Workgroup Max Size per Dimension:
    x                        1024(0x400)
    y                        1024(0x400)
    z                        1024(0x400)
  Max Waves Per CU:        40(0x28)
  Max Work-item Per CU:    1280(0x500)
  Grid Max Size:           4294967295(0xffffffff)
  Grid Max Size per Dimension:
    x                        2147483647(0x7fffffff)
    y                        65535(0xffff)
    z                        65535(0xffff)
  Max fbarriers/Workgrp:   32
  Packet Processor uCode:: 123
  SDMA engine uCode::      41
  IOMMU Support::          None
  Pool Info:
    Pool 1
      Segment:                 GLOBAL; FLAGS: COARSE GRAINED
      Size:                    8372224(0x7fc000) KB
      Allocatable:             TRUE
      Alloc Granule:           4KB
      Alloc Recommended Granule:2048KB
      Alloc Alignment:         4KB
      Accessible by all:       FALSE
    Pool 2
      Segment:                 GLOBAL; FLAGS: EXTENDED FINE GRAINED
      Size:                    8372224(0x7fc000) KB
      Allocatable:             TRUE
      Alloc Granule:           4KB
      Alloc Recommended Granule:2048KB
      Alloc Alignment:         4KB
      Accessible by all:       FALSE
    Pool 3
      Segment:                 GROUP
      Size:                    64(0x40) KB
      Allocatable:             FALSE
      Alloc Granule:           0KB
      Alloc Recommended Granule:0KB
      Alloc Alignment:         0KB
      Accessible by all:       FALSE
  ISA Info:
    ISA 1
      Name:                    amdgcn-amd-amdhsa--gfx1012:xnack-
      Machine Models:          HSA_MACHINE_MODEL_LARGE
      Profiles:                HSA_PROFILE_BASE
      Default Rounding Mode:   NEAR
      Default Rounding Mode:   NEAR
      Fast f16:                TRUE
      Workgroup Max Size:      1024(0x400)
      Workgroup Max Size per Dimension:
        x                        1024(0x400)
        y                        1024(0x400)
        z                        1024(0x400)
      Grid Max Size:           4294967295(0xffffffff)
      Grid Max Size per Dimension:
        x                        2147483647(0x7fffffff)
        y                        65535(0xffff)
        z                        65535(0xffff)
      FBarrier Max Size:       32
    ISA 2
      Name:                    amdgcn-amd-amdhsa--gfx10-1-generic:xnack-
      Machine Models:          HSA_MACHINE_MODEL_LARGE
      Profiles:                HSA_PROFILE_BASE
      Default Rounding Mode:   NEAR
      Default Rounding Mode:   NEAR
      Fast f16:                TRUE
      Workgroup Max Size:      1024(0x400)
      Workgroup Max Size per Dimension:
        x                        1024(0x400)
        y                        1024(0x400)
        z                        1024(0x400)
      Grid Max Size:           4294967295(0xffffffff)
      Grid Max Size per Dimension:
        x                        2147483647(0x7fffffff)
        y                        65535(0xffff)
        z                        65535(0xffff)
      FBarrier Max Size:       32
*** Done ***

Additional Information

The failing file, amd/device-libs/ockl/src/cluster.cl, was added in commit fcc50fb.
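
For illustration, one way a source file could tolerate a compiler that lacks these builtins is to guard each use with `__has_builtin`. The sketch below is plain C so that it also compiles on a host toolchain. The helper name `cluster_num_workgroups_x` and the fallback value of 1 are assumptions made for this example, not the device-libs API, and nothing here is confirmed to be the upstream fix.

```c
/* __has_builtin exists in Clang and GCC 10+; treat it as "never" elsewhere. */
#ifndef __has_builtin
#define __has_builtin(x) 0
#endif

/* Hypothetical helper (not part of device-libs): number of workgroups in
 * the cluster along x, or 1 when the compiler has no cluster builtins. */
static unsigned cluster_num_workgroups_x(void)
{
#if __has_builtin(__builtin_amdgcn_cluster_workgroup_max_id_x)
    /* Same expression as cluster.cl line 16 in the build log above. */
    return __builtin_amdgcn_cluster_workgroup_max_id_x() + 1;
#else
    /* Assumed fallback: no cluster support, so a single workgroup. */
    return 1u;
#endif
}
```

With a compiler that does not declare the builtin, which appears to be the case for the clang-21 in the log, the fallback branch is taken instead of failing with "use of undeclared identifier". Whether a silent fallback is acceptable for device-libs is a separate question.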
