forked from llvm/llvm-project
-
Notifications
You must be signed in to change notification settings - Fork 82
Open
Labels
Description
Problem Description
OS:
NAME='Gentoo'
VERSION='2.18'
CPU:
model name : AMD Ryzen Threadripper PRO 3955WX 16-Cores
GPU:
Name: AMD Ryzen Threadripper PRO 3955WX 16-Cores
Marketing Name: AMD Ryzen Threadripper PRO 3955WX 16-Cores
Name: gfx1012
Marketing Name: AMD Radeon Pro W5500
Name: amdgcn-amd-amdhsa--gfx1012:xnack-
Name: amdgcn-amd-amdhsa--gfx10-1-generic:xnack-
FAILED: [code=1] ockl/cluster.bc /var/tmp/portage/dev-libs/rocm-device-libs-7.2.0/work/llvm-project-rocm-7.2.0/amd/device-libs_build/ockl/cluster.bc
cd /var/tmp/portage/dev-libs/rocm-device-libs-7.2.0/work/llvm-project-rocm-7.2.0/amd/device-libs_build/ockl && /usr/lib/llvm/21/bin/clang-21 -I/var/tmp/portage/dev-libs/rocm-device-libs-7.2.0/work/llvm-project-rocm-7.2.0/amd/device-libs/ockl/../irif/inc -I/var/tmp/portage/dev-libs/rocm-device-libs-7.2.0/work/llvm-project-rocm-7.2.0/amd/device-libs/ockl/../oclc/inc -I/var/tmp/portage/dev-libs/rocm-device-libs-7.2.0/work/llvm-project-rocm-7.2.0/amd/device-libs/ockl/inc -fcolor-diagnostics -Werror -Wno-error=atomic-alignment -x cl -Xclang -cl-std=CL2.0 -target amdgcn-amd-amdhsa -fvisibility=hidden -fomit-frame-pointer -Xclang -finclude-default-header -Xclang -fexperimental-strict-floating-point -Xclang -fdenormal-fp-math=dynamic -nogpulib -cl-no-stdinc -Xclang -mcode-object-version=none -emit-llvm -c /var/tmp/portage/dev-libs/rocm-device-libs-7.2.0/work/llvm-project-rocm-7.2.0/amd/device-libs/ockl/src/cluster.cl -o /var/tmp/portage/dev-libs/rocm-device-libs-7.2.0/work/llvm-project-rocm-7.2.0/amd/device-libs_build/ockl/cluster.bc -MD -MF /var/tmp/portage/dev-libs/rocm-device-libs-7.2.0/work/llvm-project-rocm-7.2.0/amd/device-libs_build/ockl/cluster.cl.d && /usr/bin/cmake -E cmake_transform_depfile Ninja gccdepfile /var/tmp/portage/dev-libs/rocm-device-libs-7.2.0/work/llvm-project-rocm-7.2.0/amd/device-libs /var/tmp/portage/dev-libs/rocm-device-libs-7.2.0/work/llvm-project-rocm-7.2.0/amd/device-libs/ockl /var/tmp/portage/dev-libs/rocm-device-libs-7.2.0/work/llvm-project-rocm-7.2.0/amd/device-libs_build /var/tmp/portage/dev-libs/rocm-device-libs-7.2.0/work/llvm-project-rocm-7.2.0/amd/device-libs_build/ockl /var/tmp/portage/dev-libs/rocm-device-libs-7.2.0/work/llvm-project-rocm-7.2.0/amd/device-libs_build/ockl/cluster.cl.d /var/tmp/portage/dev-libs/rocm-device-libs-7.2.0/work/llvm-project-rocm-7.2.0/amd/device-libs_build/CMakeFiles/d/9cb277e447a0840f639afb4d76af9594233b317d09f80da5e7f68083027135ce.d
/var/tmp/portage/dev-libs/rocm-device-libs-7.2.0/work/llvm-project-rocm-7.2.0/amd/device-libs/ockl/src/cluster.cl:16:16: error: use of undeclared identifier '__builtin_amdgcn_cluster_workgroup_max_id_x'
16 | return __builtin_amdgcn_cluster_workgroup_max_id_x() + 1;
| ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/var/tmp/portage/dev-libs/rocm-device-libs-7.2.0/work/llvm-project-rocm-7.2.0/amd/device-libs/ockl/src/cluster.cl:18:16: error: use of undeclared identifier '__builtin_amdgcn_cluster_workgroup_max_id_y'
18 | return __builtin_amdgcn_cluster_workgroup_max_id_y() + 1;
| ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/var/tmp/portage/dev-libs/rocm-device-libs-7.2.0/work/llvm-project-rocm-7.2.0/amd/device-libs/ockl/src/cluster.cl:20:16: error: use of undeclared identifier '__builtin_amdgcn_cluster_workgroup_max_id_z'
20 | return __builtin_amdgcn_cluster_workgroup_max_id_z() + 1;
| ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/var/tmp/portage/dev-libs/rocm-device-libs-7.2.0/work/llvm-project-rocm-7.2.0/amd/device-libs/ockl/src/cluster.cl:31:16: error: use of undeclared identifier '__builtin_amdgcn_cluster_workgroup_id_x'
31 | return __builtin_amdgcn_cluster_workgroup_id_x();
| ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/var/tmp/portage/dev-libs/rocm-device-libs-7.2.0/work/llvm-project-rocm-7.2.0/amd/device-libs/ockl/src/cluster.cl:33:16: error: use of undeclared identifier '__builtin_amdgcn_cluster_workgroup_id_y'
33 | return __builtin_amdgcn_cluster_workgroup_id_y();
| ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/var/tmp/portage/dev-libs/rocm-device-libs-7.2.0/work/llvm-project-rocm-7.2.0/amd/device-libs/ockl/src/cluster.cl:35:16: error: use of undeclared identifier '__builtin_amdgcn_cluster_workgroup_id_z'
35 | return __builtin_amdgcn_cluster_workgroup_id_z();
| ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/var/tmp/portage/dev-libs/rocm-device-libs-7.2.0/work/llvm-project-rocm-7.2.0/amd/device-libs/ockl/src/cluster.cl:44:12: error: use of undeclared identifier '__builtin_amdgcn_cluster_workgroup_max_flat_id'
44 | return __builtin_amdgcn_cluster_workgroup_max_flat_id() + 1;
| ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
7 errors generated.
Operating System
NAME='Gentoo' VERSION='2.18'
CPU
model name : AMD Ryzen Threadripper PRO 3955WX 16-Cores
GPU
Name: gfx1012 Marketing Name: AMD Radeon Pro W5500 Name: amdgcn-amd-amdhsa--gfx1012:xnack- Name: amdgcn-amd-amdhsa--gfx10-1-generic:xnack-
ROCm Version
7.2.0
ROCm Component
No response
Steps to Reproduce
No response
(Optional for Linux users) Output of /opt/rocm/bin/rocminfo --support
# rocminfo --support
=====================
HSA System Attributes
=====================
Runtime Version: 1.1
Runtime Ext Version: 1.14
System Timestamp Freq.: 1000.000000MHz
Sig. Max Wait Duration: 18446744073709551615 (0xFFFFFFFFFFFFFFFF) (timestamp count)
Machine Model: LARGE
System Endianness: LITTLE
Mwaitx: DISABLED
XNACK enabled: NO
DMAbuf Support: YES
VMM Support: YES
==========
HSA Agents
==========
*******
Agent 1
*******
Name: AMD Ryzen Threadripper PRO 3955WX 16-Cores
Uuid: CPU-XX
Marketing Name: AMD Ryzen Threadripper PRO 3955WX 16-Cores
Vendor Name: CPU
Feature: None specified
Profile: FULL_PROFILE
Float Round Mode: NEAR
Max Queue Number: 0(0x0)
Queue Min Size: 0(0x0)
Queue Max Size: 0(0x0)
Queue Type: MULTI
Node: 0
Device Type: CPU
Cache Info:
L1: 32768(0x8000) KB
Chip ID: 0(0x0)
ASIC Revision: 0(0x0)
Cacheline Size: 64(0x40)
Max Clock Freq. (MHz): 4405
BDFID: 0
Internal Node ID: 0
Compute Unit: 32
SIMDs per CU: 0
Shader Engines: 0
Shader Arrs. per Eng.: 0
WatchPts on Addr. Ranges:1
Memory Properties:
Features: None
Pool Info:
Pool 1
Segment: GLOBAL; FLAGS: FINE GRAINED
Size: 395800364(0x17976f2c) KB
Allocatable: TRUE
Alloc Granule: 4KB
Alloc Recommended Granule:4KB
Alloc Alignment: 4KB
Accessible by all: TRUE
Pool 2
Segment: GLOBAL; FLAGS: EXTENDED FINE GRAINED
Size: 395800364(0x17976f2c) KB
Allocatable: TRUE
Alloc Granule: 4KB
Alloc Recommended Granule:4KB
Alloc Alignment: 4KB
Accessible by all: TRUE
Pool 3
Segment: GLOBAL; FLAGS: KERNARG, FINE GRAINED
Size: 395800364(0x17976f2c) KB
Allocatable: TRUE
Alloc Granule: 4KB
Alloc Recommended Granule:4KB
Alloc Alignment: 4KB
Accessible by all: TRUE
Pool 4
Segment: GLOBAL; FLAGS: COARSE GRAINED
Size: 395800364(0x17976f2c) KB
Allocatable: TRUE
Alloc Granule: 4KB
Alloc Recommended Granule:4KB
Alloc Alignment: 4KB
Accessible by all: TRUE
ISA Info:
*******
Agent 2
*******
Name: gfx1012
Uuid: GPU-XX
Marketing Name: AMD Radeon Pro W5500
Vendor Name: AMD
Feature: KERNEL_DISPATCH
Profile: BASE_PROFILE
Float Round Mode: NEAR
Max Queue Number: 128(0x80)
Queue Min Size: 64(0x40)
Queue Max Size: 131072(0x20000)
Queue Type: MULTI
Node: 1
Device Type: GPU
Cache Info:
L1: 16(0x10) KB
L2: 2048(0x800) KB
Chip ID: 29505(0x7341)
ASIC Revision: 1(0x1)
Cacheline Size: 128(0x80)
Max Clock Freq. (MHz): 1900
BDFID: 25344
Internal Node ID: 1
Compute Unit: 22
SIMDs per CU: 2
Shader Engines: 1
Shader Arrs. per Eng.: 2
WatchPts on Addr. Ranges:4
Coherent Host Access: FALSE
Memory Properties:
Features: KERNEL_DISPATCH
Fast F16 Operation: TRUE
Wavefront Size: 32(0x20)
Workgroup Max Size: 1024(0x400)
Workgroup Max Size per Dimension:
x 1024(0x400)
y 1024(0x400)
z 1024(0x400)
Max Waves Per CU: 40(0x28)
Max Work-item Per CU: 1280(0x500)
Grid Max Size: 4294967295(0xffffffff)
Grid Max Size per Dimension:
x 2147483647(0x7fffffff)
y 65535(0xffff)
z 65535(0xffff)
Max fbarriers/Workgrp: 32
Packet Processor uCode:: 123
SDMA engine uCode:: 41
IOMMU Support:: None
Pool Info:
Pool 1
Segment: GLOBAL; FLAGS: COARSE GRAINED
Size: 8372224(0x7fc000) KB
Allocatable: TRUE
Alloc Granule: 4KB
Alloc Recommended Granule:2048KB
Alloc Alignment: 4KB
Accessible by all: FALSE
Pool 2
Segment: GLOBAL; FLAGS: EXTENDED FINE GRAINED
Size: 8372224(0x7fc000) KB
Allocatable: TRUE
Alloc Granule: 4KB
Alloc Recommended Granule:2048KB
Alloc Alignment: 4KB
Accessible by all: FALSE
Pool 3
Segment: GROUP
Size: 64(0x40) KB
Allocatable: FALSE
Alloc Granule: 0KB
Alloc Recommended Granule:0KB
Alloc Alignment: 0KB
Accessible by all: FALSE
ISA Info:
ISA 1
Name: amdgcn-amd-amdhsa--gfx1012:xnack-
Machine Models: HSA_MACHINE_MODEL_LARGE
Profiles: HSA_PROFILE_BASE
Default Rounding Mode: NEAR
Default Rounding Mode: NEAR
Fast f16: TRUE
Workgroup Max Size: 1024(0x400)
Workgroup Max Size per Dimension:
x 1024(0x400)
y 1024(0x400)
z 1024(0x400)
Grid Max Size: 4294967295(0xffffffff)
Grid Max Size per Dimension:
x 2147483647(0x7fffffff)
y 65535(0xffff)
z 65535(0xffff)
FBarrier Max Size: 32
ISA 2
Name: amdgcn-amd-amdhsa--gfx10-1-generic:xnack-
Machine Models: HSA_MACHINE_MODEL_LARGE
Profiles: HSA_PROFILE_BASE
Default Rounding Mode: NEAR
Default Rounding Mode: NEAR
Fast f16: TRUE
Workgroup Max Size: 1024(0x400)
Workgroup Max Size per Dimension:
x 1024(0x400)
y 1024(0x400)
z 1024(0x400)
Grid Max Size: 4294967295(0xffffffff)
Grid Max Size per Dimension:
x 2147483647(0x7fffffff)
y 65535(0xffff)
z 65535(0xffff)
FBarrier Max Size: 32
*** Done ***
Additional Information
This is the commit that added that file: fcc50fb
Reactions are currently unavailable