radeonopencompute / rocr-runtime

ROCm Platform Runtime: ROCr, an HPC-market-enhanced, HSA-based runtime

Home Page: https://rocm.docs.amd.com/projects/ROCR-Runtime/en/latest/

License: Other

CMake 1.18% C++ 86.77% C 11.12% Shell 0.31% Assembly 0.61%

rocr-runtime's Introduction

HSA Runtime API and runtime for ROCm

This repository includes the user-mode API interfaces and libraries necessary for host applications to launch compute kernels to available HSA ROCm kernel agents. Reference source code for the core runtime is also available.

Initial target platform requirements

  • CPU: Intel Haswell or newer, Core i5, Core i7, Xeon E3 v4 & v5; Xeon E5 v3
  • GPU: Fiji ASIC (AMD R9 Nano, R9 Fury and R9 Fury X)
  • GPU: Polaris ASIC (AMD RX480)

Source code

The HSA core runtime source code for the ROCR runtime is located in the src subdirectory. Please consult the associated README.md file for contents and build instructions.

Binaries for Ubuntu & Fedora and installation instructions

Pre-built binaries are available for installation from the ROCm package repository. For ROCR, they include:

Core runtime package:

  • HSA include files to support application development on the HSA runtime for the ROCR runtime
  • A 64-bit version of AMD's HSA core runtime for the ROCR runtime

Runtime extension package:

  • A 64-bit version of AMD's runtime tools library
  • A 64-bit version of AMD's runtime image library

The contents of these packages are installed in /opt/rocm/hsa and /opt/rocm by default. The core runtime package depends on the hsakmt-roct-dev package.
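
As a quick check after installation, the Python sketch below looks for the runtime library under the default prefixes named above. The `lib` subdirectory and the `libhsa-runtime64.so*` file-name pattern are assumptions (they match the library path that shows up in a backtrace further down this page), and `find_hsa_runtime` is a hypothetical helper, not something shipped with ROCm.

```python
import glob
import os

# Default install prefixes named in the text above.
DEFAULT_PREFIXES = ("/opt/rocm/hsa", "/opt/rocm")


def find_hsa_runtime(prefixes=DEFAULT_PREFIXES, pattern="libhsa-runtime64.so*"):
    """Return sorted paths matching `pattern` in `<prefix>/lib` for each prefix.

    The `lib` subdirectory and the file-name pattern are assumptions; adjust
    them if the packages on your system use a different layout.
    """
    found = []
    for prefix in prefixes:
        found.extend(glob.glob(os.path.join(prefix, "lib", pattern)))
    # De-duplicate in case the same prefix was listed more than once.
    return sorted(set(found))


if __name__ == "__main__":
    hits = find_hsa_runtime()
    if hits:
        for path in hits:
            print("found:", path)
    else:
        print("no HSA runtime library found under", ", ".join(DEFAULT_PREFIXES))
```

An empty result does not prove the runtime is missing; it may simply be installed under a different prefix.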

Installation instructions can be found in the ROCm manifest repository README.md:

https://github.com/RadeonOpenCompute/ROCm

Infrastructure

The HSA runtime is a thin, user-mode API that exposes the necessary interfaces to access and interact with graphics hardware driven by the AMDGPU driver set and the ROCK kernel driver. Together they enable programmers to directly harness the power of AMD discrete graphics devices by allowing host applications to launch compute kernels directly to the graphics hardware.

The capabilities expressed by the HSA Runtime API are:

  • Error handling
  • Runtime initialization and shutdown
  • System and agent information
  • Signals and synchronization
  • Architected dispatch
  • Memory management

The HSA runtime fits into a typical software architecture stack.

The HSA runtime provides direct access to the graphics hardware to give the programmer more control over execution. One example of low-level hardware access is support for one or more user-mode queues, which give programmers a low-latency kernel dispatch interface and allow them to develop customized dispatch algorithms specific to their application.
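
To make the user-mode queue idea more concrete, here is a toy Python model of a ring-buffer queue: the producer reserves a slot by bumping a write index, stores the packet, and rings a doorbell, while the consumer advances a read index. This is a simulation only. The names (`ToyQueue`, `submit`, `process_one`) are hypothetical, nothing here calls the HSA runtime, and the atomics and memory ordering a real queue needs are deliberately left out.

```python
class ToyQueue:
    """Toy single-threaded model of a user-mode ring-buffer queue."""

    def __init__(self, size):
        # A power-of-two size lets an index be mapped to a slot with a bit mask.
        if size <= 0 or size & (size - 1):
            raise ValueError("size must be a power of two")
        self.size = size
        self.slots = [None] * size
        self.write_index = 0  # next packet id the producer will reserve
        self.read_index = 0   # next packet id the consumer will process
        self.doorbell = -1    # last packet id announced to the consumer

    def submit(self, packet):
        """Reserve a slot, store the packet, ring the doorbell; return the packet id."""
        if self.write_index - self.read_index >= self.size:
            raise BufferError("queue full")
        packet_id = self.write_index
        self.write_index += 1
        self.slots[packet_id & (self.size - 1)] = packet
        self.doorbell = packet_id
        return packet_id

    def process_one(self):
        """Consume the oldest announced packet, or return None if nothing is pending."""
        if self.read_index > self.doorbell:
            return None
        slot = self.read_index & (self.size - 1)
        packet, self.slots[slot] = self.slots[slot], None
        self.read_index += 1
        return packet


if __name__ == "__main__":
    queue = ToyQueue(4)
    queue.submit("kernel A")
    queue.submit("kernel B")
    print(queue.process_one())
    print(queue.process_one())
```

Because the application owns the write side of the queue, it can decide when and in what order packets are submitted, which is what makes custom dispatch strategies possible.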

The HSA Architected Queuing Language (AQL) is an open standard, defined by the HSA Foundation, specifying the packet syntax used to control supported AMD/ATI Radeon (c) graphics devices. AQL supports several packet types, including packets that command the hardware to automatically resolve inter-packet dependencies (barrier-AND and barrier-OR packets), kernel dispatch packets, and agent dispatch packets.
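
Every AQL packet starts with a 16-bit header word that carries the packet type. The Python sketch below packs and decodes such a header. The enum values and bit offsets are assumptions based on the public HSA runtime header (`hsa_packet_type_t`, `hsa_packet_header_t`) and should be verified against the `hsa.h` shipped with your installation; `make_header` and `packet_type` are hypothetical helpers.

```python
from enum import IntEnum


class PacketType(IntEnum):
    # Values assumed to mirror hsa_packet_type_t in hsa.h.
    VENDOR_SPECIFIC = 0
    INVALID = 1
    KERNEL_DISPATCH = 2
    BARRIER_AND = 3
    AGENT_DISPATCH = 4
    BARRIER_OR = 5


class FenceScope(IntEnum):
    # Values assumed to mirror hsa_fence_scope_t in hsa.h.
    NONE = 0
    AGENT = 1
    SYSTEM = 2


# Assumed bit offsets of the fields inside the 16-bit header.
TYPE_SHIFT = 0      # 8 bits: packet type
BARRIER_SHIFT = 8   # 1 bit: wait for all earlier packets to complete
ACQUIRE_SHIFT = 9   # 2 bits: acquire fence scope
RELEASE_SHIFT = 11  # 2 bits: release fence scope


def make_header(ptype, barrier=False,
                acquire=FenceScope.SYSTEM, release=FenceScope.SYSTEM):
    """Pack the header fields into a 16-bit integer."""
    header = (
        (int(ptype) << TYPE_SHIFT)
        | (int(bool(barrier)) << BARRIER_SHIFT)
        | (int(acquire) << ACQUIRE_SHIFT)
        | (int(release) << RELEASE_SHIFT)
    )
    return header & 0xFFFF


def packet_type(header):
    """Extract the packet type from a packed header."""
    return PacketType(header & 0xFF)


if __name__ == "__main__":
    header = make_header(PacketType.KERNEL_DISPATCH)
    print(hex(header), packet_type(header).name)
```

In a real queue the header is typically the last field written for a packet, so that the packet processor never sees a half-initialized packet; that ordering is not modelled here.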

In addition to user-mode queues and AQL, the HSA runtime exposes various virtual address ranges that can be accessed by one or more of the system's graphics devices, and possibly the host. The exposed virtual address ranges support either fine-grained or coarse-grained access. Updates to memory in a fine-grained region are immediately visible to all devices that can access it, but only one device can have access to a coarse-grained allocation at a time. Ownership of a coarse-grained region can be changed using the HSA runtime memory APIs, but the host application must perform this transfer of ownership explicitly.
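
The ownership rule for coarse-grained memory can be illustrated with a small Python simulation. `CoarseGrainedBuffer` and its methods are hypothetical names used only for this sketch. In a real application the transfer goes through the HSA runtime memory APIs (`hsa_memory_assign_agent` is assumed here to be the relevant call; confirm this in the Programmer's Reference Manual).

```python
class CoarseGrainedBuffer:
    """Toy model of a coarse-grained allocation: only the current owner may access it."""

    def __init__(self, owner, size):
        self.owner = owner
        self.data = bytearray(size)

    def _check(self, agent, offset, length):
        if agent != self.owner:
            raise PermissionError(
                f"{agent} does not own this buffer (owner: {self.owner})"
            )
        if offset < 0 or length < 0 or offset + length > len(self.data):
            raise IndexError("access outside the allocation")

    def write(self, agent, offset, payload):
        self._check(agent, offset, len(payload))
        self.data[offset:offset + len(payload)] = payload

    def read(self, agent, offset, length):
        self._check(agent, offset, length)
        return bytes(self.data[offset:offset + length])

    def assign(self, new_owner):
        """Explicit ownership transfer, performed by the host application."""
        self.owner = new_owner


if __name__ == "__main__":
    buf = CoarseGrainedBuffer(owner="cpu", size=16)
    buf.write("cpu", 0, b"input")
    buf.assign("gpu0")  # the host hands the buffer to the GPU
    print(buf.read("gpu0", 0, 5))
```

A fine-grained region would need no `assign` step in this model, since every agent with access would see updates immediately.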

Programmers should consult the HSA Runtime Programmer's Reference Manual for a full description of the HSA Runtime APIs, AQL and the HSA memory policy.

Known issues

  • Each HSA process creates an internal DMA queue, but there is a system-wide limit of four DMA queues. When the limit is reached, HSA processes will use internal kernels for copies.
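
The fallback described in this known issue can be pictured with the toy Python model below. The limit of four comes from the text above; everything else (`DmaQueuePool`, `copy_path_for`, `release`, and the idea that a freed queue can be picked up by a later process) is a hypothetical simplification, not a description of the runtime's actual bookkeeping.

```python
SYSTEM_DMA_QUEUE_LIMIT = 4  # system-wide limit stated in the known issue


class DmaQueuePool:
    """Toy model of processes competing for a fixed number of DMA queues."""

    def __init__(self, limit=SYSTEM_DMA_QUEUE_LIMIT):
        self.limit = limit
        self.holders = set()  # process ids that currently hold a DMA queue

    def copy_path_for(self, pid):
        """Return 'dma' if `pid` holds or can obtain a queue, else 'kernel'."""
        if pid in self.holders:
            return "dma"
        if len(self.holders) < self.limit:
            self.holders.add(pid)
            return "dma"
        # Limit reached: copies fall back to internal copy kernels.
        return "kernel"

    def release(self, pid):
        """Give up the DMA queue held by `pid`, if any."""
        self.holders.discard(pid)


if __name__ == "__main__":
    pool = DmaQueuePool()
    for pid in range(6):
        print(pid, pool.copy_path_for(pid))
```

In this model the first four processes copy through DMA and any further process uses the kernel path until a queue is released.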

Disclaimer

The information contained herein is for informational purposes only, and is subject to change without notice. While every precaution has been taken in the preparation of this document, it may contain technical inaccuracies, omissions and typographical errors, and AMD is under no obligation to update or otherwise correct this information. Advanced Micro Devices, Inc. makes no representations or warranties with respect to the accuracy or completeness of the contents of this document, and assumes no liability of any kind, including the implied warranties of noninfringement, merchantability or fitness for particular purposes, with respect to the operation or use of AMD hardware, software or other products described herein. No license, including implied or arising by estoppel, to any intellectual property rights is granted by this document. Terms and limitations applicable to the purchase or use of AMD's products are as set forth in a signed agreement between the parties or in AMD's Standard Terms and Conditions of Sale.

AMD, the AMD Arrow logo, and combinations thereof are trademarks of Advanced Micro Devices, Inc. Other product names used in this publication are for identification purposes only and may be trademarks of their respective companies.

Copyright (c) 2014-2021 Advanced Micro Devices, Inc. All rights reserved.

rocr-runtime's People

Contributors

abhimeda, amd-jmacaran, chsigg, dayatsin-amd, dependabot[bot], diamondlovesyou, icarus-sparry, jcbaratault, jedwards-amd, jkelling, jvesely, kentrussell, matyas-streamhpc, peterjunpark, samjwu, skeelyamd, swraw, yhuiyh

rocr-runtime's Issues

CPU Node [0] has no GPU connected

I keep running into the same error when installing the ROCm stack from the Ubuntu repositories after a fresh system install (tried multiple times, starting from a fresh Ubuntu 16.10 install every time). The setup uses an AMD R9 Nano.
After installing by following the instructions from here (both with and without rocm-opencl), running the included vector_copy sample results in the following output:

CPU Node [0] has no GPU connected
Initializing the hsa runtime succeeded.
Checking finalizer 1.0 extension support succeeded. 
Generating function table for finalizer succeeded.
Getting a gpu agent failed.

Installing and running clinfo results in:

CPU Node [0] has no GPU connected
Number of platforms                               0

This is the dmesg output, where some ACPI errors occur (line 699+) and, further down, some EDAC errors (line 1242+). Those errors do not appear when booting into the non-ROCm kernel. Possibly related?

Page not present or supervisor privilege.

Hi, I am currently trying to get a pure HC-based implementation of GPU-STREAM running; see my topic branch (https://github.com/psteinb/GPU-STREAM/tree/bare_hc). The code compiles fine, but when I run the binary, I get the following output:

GPU-STREAM
Version: 2.1
Implementation: HC
Running kernels 100 times
Precision: double
Array size: 268.4 MB (=0.3 GB)
Total size: 805.3 MB (=0.8 GB)
Using HC device AMD HSA Agent gfx8032
Memory access fault by GPU node-2 on address 0x38000. Reason: Page not present or supervisor privilege.
Aborted (core dumped)

Running it under rocm-gdb, I see this stack trace, which points into the runtime:

(ROCm-gdb) bt
#0  0x00002aaaab87fc37 in raise () from /lib/x86_64-linux-gnu/libc.so.6
#1  0x00002aaaab883028 in abort () from /lib/x86_64-linux-gnu/libc.so.6
#2  0x00002aaaac0be0eb in ?? () from /opt/rocm/hsa/lib/libhsa-runtime64.so.1
#3  0x00002aaaac0c1ea5 in ?? () from /opt/rocm/hsa/lib/libhsa-runtime64.so.1
#4  0x00002aaaac0a0107 in ?? () from /opt/rocm/hsa/lib/libhsa-runtime64.so.1
#5  0x00002aaaab41d184 in start_thread () from /lib/x86_64-linux-gnu/libpthread.so.0
#6  0x00002aaaab94337d in clone () from /lib/x86_64-linux-gnu/libc.so.6

I am a bit confused by this problem and would be grateful for some pointers on how to resolve it.

Here is my environment, an Ubuntu 14.04.5 Haswell box with one R9 Nano (Fiji):

$ uname -a 
Linux islay 4.6.0-kfd-compute-rocm-rel-1.4-16 #1 SMP Tue Dec 13 13:14:21 EST 2016 x86_64 x86_64 x86_64 GNU/Linux
$ hcc --version
HCC clang version 3.5.0  (based on HCC 0.10.16501-81f0a2f-02246a0 LLVM 3.5.0svn)
Target: x86_64-unknown-linux-gnu
Thread model: posix

Status of Hawaii support under ROCm 3.0?

My test server runs Ubuntu 18.04 with the 4.15.0-60-generic kernel. There are two GPUs in the system: (1) a Radeon RX Vega (gfx900) card that works correctly with ROCm 3.0, and (2) an S9150 (gfx701) card that experiences problems.

As far as I understand from #68, the problems with Hawaii architecture support have been solved.

However, on my system I see correct output from rocm-smi and rocminfo, but clinfo gives no output and hangs until it is cancelled with Ctrl-C.

Please let me know whether ROCm 3.0 is supposed to work with Hawaii (and the S9150).
The key information about this system is given below:

$ rocminfo
ROCk module is loaded
user is member of video group
=====================    
HSA System Attributes    
=====================    
Runtime Version:         1.1
System Timestamp Freq.:  1000.000000MHz
Sig. Max Wait Duration:  18446744073709551615 (0xFFFFFFFFFFFFFFFF) (timestamp count)
Machine Model:           LARGE                              
System Endianness:       LITTLE                             

==========               
HSA Agents               
==========               
*******                  
Agent 1                  
*******                  
  Name:                    Intel(R) Xeon(R) CPU E5-2620 v4 @ 2.10GHz
  Marketing Name:          Intel(R) Xeon(R) CPU E5-2620 v4 @ 2.10GHz
  Vendor Name:             CPU                                
  Feature:                 None specified                     
  Profile:                 FULL_PROFILE                       
  Float Round Mode:        NEAR                               
  Max Queue Number:        0(0x0)                             
  Queue Min Size:          0(0x0)                             
  Queue Max Size:          0(0x0)                             
  Queue Type:              MULTI                              
  Node:                    0                                  
  Device Type:             CPU                                
  Cache Info:              
    L1:                      32768(0x8000) KB                   
  Chip ID:                 0(0x0)                             
  Cacheline Size:          64(0x40)                           
  Max Clock Freq. (MHz):   3000                               
  BDFID:                   0                                  
  Internal Node ID:        0                                  
  Compute Unit:            16                                 
  SIMDs per CU:            0                                  
  Shader Engines:          0                                  
  Shader Arrs. per Eng.:   0                                  
  WatchPts on Addr. Ranges:1                                  
  Features:                None
  Pool Info:               
    Pool 1                   
      Segment:                 GLOBAL; FLAGS: KERNARG, FINE GRAINED
      Size:                    16308652(0xf8d9ac) KB              
      Allocatable:             TRUE                               
      Alloc Granule:           4KB                                
      Alloc Alignment:         4KB                                
      Acessible by all:        TRUE                               
    Pool 2                   
      Segment:                 GLOBAL; FLAGS: COARSE GRAINED      
      Size:                    16308652(0xf8d9ac) KB              
      Allocatable:             TRUE                               
      Alloc Granule:           4KB                                
      Alloc Alignment:         4KB                                
      Acessible by all:        TRUE                               
  ISA Info:                
    N/A                      
*******                  
Agent 2                  
*******                  
  Name:                    gfx900                             
  Marketing Name:          Vega [Radeon RX Vega]              
  Vendor Name:             AMD                                
  Feature:                 KERNEL_DISPATCH                    
  Profile:                 BASE_PROFILE                       
  Float Round Mode:        NEAR                               
  Max Queue Number:        128(0x80)                          
  Queue Min Size:          4096(0x1000)                       
  Queue Max Size:          131072(0x20000)                    
  Queue Type:              MULTI                              
  Node:                    1                                  
  Device Type:             GPU                                
  Cache Info:              
    L1:                      16(0x10) KB                        
  Chip ID:                 26751(0x687f)                      
  Cacheline Size:          64(0x40)                           
  Max Clock Freq. (MHz):   1590                               
  BDFID:                   1024                               
  Internal Node ID:        1                                  
  Compute Unit:            56                                 
  SIMDs per CU:            4                                  
  Shader Engines:          4                                  
  Shader Arrs. per Eng.:   1                                  
  WatchPts on Addr. Ranges:4                                  
  Features:                KERNEL_DISPATCH 
  Fast F16 Operation:      FALSE                              
  Wavefront Size:          64(0x40)                           
  Workgroup Max Size:      1024(0x400)                        
  Workgroup Max Size per Dimension:
    x                        1024(0x400)                        
    y                        1024(0x400)                        
    z                        1024(0x400)                        
  Max Waves Per CU:        40(0x28)                           
  Max Work-item Per CU:    2560(0xa00)                        
  Grid Max Size:           4294967295(0xffffffff)             
  Grid Max Size per Dimension:
    x                        4294967295(0xffffffff)             
    y                        4294967295(0xffffffff)             
    z                        4294967295(0xffffffff)             
  Max fbarriers/Workgrp:   32                                 
  Pool Info:               
    Pool 1                   
      Segment:                 GLOBAL; FLAGS: COARSE GRAINED      
      Size:                    8372224(0x7fc000) KB               
      Allocatable:             TRUE                               
      Alloc Granule:           4KB                                
      Alloc Alignment:         4KB                                
      Acessible by all:        FALSE                              
    Pool 2                   
      Segment:                 GROUP                              
      Size:                    64(0x40) KB                        
      Allocatable:             FALSE                              
      Alloc Granule:           0KB                                
      Alloc Alignment:         0KB                                
      Acessible by all:        FALSE                              
  ISA Info:                
    ISA 1                    
      Name:                    amdgcn-amd-amdhsa--gfx900          
      Machine Models:          HSA_MACHINE_MODEL_LARGE            
      Profiles:                HSA_PROFILE_BASE                   
      Default Rounding Mode:   NEAR                               
      Default Rounding Mode:   NEAR                               
      Fast f16:                TRUE                               
      Workgroup Max Size:      1024(0x400)                        
      Workgroup Max Size per Dimension:
        x                        1024(0x400)                        
        y                        1024(0x400)                        
        z                        1024(0x400)                        
      Grid Max Size:           4294967295(0xffffffff)             
      Grid Max Size per Dimension:
        x                        4294967295(0xffffffff)             
        y                        4294967295(0xffffffff)             
        z                        4294967295(0xffffffff)             
      FBarrier Max Size:       32                                 
*******                  
Agent 3                  
*******                  
  Name:                    gfx701                             
  Marketing Name:          Hawaii XT GL [FirePro W9100]       
  Vendor Name:             AMD                                
  Feature:                 KERNEL_DISPATCH                    
  Profile:                 BASE_PROFILE                       
  Float Round Mode:        NEAR                               
  Max Queue Number:        128(0x80)                          
  Queue Min Size:          4096(0x1000)                       
  Queue Max Size:          131072(0x20000)                    
  Queue Type:              MULTI                              
  Node:                    2                                  
  Device Type:             GPU                                
  Cache Info:              
    L1:                      16(0x10) KB                        
  Chip ID:                 26528(0x67a0)                      
  Cacheline Size:          64(0x40)                           
  Max Clock Freq. (MHz):   900                                
  BDFID:                   1280                               
  Internal Node ID:        2                                  
  Compute Unit:            44                                 
  SIMDs per CU:            4                                  
  Shader Engines:          4                                  
  Shader Arrs. per Eng.:   1                                  
  WatchPts on Addr. Ranges:4                                  
  Features:                KERNEL_DISPATCH 
  Fast F16 Operation:      FALSE                              
  Wavefront Size:          64(0x40)                           
  Workgroup Max Size:      1024(0x400)                        
  Workgroup Max Size per Dimension:
    x                        1024(0x400)                        
    y                        1024(0x400)                        
    z                        1024(0x400)                        
  Max Waves Per CU:        40(0x28)                           
  Max Work-item Per CU:    2560(0xa00)                        
  Grid Max Size:           4294967295(0xffffffff)             
  Grid Max Size per Dimension:
    x                        4294967295(0xffffffff)             
    y                        4294967295(0xffffffff)             
    z                        4294967295(0xffffffff)             
  Max fbarriers/Workgrp:   32                                 
  Pool Info:               
    Pool 1                   
      Segment:                 GLOBAL; FLAGS: COARSE GRAINED      
      Size:                    16777216(0x1000000) KB             
      Allocatable:             TRUE                               
      Alloc Granule:           4KB                                
      Alloc Alignment:         4KB                                
      Acessible by all:        FALSE                              
    Pool 2                   
      Segment:                 GROUP                              
      Size:                    64(0x40) KB                        
      Allocatable:             FALSE                              
      Alloc Granule:           0KB                                
      Alloc Alignment:         0KB                                
      Acessible by all:        FALSE                              
  ISA Info:                
    ISA 1                    
      Name:                    amdgcn-amd-amdhsa--gfx701          
      Machine Models:          HSA_MACHINE_MODEL_LARGE            
      Profiles:                HSA_PROFILE_BASE                   
      Default Rounding Mode:   NEAR                               
      Default Rounding Mode:   NEAR                               
      Fast f16:                FALSE                              
      Workgroup Max Size:      1024(0x400)                        
      Workgroup Max Size per Dimension:
        x                        1024(0x400)                        
        y                        1024(0x400)                        
        z                        1024(0x400)                        
      Grid Max Size:           4294967295(0xffffffff)             
      Grid Max Size per Dimension:
        x                        4294967295(0xffffffff)             
        y                        4294967295(0xffffffff)             
        z                        4294967295(0xffffffff)             
      FBarrier Max Size:       32                                 
*** Done ***             
$ lsb_release -a
No LSB modules are available.
Distributor ID:	Ubuntu
Description:	Ubuntu 18.04.1 LTS
Release:	18.04
Codename:	bionic
$ uname -r
4.15.0-60-generic
$ lsmod | grep amd
amdgpu               4681728  1
amdttm                 98304  1 amdgpu
amd_sched              28672  1 amdgpu
amdkcl                 28672  3 amd_sched,amdttm,amdgpu
amd_iommu_v2           20480  1 amdgpu
drm_kms_helper        172032  3 ast,amdgpu,amdkcl
i2c_algo_bit           16384  3 igb,ast,amdgpu
drm                   401408  9 drm_kms_helper,amd_sched,amdttm,ast,amdgpu,ttm
$ modinfo amdgpu
filename:       /lib/modules/4.15.0-60-generic/updates/dkms/amdgpu.ko
version:        5.2.4
license:        GPL and additional rights
description:    AMD GPU
author:         AMD linux driver team
firmware:       amdgpu/navi12_gpu_info.bin
firmware:       amdgpu/navi14_gpu_info.bin
firmware:       amdgpu/navi10_gpu_info.bin
firmware:       amdgpu/renoir_gpu_info.bin
firmware:       amdgpu/arcturus_gpu_info.bin
firmware:       amdgpu/raven2_gpu_info.bin
firmware:       amdgpu/picasso_gpu_info.bin
firmware:       amdgpu/raven_gpu_info.bin
firmware:       amdgpu/vega12_gpu_info.bin
firmware:       amdgpu/vega10_gpu_info.bin
firmware:       amdgpu/mullins_mec.bin
firmware:       amdgpu/mullins_rlc.bin
firmware:       amdgpu/mullins_ce.bin
firmware:       amdgpu/mullins_me.bin
firmware:       amdgpu/mullins_pfp.bin
firmware:       amdgpu/kabini_mec.bin
firmware:       amdgpu/kabini_rlc.bin
firmware:       amdgpu/kabini_ce.bin
firmware:       amdgpu/kabini_me.bin
firmware:       amdgpu/kabini_pfp.bin
firmware:       amdgpu/kaveri_mec2.bin
firmware:       amdgpu/kaveri_mec.bin
firmware:       amdgpu/kaveri_rlc.bin
firmware:       amdgpu/kaveri_ce.bin
firmware:       amdgpu/kaveri_me.bin
firmware:       amdgpu/kaveri_pfp.bin
firmware:       amdgpu/hawaii_mec.bin
firmware:       amdgpu/hawaii_rlc.bin
firmware:       amdgpu/hawaii_ce.bin
firmware:       amdgpu/hawaii_me.bin
firmware:       amdgpu/hawaii_pfp.bin
firmware:       amdgpu/bonaire_mec.bin
firmware:       amdgpu/bonaire_rlc.bin
firmware:       amdgpu/bonaire_ce.bin
firmware:       amdgpu/bonaire_me.bin
firmware:       amdgpu/bonaire_pfp.bin
firmware:       amdgpu/mullins_sdma1.bin
firmware:       amdgpu/mullins_sdma.bin
firmware:       amdgpu/kabini_sdma1.bin
firmware:       amdgpu/kabini_sdma.bin
firmware:       amdgpu/kaveri_sdma1.bin
firmware:       amdgpu/kaveri_sdma.bin
firmware:       amdgpu/hawaii_sdma1.bin
firmware:       amdgpu/hawaii_sdma.bin
firmware:       amdgpu/bonaire_sdma1.bin
firmware:       amdgpu/bonaire_sdma.bin
firmware:       amdgpu/si58_mc.bin
firmware:       amdgpu/hainan_mc.bin
firmware:       amdgpu/oland_mc.bin
firmware:       amdgpu/verde_mc.bin
firmware:       amdgpu/pitcairn_mc.bin
firmware:       amdgpu/tahiti_mc.bin
firmware:       amdgpu/hainan_rlc.bin
firmware:       amdgpu/hainan_ce.bin
firmware:       amdgpu/hainan_me.bin
firmware:       amdgpu/hainan_pfp.bin
firmware:       amdgpu/oland_rlc.bin
firmware:       amdgpu/oland_ce.bin
firmware:       amdgpu/oland_me.bin
firmware:       amdgpu/oland_pfp.bin
firmware:       amdgpu/verde_rlc.bin
firmware:       amdgpu/verde_ce.bin
firmware:       amdgpu/verde_me.bin
firmware:       amdgpu/verde_pfp.bin
firmware:       amdgpu/pitcairn_rlc.bin
firmware:       amdgpu/pitcairn_ce.bin
firmware:       amdgpu/pitcairn_me.bin
firmware:       amdgpu/pitcairn_pfp.bin
firmware:       amdgpu/tahiti_rlc.bin
firmware:       amdgpu/tahiti_ce.bin
firmware:       amdgpu/tahiti_me.bin
firmware:       amdgpu/tahiti_pfp.bin
firmware:       amdgpu/banks_k_2_smc.bin
firmware:       amdgpu/hainan_k_smc.bin
firmware:       amdgpu/hainan_smc.bin
firmware:       amdgpu/oland_k_smc.bin
firmware:       amdgpu/oland_smc.bin
firmware:       amdgpu/verde_k_smc.bin
firmware:       amdgpu/verde_smc.bin
firmware:       amdgpu/pitcairn_k_smc.bin
firmware:       amdgpu/pitcairn_smc.bin
firmware:       amdgpu/tahiti_smc.bin
firmware:       amdgpu/topaz_mc.bin
firmware:       amdgpu/hawaii_mc.bin
firmware:       amdgpu/bonaire_mc.bin
firmware:       amdgpu/polaris12_k_mc.bin
firmware:       amdgpu/polaris10_k_mc.bin
firmware:       amdgpu/polaris11_k_mc.bin
firmware:       amdgpu/polaris12_mc.bin
firmware:       amdgpu/polaris10_mc.bin
firmware:       amdgpu/polaris11_mc.bin
firmware:       amdgpu/tonga_mc.bin
firmware:       amdgpu/vega12_asd.bin
firmware:       amdgpu/vega12_sos.bin
firmware:       amdgpu/vega10_asd.bin
firmware:       amdgpu/vega10_sos.bin
firmware:       amdgpu/raven_ta.bin
firmware:       amdgpu/raven2_ta.bin
firmware:       amdgpu/picasso_ta.bin
firmware:       amdgpu/raven2_asd.bin
firmware:       amdgpu/picasso_asd.bin
firmware:       amdgpu/raven_asd.bin
firmware:       amdgpu/arcturus_ta.bin
firmware:       amdgpu/arcturus_asd.bin
firmware:       amdgpu/arcturus_sos.bin
firmware:       amdgpu/navi12_asd.bin
firmware:       amdgpu/navi12_sos.bin
firmware:       amdgpu/navi14_asd.bin
firmware:       amdgpu/navi14_sos.bin
firmware:       amdgpu/navi10_asd.bin
firmware:       amdgpu/navi10_sos.bin
firmware:       amdgpu/vega20_ta.bin
firmware:       amdgpu/vega20_asd.bin
firmware:       amdgpu/vega20_sos.bin
firmware:       amdgpu/renoir_asd.bin
firmware:       amdgpu/vegam_rlc.bin
firmware:       amdgpu/vegam_mec2.bin
firmware:       amdgpu/vegam_mec.bin
firmware:       amdgpu/vegam_me.bin
firmware:       amdgpu/vegam_pfp.bin
firmware:       amdgpu/vegam_ce.bin
firmware:       amdgpu/polaris12_rlc.bin
firmware:       amdgpu/polaris12_mec2_2.bin
firmware:       amdgpu/polaris12_mec2.bin
firmware:       amdgpu/polaris12_mec_2.bin
firmware:       amdgpu/polaris12_mec.bin
firmware:       amdgpu/polaris12_me_2.bin
firmware:       amdgpu/polaris12_me.bin
firmware:       amdgpu/polaris12_pfp_2.bin
firmware:       amdgpu/polaris12_pfp.bin
firmware:       amdgpu/polaris12_ce_2.bin
firmware:       amdgpu/polaris12_ce.bin
firmware:       amdgpu/polaris11_rlc.bin
firmware:       amdgpu/polaris11_mec2_2.bin
firmware:       amdgpu/polaris11_mec2.bin
firmware:       amdgpu/polaris11_mec_2.bin
firmware:       amdgpu/polaris11_mec.bin
firmware:       amdgpu/polaris11_me_2.bin
firmware:       amdgpu/polaris11_me.bin
firmware:       amdgpu/polaris11_pfp_2.bin
firmware:       amdgpu/polaris11_pfp.bin
firmware:       amdgpu/polaris11_ce_2.bin
firmware:       amdgpu/polaris11_ce.bin
firmware:       amdgpu/polaris10_rlc.bin
firmware:       amdgpu/polaris10_mec2_2.bin
firmware:       amdgpu/polaris10_mec2.bin
firmware:       amdgpu/polaris10_mec_2.bin
firmware:       amdgpu/polaris10_mec.bin
firmware:       amdgpu/polaris10_me_2.bin
firmware:       amdgpu/polaris10_me.bin
firmware:       amdgpu/polaris10_pfp_2.bin
firmware:       amdgpu/polaris10_pfp.bin
firmware:       amdgpu/polaris10_ce_2.bin
firmware:       amdgpu/polaris10_ce.bin
firmware:       amdgpu/fiji_rlc.bin
firmware:       amdgpu/fiji_mec2.bin
firmware:       amdgpu/fiji_mec.bin
firmware:       amdgpu/fiji_me.bin
firmware:       amdgpu/fiji_pfp.bin
firmware:       amdgpu/fiji_ce.bin
firmware:       amdgpu/topaz_rlc.bin
firmware:       amdgpu/topaz_mec.bin
firmware:       amdgpu/topaz_me.bin
firmware:       amdgpu/topaz_pfp.bin
firmware:       amdgpu/topaz_ce.bin
firmware:       amdgpu/tonga_rlc.bin
firmware:       amdgpu/tonga_mec2.bin
firmware:       amdgpu/tonga_mec.bin
firmware:       amdgpu/tonga_me.bin
firmware:       amdgpu/tonga_pfp.bin
firmware:       amdgpu/tonga_ce.bin
firmware:       amdgpu/stoney_rlc.bin
firmware:       amdgpu/stoney_mec.bin
firmware:       amdgpu/stoney_me.bin
firmware:       amdgpu/stoney_pfp.bin
firmware:       amdgpu/stoney_ce.bin
firmware:       amdgpu/carrizo_rlc.bin
firmware:       amdgpu/carrizo_mec2.bin
firmware:       amdgpu/carrizo_mec.bin
firmware:       amdgpu/carrizo_me.bin
firmware:       amdgpu/carrizo_pfp.bin
firmware:       amdgpu/carrizo_ce.bin
firmware:       amdgpu/renoir_rlc.bin
firmware:       amdgpu/renoir_mec2.bin
firmware:       amdgpu/renoir_mec.bin
firmware:       amdgpu/renoir_me.bin
firmware:       amdgpu/renoir_pfp.bin
firmware:       amdgpu/renoir_ce.bin
firmware:       amdgpu/arcturus_rlc.bin
firmware:       amdgpu/arcturus_mec2.bin
firmware:       amdgpu/arcturus_mec.bin
firmware:       amdgpu/raven_kicker_rlc.bin
firmware:       amdgpu/raven2_rlc.bin
firmware:       amdgpu/raven2_mec2.bin
firmware:       amdgpu/raven2_mec.bin
firmware:       amdgpu/raven2_me.bin
firmware:       amdgpu/raven2_pfp.bin
firmware:       amdgpu/raven2_ce.bin
firmware:       amdgpu/picasso_rlc_am4.bin
firmware:       amdgpu/picasso_rlc.bin
firmware:       amdgpu/picasso_mec2.bin
firmware:       amdgpu/picasso_mec.bin
firmware:       amdgpu/picasso_me.bin
firmware:       amdgpu/picasso_pfp.bin
firmware:       amdgpu/picasso_ce.bin
firmware:       amdgpu/raven_rlc.bin
firmware:       amdgpu/raven_mec2.bin
firmware:       amdgpu/raven_mec.bin
firmware:       amdgpu/raven_me.bin
firmware:       amdgpu/raven_pfp.bin
firmware:       amdgpu/raven_ce.bin
firmware:       amdgpu/vega20_rlc.bin
firmware:       amdgpu/vega20_mec2.bin
firmware:       amdgpu/vega20_mec.bin
firmware:       amdgpu/vega20_me.bin
firmware:       amdgpu/vega20_pfp.bin
firmware:       amdgpu/vega20_ce.bin
firmware:       amdgpu/vega12_rlc.bin
firmware:       amdgpu/vega12_mec2.bin
firmware:       amdgpu/vega12_mec.bin
firmware:       amdgpu/vega12_me.bin
firmware:       amdgpu/vega12_pfp.bin
firmware:       amdgpu/vega12_ce.bin
firmware:       amdgpu/vega10_rlc.bin
firmware:       amdgpu/vega10_mec2.bin
firmware:       amdgpu/vega10_mec.bin
firmware:       amdgpu/vega10_me.bin
firmware:       amdgpu/vega10_pfp.bin
firmware:       amdgpu/vega10_ce.bin
firmware:       amdgpu/navi12_rlc.bin
firmware:       amdgpu/navi12_mec2.bin
firmware:       amdgpu/navi12_mec.bin
firmware:       amdgpu/navi12_me.bin
firmware:       amdgpu/navi12_pfp.bin
firmware:       amdgpu/navi12_ce.bin
firmware:       amdgpu/navi14_rlc.bin
firmware:       amdgpu/navi14_mec2.bin
firmware:       amdgpu/navi14_mec.bin
firmware:       amdgpu/navi14_me.bin
firmware:       amdgpu/navi14_pfp.bin
firmware:       amdgpu/navi14_ce.bin
firmware:       amdgpu/navi14_mec2_wks.bin
firmware:       amdgpu/navi14_mec_wks.bin
firmware:       amdgpu/navi14_me_wks.bin
firmware:       amdgpu/navi14_pfp_wks.bin
firmware:       amdgpu/navi14_ce_wks.bin
firmware:       amdgpu/navi10_rlc.bin
firmware:       amdgpu/navi10_mec2.bin
firmware:       amdgpu/navi10_mec.bin
firmware:       amdgpu/navi10_me.bin
firmware:       amdgpu/navi10_pfp.bin
firmware:       amdgpu/navi10_ce.bin
firmware:       amdgpu/topaz_sdma1.bin
firmware:       amdgpu/topaz_sdma.bin
firmware:       amdgpu/vegam_sdma1.bin
firmware:       amdgpu/vegam_sdma.bin
firmware:       amdgpu/polaris12_sdma1.bin
firmware:       amdgpu/polaris12_sdma.bin
firmware:       amdgpu/polaris11_sdma1.bin
firmware:       amdgpu/polaris11_sdma.bin
firmware:       amdgpu/polaris10_sdma1.bin
firmware:       amdgpu/polaris10_sdma.bin
firmware:       amdgpu/stoney_sdma.bin
firmware:       amdgpu/fiji_sdma1.bin
firmware:       amdgpu/fiji_sdma.bin
firmware:       amdgpu/carrizo_sdma1.bin
firmware:       amdgpu/carrizo_sdma.bin
firmware:       amdgpu/tonga_sdma1.bin
firmware:       amdgpu/tonga_sdma.bin
firmware:       amdgpu/renoir_sdma.bin
firmware:       amdgpu/arcturus_sdma.bin
firmware:       amdgpu/raven2_sdma.bin
firmware:       amdgpu/picasso_sdma.bin
firmware:       amdgpu/raven_sdma.bin
firmware:       amdgpu/vega20_sdma1.bin
firmware:       amdgpu/vega20_sdma.bin
firmware:       amdgpu/vega12_sdma1.bin
firmware:       amdgpu/vega12_sdma.bin
firmware:       amdgpu/vega10_sdma1.bin
firmware:       amdgpu/vega10_sdma.bin
firmware:       amdgpu/navi12_sdma1.bin
firmware:       amdgpu/navi12_sdma.bin
firmware:       amdgpu/navi14_sdma1.bin
firmware:       amdgpu/navi14_sdma.bin
firmware:       amdgpu/navi10_sdma1.bin
firmware:       amdgpu/navi10_sdma.bin
firmware:       amdgpu/navi10_mes.bin
firmware:       amdgpu/vega20_uvd.bin
firmware:       amdgpu/vega12_uvd.bin
firmware:       amdgpu/vega10_uvd.bin
firmware:       amdgpu/vegam_uvd.bin
firmware:       amdgpu/polaris12_uvd.bin
firmware:       amdgpu/polaris11_uvd.bin
firmware:       amdgpu/polaris10_uvd.bin
firmware:       amdgpu/stoney_uvd.bin
firmware:       amdgpu/fiji_uvd.bin
firmware:       amdgpu/carrizo_uvd.bin
firmware:       amdgpu/tonga_uvd.bin
firmware:       amdgpu/mullins_uvd.bin
firmware:       amdgpu/hawaii_uvd.bin
firmware:       amdgpu/kaveri_uvd.bin
firmware:       amdgpu/kabini_uvd.bin
firmware:       amdgpu/bonaire_uvd.bin
firmware:       amdgpu/vega20_vce.bin
firmware:       amdgpu/vega12_vce.bin
firmware:       amdgpu/vega10_vce.bin
firmware:       amdgpu/vegam_vce.bin
firmware:       amdgpu/polaris12_vce.bin
firmware:       amdgpu/polaris11_vce.bin
firmware:       amdgpu/polaris10_vce.bin
firmware:       amdgpu/stoney_vce.bin
firmware:       amdgpu/fiji_vce.bin
firmware:       amdgpu/carrizo_vce.bin
firmware:       amdgpu/tonga_vce.bin
firmware:       amdgpu/mullins_vce.bin
firmware:       amdgpu/hawaii_vce.bin
firmware:       amdgpu/kaveri_vce.bin
firmware:       amdgpu/kabini_vce.bin
firmware:       amdgpu/bonaire_vce.bin
firmware:       amdgpu/navi12_vcn.bin
firmware:       amdgpu/navi14_vcn.bin
firmware:       amdgpu/navi10_vcn.bin
firmware:       amdgpu/renoir_vcn.bin
firmware:       amdgpu/arcturus_vcn.bin
firmware:       amdgpu/raven2_vcn.bin
firmware:       amdgpu/picasso_vcn.bin
firmware:       amdgpu/raven_vcn.bin
firmware:       amdgpu/vega20_smc.bin
firmware:       amdgpu/vega12_smc.bin
firmware:       amdgpu/vega10_acg_smc.bin
firmware:       amdgpu/vega10_smc.bin
firmware:       amdgpu/vegam_smc.bin
firmware:       amdgpu/polaris12_k_smc.bin
firmware:       amdgpu/polaris12_smc.bin
firmware:       amdgpu/polaris11_k2_smc.bin
firmware:       amdgpu/polaris11_k_smc.bin
firmware:       amdgpu/polaris11_smc_sk.bin
firmware:       amdgpu/polaris11_smc.bin
firmware:       amdgpu/polaris10_k2_smc.bin
firmware:       amdgpu/polaris10_k_smc.bin
firmware:       amdgpu/polaris10_smc_sk.bin
firmware:       amdgpu/polaris10_smc.bin
firmware:       amdgpu/fiji_smc.bin
firmware:       amdgpu/tonga_k_smc.bin
firmware:       amdgpu/tonga_smc.bin
firmware:       amdgpu/topaz_k_smc.bin
firmware:       amdgpu/topaz_smc.bin
firmware:       amdgpu/hawaii_k_smc.bin
firmware:       amdgpu/hawaii_smc.bin
firmware:       amdgpu/bonaire_k_smc.bin
firmware:       amdgpu/bonaire_smc.bin
firmware:       amdgpu/navi12_smc.bin
firmware:       amdgpu/navi14_smc.bin
firmware:       amdgpu/navi10_smc.bin
firmware:       amdgpu/arcturus_smc.bin
firmware:       amdgpu/vega20_smc.bin
firmware:       amdgpu/raven_dmcu.bin
firmware:       amdgpu/renoir_dmcub.bin
srcversion:     533BB7E5866E52F63B9ACCB
alias:          pci:v00001002d00007362sv*sd*bc*sc*i*
alias:          pci:v00001002d00007360sv*sd*bc*sc*i*
alias:          pci:v00001002d00001636sv*sd*bc*sc*i*
alias:          pci:v00001002d0000734Fsv*sd*bc*sc*i*
alias:          pci:v00001002d00007347sv*sd*bc*sc*i*
alias:          pci:v00001002d00007341sv*sd*bc*sc*i*
alias:          pci:v00001002d00007340sv*sd*bc*sc*i*
alias:          pci:v00001002d0000731Fsv*sd*bc*sc*i*
alias:          pci:v00001002d0000731Bsv*sd*bc*sc*i*
alias:          pci:v00001002d0000731Asv*sd*bc*sc*i*
alias:          pci:v00001002d00007319sv*sd*bc*sc*i*
alias:          pci:v00001002d00007318sv*sd*bc*sc*i*
alias:          pci:v00001002d00007312sv*sd*bc*sc*i*
alias:          pci:v00001002d00007310sv*sd*bc*sc*i*
alias:          pci:v00001002d00007390sv*sd*bc*sc*i*
alias:          pci:v00001002d0000738Esv*sd*bc*sc*i*
alias:          pci:v00001002d00007388sv*sd*bc*sc*i*
alias:          pci:v00001002d0000738Csv*sd*bc*sc*i*
alias:          pci:v00001002d000015D8sv*sd*bc*sc*i*
alias:          pci:v00001002d000015DDsv*sd*bc*sc*i*
alias:          pci:v00001002d000066AFsv*sd*bc*sc*i*
alias:          pci:v00001002d000066A7sv*sd*bc*sc*i*
alias:          pci:v00001002d000066A4sv*sd*bc*sc*i*
alias:          pci:v00001002d000066A3sv*sd*bc*sc*i*
alias:          pci:v00001002d000066A2sv*sd*bc*sc*i*
alias:          pci:v00001002d000066A1sv*sd*bc*sc*i*
alias:          pci:v00001002d000066A0sv*sd*bc*sc*i*
alias:          pci:v00001002d000069AFsv*sd*bc*sc*i*
alias:          pci:v00001002d000069A3sv*sd*bc*sc*i*
alias:          pci:v00001002d000069A2sv*sd*bc*sc*i*
alias:          pci:v00001002d000069A1sv*sd*bc*sc*i*
alias:          pci:v00001002d000069A0sv*sd*bc*sc*i*
alias:          pci:v00001002d0000687Fsv*sd*bc*sc*i*
alias:          pci:v00001002d0000686Fsv*sd*bc*sc*i*
alias:          pci:v00001002d0000686Esv*sd*bc*sc*i*
alias:          pci:v00001002d0000686Dsv*sd*bc*sc*i*
alias:          pci:v00001002d0000686Csv*sd*bc*sc*i*
alias:          pci:v00001002d0000686Bsv*sd*bc*sc*i*
alias:          pci:v00001002d0000686Asv*sd*bc*sc*i*
alias:          pci:v00001002d00006869sv*sd*bc*sc*i*
alias:          pci:v00001002d00006868sv*sd*bc*sc*i*
alias:          pci:v00001002d00006867sv*sd*bc*sc*i*
alias:          pci:v00001002d00006864sv*sd*bc*sc*i*
alias:          pci:v00001002d00006863sv*sd*bc*sc*i*
alias:          pci:v00001002d00006862sv*sd*bc*sc*i*
alias:          pci:v00001002d00006861sv*sd*bc*sc*i*
alias:          pci:v00001002d00006860sv*sd*bc*sc*i*
alias:          pci:v00001002d0000694Fsv*sd*bc*sc*i*
alias:          pci:v00001002d0000694Esv*sd*bc*sc*i*
alias:          pci:v00001002d0000694Csv*sd*bc*sc*i*
alias:          pci:v00001002d0000699Fsv*sd*bc*sc*i*
alias:          pci:v00001002d00006997sv*sd*bc*sc*i*
alias:          pci:v00001002d00006995sv*sd*bc*sc*i*
alias:          pci:v00001002d00006987sv*sd*bc*sc*i*
alias:          pci:v00001002d00006986sv*sd*bc*sc*i*
alias:          pci:v00001002d00006985sv*sd*bc*sc*i*
alias:          pci:v00001002d00006981sv*sd*bc*sc*i*
alias:          pci:v00001002d00006980sv*sd*bc*sc*i*
alias:          pci:v00001002d00006FDFsv*sd*bc*sc*i*
alias:          pci:v00001002d000067CFsv*sd*bc*sc*i*
alias:          pci:v00001002d000067CCsv*sd*bc*sc*i*
alias:          pci:v00001002d000067CAsv*sd*bc*sc*i*
alias:          pci:v00001002d000067C9sv*sd*bc*sc*i*
alias:          pci:v00001002d000067C8sv*sd*bc*sc*i*
alias:          pci:v00001002d000067DFsv*sd*bc*sc*i*
alias:          pci:v00001002d000067D0sv*sd*bc*sc*i*
alias:          pci:v00001002d000067C7sv*sd*bc*sc*i*
alias:          pci:v00001002d000067C4sv*sd*bc*sc*i*
alias:          pci:v00001002d000067C2sv*sd*bc*sc*i*
alias:          pci:v00001002d000067C1sv*sd*bc*sc*i*
alias:          pci:v00001002d000067C0sv*sd*bc*sc*i*
alias:          pci:v00001002d000067E9sv*sd*bc*sc*i*
alias:          pci:v00001002d000067E7sv*sd*bc*sc*i*
alias:          pci:v00001002d000067E1sv*sd*bc*sc*i*
alias:          pci:v00001002d000067FFsv*sd*bc*sc*i*
alias:          pci:v00001002d000067EFsv*sd*bc*sc*i*
alias:          pci:v00001002d000067EBsv*sd*bc*sc*i*
alias:          pci:v00001002d000067E8sv*sd*bc*sc*i*
alias:          pci:v00001002d000067E3sv*sd*bc*sc*i*
alias:          pci:v00001002d000067E0sv*sd*bc*sc*i*
alias:          pci:v00001002d000098E4sv*sd*bc*sc*i*
alias:          pci:v00001002d00009877sv*sd*bc*sc*i*
alias:          pci:v00001002d00009876sv*sd*bc*sc*i*
alias:          pci:v00001002d00009875sv*sd*bc*sc*i*
alias:          pci:v00001002d00009874sv*sd*bc*sc*i*
alias:          pci:v00001002d00009870sv*sd*bc*sc*i*
alias:          pci:v00001002d0000730Fsv*sd*bc*sc*i*
alias:          pci:v00001002d00007300sv*sd*bc*sc*i*
alias:          pci:v00001002d00006939sv*sd*bc*sc*i*
alias:          pci:v00001002d00006938sv*sd*bc*sc*i*
alias:          pci:v00001002d00006930sv*sd*bc*sc*i*
alias:          pci:v00001002d0000692Fsv*sd*bc*sc*i*
alias:          pci:v00001002d0000692Bsv*sd*bc*sc*i*
alias:          pci:v00001002d00006929sv*sd*bc*sc*i*
alias:          pci:v00001002d00006928sv*sd*bc*sc*i*
alias:          pci:v00001002d00006921sv*sd*bc*sc*i*
alias:          pci:v00001002d00006920sv*sd*bc*sc*i*
alias:          pci:v00001002d00006907sv*sd*bc*sc*i*
alias:          pci:v00001002d00006903sv*sd*bc*sc*i*
alias:          pci:v00001002d00006902sv*sd*bc*sc*i*
alias:          pci:v00001002d00006901sv*sd*bc*sc*i*
alias:          pci:v00001002d00006900sv*sd*bc*sc*i*
alias:          pci:v00001002d0000985Fsv*sd*bc*sc*i*
alias:          pci:v00001002d0000985Esv*sd*bc*sc*i*
alias:          pci:v00001002d0000985Dsv*sd*bc*sc*i*
alias:          pci:v00001002d0000985Csv*sd*bc*sc*i*
alias:          pci:v00001002d0000985Bsv*sd*bc*sc*i*
alias:          pci:v00001002d0000985Asv*sd*bc*sc*i*
alias:          pci:v00001002d00009859sv*sd*bc*sc*i*
alias:          pci:v00001002d00009858sv*sd*bc*sc*i*
alias:          pci:v00001002d00009857sv*sd*bc*sc*i*
alias:          pci:v00001002d00009856sv*sd*bc*sc*i*
alias:          pci:v00001002d00009855sv*sd*bc*sc*i*
alias:          pci:v00001002d00009854sv*sd*bc*sc*i*
alias:          pci:v00001002d00009853sv*sd*bc*sc*i*
alias:          pci:v00001002d00009852sv*sd*bc*sc*i*
alias:          pci:v00001002d00009851sv*sd*bc*sc*i*
alias:          pci:v00001002d00009850sv*sd*bc*sc*i*
alias:          pci:v00001002d0000983Fsv*sd*bc*sc*i*
alias:          pci:v00001002d0000983Esv*sd*bc*sc*i*
alias:          pci:v00001002d0000983Dsv*sd*bc*sc*i*
alias:          pci:v00001002d0000983Csv*sd*bc*sc*i*
alias:          pci:v00001002d0000983Bsv*sd*bc*sc*i*
alias:          pci:v00001002d0000983Asv*sd*bc*sc*i*
alias:          pci:v00001002d00009839sv*sd*bc*sc*i*
alias:          pci:v00001002d00009838sv*sd*bc*sc*i*
alias:          pci:v00001002d00009837sv*sd*bc*sc*i*
alias:          pci:v00001002d00009836sv*sd*bc*sc*i*
alias:          pci:v00001002d00009835sv*sd*bc*sc*i*
alias:          pci:v00001002d00009834sv*sd*bc*sc*i*
alias:          pci:v00001002d00009833sv*sd*bc*sc*i*
alias:          pci:v00001002d00009832sv*sd*bc*sc*i*
alias:          pci:v00001002d00009831sv*sd*bc*sc*i*
alias:          pci:v00001002d00009830sv*sd*bc*sc*i*
alias:          pci:v00001002d000067BEsv*sd*bc*sc*i*
alias:          pci:v00001002d000067BAsv*sd*bc*sc*i*
alias:          pci:v00001002d000067B9sv*sd*bc*sc*i*
alias:          pci:v00001002d000067B8sv*sd*bc*sc*i*
alias:          pci:v00001002d000067B1sv*sd*bc*sc*i*
alias:          pci:v00001002d000067B0sv*sd*bc*sc*i*
alias:          pci:v00001002d000067AAsv*sd*bc*sc*i*
alias:          pci:v00001002d000067A9sv*sd*bc*sc*i*
alias:          pci:v00001002d000067A8sv*sd*bc*sc*i*
alias:          pci:v00001002d000067A2sv*sd*bc*sc*i*
alias:          pci:v00001002d000067A1sv*sd*bc*sc*i*
alias:          pci:v00001002d000067A0sv*sd*bc*sc*i*
alias:          pci:v00001002d0000665Fsv*sd*bc*sc*i*
alias:          pci:v00001002d0000665Dsv*sd*bc*sc*i*
alias:          pci:v00001002d0000665Csv*sd*bc*sc*i*
alias:          pci:v00001002d00006658sv*sd*bc*sc*i*
alias:          pci:v00001002d00006651sv*sd*bc*sc*i*
alias:          pci:v00001002d00006650sv*sd*bc*sc*i*
alias:          pci:v00001002d00006649sv*sd*bc*sc*i*
alias:          pci:v00001002d00006647sv*sd*bc*sc*i*
alias:          pci:v00001002d00006646sv*sd*bc*sc*i*
alias:          pci:v00001002d00006641sv*sd*bc*sc*i*
alias:          pci:v00001002d00006640sv*sd*bc*sc*i*
alias:          pci:v00001002d0000131Dsv*sd*bc*sc*i*
alias:          pci:v00001002d0000131Csv*sd*bc*sc*i*
alias:          pci:v00001002d0000131Bsv*sd*bc*sc*i*
alias:          pci:v00001002d00001318sv*sd*bc*sc*i*
alias:          pci:v00001002d00001317sv*sd*bc*sc*i*
alias:          pci:v00001002d00001316sv*sd*bc*sc*i*
alias:          pci:v00001002d00001315sv*sd*bc*sc*i*
alias:          pci:v00001002d00001313sv*sd*bc*sc*i*
alias:          pci:v00001002d00001312sv*sd*bc*sc*i*
alias:          pci:v00001002d00001311sv*sd*bc*sc*i*
alias:          pci:v00001002d00001310sv*sd*bc*sc*i*
alias:          pci:v00001002d0000130Fsv*sd*bc*sc*i*
alias:          pci:v00001002d0000130Esv*sd*bc*sc*i*
alias:          pci:v00001002d0000130Dsv*sd*bc*sc*i*
alias:          pci:v00001002d0000130Csv*sd*bc*sc*i*
alias:          pci:v00001002d0000130Bsv*sd*bc*sc*i*
alias:          pci:v00001002d0000130Asv*sd*bc*sc*i*
alias:          pci:v00001002d00001309sv*sd*bc*sc*i*
alias:          pci:v00001002d00001307sv*sd*bc*sc*i*
alias:          pci:v00001002d00001306sv*sd*bc*sc*i*
alias:          pci:v00001002d00001305sv*sd*bc*sc*i*
alias:          pci:v00001002d00001304sv*sd*bc*sc*i*
alias:          pci:v00001002d0000666Fsv*sd*bc*sc*i*
alias:          pci:v00001002d00006667sv*sd*bc*sc*i*
alias:          pci:v00001002d00006665sv*sd*bc*sc*i*
alias:          pci:v00001002d00006664sv*sd*bc*sc*i*
alias:          pci:v00001002d00006663sv*sd*bc*sc*i*
alias:          pci:v00001002d00006660sv*sd*bc*sc*i*
alias:          pci:v00001002d0000683Fsv*sd*bc*sc*i*
alias:          pci:v00001002d0000683Dsv*sd*bc*sc*i*
alias:          pci:v00001002d0000683Bsv*sd*bc*sc*i*
alias:          pci:v00001002d00006839sv*sd*bc*sc*i*
alias:          pci:v00001002d00006838sv*sd*bc*sc*i*
alias:          pci:v00001002d00006837sv*sd*bc*sc*i*
alias:          pci:v00001002d00006835sv*sd*bc*sc*i*
alias:          pci:v00001002d00006831sv*sd*bc*sc*i*
alias:          pci:v00001002d00006830sv*sd*bc*sc*i*
alias:          pci:v00001002d0000682Fsv*sd*bc*sc*i*
alias:          pci:v00001002d0000682Dsv*sd*bc*sc*i*
alias:          pci:v00001002d0000682Csv*sd*bc*sc*i*
alias:          pci:v00001002d0000682Bsv*sd*bc*sc*i*
alias:          pci:v00001002d0000682Asv*sd*bc*sc*i*
alias:          pci:v00001002d00006829sv*sd*bc*sc*i*
alias:          pci:v00001002d00006828sv*sd*bc*sc*i*
alias:          pci:v00001002d00006827sv*sd*bc*sc*i*
alias:          pci:v00001002d00006826sv*sd*bc*sc*i*
alias:          pci:v00001002d00006825sv*sd*bc*sc*i*
alias:          pci:v00001002d00006824sv*sd*bc*sc*i*
alias:          pci:v00001002d00006823sv*sd*bc*sc*i*
alias:          pci:v00001002d00006822sv*sd*bc*sc*i*
alias:          pci:v00001002d00006821sv*sd*bc*sc*i*
alias:          pci:v00001002d00006820sv*sd*bc*sc*i*
alias:          pci:v00001002d00006631sv*sd*bc*sc*i*
alias:          pci:v00001002d00006623sv*sd*bc*sc*i*
alias:          pci:v00001002d00006621sv*sd*bc*sc*i*
alias:          pci:v00001002d00006620sv*sd*bc*sc*i*
alias:          pci:v00001002d00006617sv*sd*bc*sc*i*
alias:          pci:v00001002d00006613sv*sd*bc*sc*i*
alias:          pci:v00001002d00006611sv*sd*bc*sc*i*
alias:          pci:v00001002d00006610sv*sd*bc*sc*i*
alias:          pci:v00001002d00006608sv*sd*bc*sc*i*
alias:          pci:v00001002d00006607sv*sd*bc*sc*i*
alias:          pci:v00001002d00006606sv*sd*bc*sc*i*
alias:          pci:v00001002d00006605sv*sd*bc*sc*i*
alias:          pci:v00001002d00006604sv*sd*bc*sc*i*
alias:          pci:v00001002d00006603sv*sd*bc*sc*i*
alias:          pci:v00001002d00006602sv*sd*bc*sc*i*
alias:          pci:v00001002d00006601sv*sd*bc*sc*i*
alias:          pci:v00001002d00006600sv*sd*bc*sc*i*
alias:          pci:v00001002d00006819sv*sd*bc*sc*i*
alias:          pci:v00001002d00006818sv*sd*bc*sc*i*
alias:          pci:v00001002d00006817sv*sd*bc*sc*i*
alias:          pci:v00001002d00006816sv*sd*bc*sc*i*
alias:          pci:v00001002d00006811sv*sd*bc*sc*i*
alias:          pci:v00001002d00006810sv*sd*bc*sc*i*
alias:          pci:v00001002d00006809sv*sd*bc*sc*i*
alias:          pci:v00001002d00006808sv*sd*bc*sc*i*
alias:          pci:v00001002d00006806sv*sd*bc*sc*i*
alias:          pci:v00001002d00006802sv*sd*bc*sc*i*
alias:          pci:v00001002d00006801sv*sd*bc*sc*i*
alias:          pci:v00001002d00006800sv*sd*bc*sc*i*
alias:          pci:v00001002d0000679Fsv*sd*bc*sc*i*
alias:          pci:v00001002d0000679Esv*sd*bc*sc*i*
alias:          pci:v00001002d0000679Bsv*sd*bc*sc*i*
alias:          pci:v00001002d0000679Asv*sd*bc*sc*i*
alias:          pci:v00001002d00006799sv*sd*bc*sc*i*
alias:          pci:v00001002d00006798sv*sd*bc*sc*i*
alias:          pci:v00001002d00006792sv*sd*bc*sc*i*
alias:          pci:v00001002d00006791sv*sd*bc*sc*i*
alias:          pci:v00001002d00006790sv*sd*bc*sc*i*
alias:          pci:v00001002d0000678Asv*sd*bc*sc*i*
alias:          pci:v00001002d00006788sv*sd*bc*sc*i*
alias:          pci:v00001002d00006784sv*sd*bc*sc*i*
alias:          pci:v00001002d00006780sv*sd*bc*sc*i*
depends:        amdttm,drm_kms_helper,drm,amdkcl,amd_iommu_v2,amd-sched,i2c-algo-bit
retpoline:      Y
name:           amdgpu
vermagic:       4.15.0-60-generic SMP mod_unload 
parm:           vramlimit:Restrict VRAM for testing, in megabytes (int)
parm:           vis_vramlimit:Restrict visible VRAM for testing, in megabytes (int)
parm:           gartsize:Size of GART to setup in megabytes (32, 64, etc., -1=auto) (uint)
parm:           gttsize:Size of the GTT domain in megabytes (-1 = auto) (int)
parm:           moverate:Maximum buffer migration rate in MB/s. (32, 64, etc., -1=auto, 0=1=disabled) (int)
parm:           benchmark:Run benchmark (int)
parm:           test:Run tests (int)
parm:           audio:Audio enable (-1 = auto, 0 = disable, 1 = enable) (int)
parm:           disp_priority:Display Priority (0 = auto, 1 = normal, 2 = high) (int)
parm:           hw_i2c:hw i2c engine enable (0 = disable) (int)
parm:           pcie_gen2:PCIE Gen2 mode (-1 = auto, 0 = disable, 1 = enable) (int)
parm:           msi:MSI support (1 = enable, 0 = disable, -1 = auto) (int)
parm:           lockup_timeout:GPU lockup timeout in ms (default: for bare metal 10000 for non-compute jobs and infinity timeout for compute jobs; for passthrough or sriov, 10000 for all jobs. 0: keep default value. negative: infinity timeout), format: for bare metal [Non-Compute] or [GFX,Compute,SDMA,Video]; for passthrough or sriov [all jobs] or [GFX,Compute,SDMA,Video]. (string)
parm:           dpm:DPM support (1 = enable, 0 = disable, -1 = auto) (int)
parm:           fw_load_type:firmware loading type (0 = direct, 1 = SMU, 2 = PSP, -1 = auto) (int)
parm:           aspm:ASPM support (1 = enable, 0 = disable, -1 = auto) (int)
parm:           runpm:PX runtime pm (1 = force enable, 0 = disable, -1 = PX only default) (int)
parm:           ip_block_mask:IP Block Mask (all blocks enabled (default)) (uint)
parm:           bapm:BAPM support (1 = enable, 0 = disable, -1 = auto) (int)
parm:           deep_color:Deep Color support (1 = enable, 0 = disable (default)) (int)
parm:           vm_size:VM address space size in gigabytes (default 64GB) (int)
parm:           vm_fragment_size:VM fragment size in bits (4, 5, etc. 4 = 64K (default), Max 9 = 2M) (int)
parm:           vm_block_size:VM page table size in bits (default depending on vm_size) (int)
parm:           vm_fault_stop:Stop on VM fault (0 = never (default), 1 = print first, 2 = always) (int)
parm:           vm_debug:Debug VM handling (0 = disabled (default), 1 = enabled) (int)
parm:           vm_update_mode:VM update using CPU (0 = never (default except for large BAR(LB)), 1 = Graphics only, 2 = Compute only (default for LB), 3 = Both (int)
parm:           exp_hw_support:experimental hw support (1 = enable, 0 = disable (default)) (int)
parm:           dc:Display Core driver (1 = enable, 0 = disable, -1 = auto (default)) (int)
parm:           sched_jobs:the max number of jobs supported in the sw queue (default 32) (int)
parm:           sched_hw_submission:the max number of HW submissions (default 2) (int)
parm:           ppfeaturemask:all power features enabled (default)) (uint)
parm:           no_evict:Support pinning request from user space (1 = enable, 0 = disable (default)) (int)
parm:           direct_gma_size:Direct GMA size in megabytes (max 96MB) (int)
parm:           ssg:SSG support (1 = enable, 0 = disable (default)) (int)
parm:           forcelongtraining:force memory long training (uint)
parm:           pcie_gen_cap:PCIE Gen Caps (0: autodetect (default)) (uint)
parm:           pcie_lane_cap:PCIE Lane Caps (0: autodetect (default)) (uint)
parm:           cg_mask:Clockgating flags mask (0 = disable clock gating) (uint)
parm:           pg_mask:Powergating flags mask (0 = disable power gating) (uint)
parm:           sdma_phase_quantum:SDMA context switch phase quantum (x 1K GPU clock cycles, 0 = no change (default 32)) (uint)
parm:           disable_cu:Disable CUs (se.sh.cu,...) (charp)
parm:           virtual_display:Enable virtual display feature (the virtual_display will be set like xxxx:xx:xx.x,x;xxxx:xx:xx.x,x) (charp)
parm:           job_hang_limit:how much time allow a job hang and not drop it (default 0) (int)
parm:           lbpw:Load Balancing Per Watt (LBPW) support (1 = enable, 0 = disable, -1 = auto) (int)
parm:           compute_multipipe:Force compute queues to be spread across pipes (1 = enable, 0 = disable, -1 = auto) (int)
parm:           gpu_recovery:Enable GPU recovery mechanism, (1 = enable, 0 = disable, -1 = auto) (int)
parm:           emu_mode:Emulation mode, (1 = enable, 0 = disable) (int)
parm:           ras_enable:Enable RAS features on the GPU (0 = disable, 1 = enable, -1 = auto (default)) (int)
parm:           ras_mask:Mask of RAS features to enable (default 0xffffffff), only valid when ras_enable == 1 (uint)
parm:           si_support:SI support (1 = enabled (default), 0 = disabled) (int)
parm:           cik_support:CIK support (1 = enabled (default), 0 = disabled) (int)
parm:           smu_memory_pool_size:reserve gtt for smu debug usage, 0 = disable,0x1 = 256Mbyte, 0x2 = 512Mbyte, 0x4 = 1 Gbyte, 0x8 = 2GByte (uint)
parm:           async_gfx_ring:Asynchronous GFX rings that could be configured with either different priorities (HP3D ring and LP3D ring), or equal priorities (0 = disabled, 1 = enabled (default)) (int)
parm:           mcbp:Enable Mid-command buffer preemption (0 = disabled (default), 1 = enabled) (int)
parm:           discovery:Allow driver to discover hardware IPs from IP Discovery table at the top of VRAM (int)
parm:           mes:Enable Micro Engine Scheduler (0 = disabled (default), 1 = enabled) (int)
parm:           noretry:Disable retry faults (0 = retry enabled, 1 = retry disabled (default)) (int)
parm:           force_asic_type:A non negative value used to specify the asic type for all supported GPUs (int)
parm:           sched_policy:Scheduling policy (0 = HWS (Default), 1 = HWS without over-subscription, 2 = Non-HWS (Used for debugging only) (int)
parm:           hws_max_conc_proc:Max # processes HWS can execute concurrently when sched_policy=0 (0 = no concurrency, #VMIDs for KFD = Maximum(default)) (int)
parm:           cwsr_enable:CWSR enable (0 = Off, 1 = On (Default)) (int)
parm:           max_num_of_queues_per_device:Maximum number of supported queues per device (1 = Minimum, 4096 = default) (int)
parm:           send_sigterm:Send sigterm to HSA process on unhandled exception (0 = disable, 1 = enable) (int)
parm:           debug_largebar:Debug large-bar flag used to simulate large-bar capability on non-large bar machine (0 = disable, 1 = enable) (int)
parm:           ignore_crat:Ignore CRAT table during KFD initialization (0 = use CRAT (default), 1 = ignore CRAT) (int)
parm:           halt_if_hws_hang:Halt if HWS hang is detected (0 = off (default), 1 = on) (int)
parm:           hws_gws_support:MEC FW support gws barriers (false = not supported (Default), true = supported) (bool)
parm:           queue_preemption_timeout_ms:queue preemption timeout in ms (1 = Minimum, 9000 = default) (int)
parm:           priv_cp_queues:Enable privileged mode for CP queues (0 = off (default), 1 = on) (int)
parm:           keep_idle_process_evicted:Restore evicted process only if queues are active (N = off(default), Y = on) (bool)
parm:           pcie_p2p:Enable PCIe P2P (requires large-BAR). (N = off, Y = on(default)) (bool)
parm:           dcfeaturemask:all stable DC features enabled (default)) (uint)
parm:           abmlevel:ABM level (0 = off (default), 1-4 = backlight reduction level)  (uint)
parm:           tmz:Enable TMZ feature (-1 = auto, 0 = off (default), 1 = on) (int)
lspci -n
00:00.0 0600: 8086:6f00 (rev 01)
00:01.0 0604: 8086:6f02 (rev 01)
00:02.0 0604: 8086:6f04 (rev 01)
00:03.0 0604: 8086:6f08 (rev 01)
00:04.0 0880: 8086:6f20 (rev 01)
00:04.1 0880: 8086:6f21 (rev 01)
00:04.2 0880: 8086:6f22 (rev 01)
00:04.3 0880: 8086:6f23 (rev 01)
00:04.4 0880: 8086:6f24 (rev 01)
00:04.5 0880: 8086:6f25 (rev 01)
00:04.6 0880: 8086:6f26 (rev 01)
00:04.7 0880: 8086:6f27 (rev 01)
00:05.0 0880: 8086:6f28 (rev 01)
00:05.1 0880: 8086:6f29 (rev 01)
00:05.2 0880: 8086:6f2a (rev 01)
00:05.4 0800: 8086:6f2c (rev 01)
00:11.0 ff00: 8086:8d7c (rev 05)
00:11.4 0106: 8086:8d62 (rev 05)
00:14.0 0c03: 8086:8d31 (rev 05)
00:16.0 0780: 8086:8d3a (rev 05)
00:16.1 0780: 8086:8d3b (rev 05)
00:1a.0 0c03: 8086:8d2d (rev 05)
00:1c.0 0604: 8086:8d10 (rev d5)
00:1c.2 0604: 8086:8d14 (rev d5)
00:1c.4 0604: 8086:8d18 (rev d5)
00:1d.0 0c03: 8086:8d26 (rev 05)
00:1f.0 0601: 8086:8d44 (rev 05)
00:1f.2 0106: 8086:8d02 (rev 05)
00:1f.3 0c05: 8086:8d22 (rev 05)
02:00.0 0604: 1022:1470 (rev c3)
03:00.0 0604: 1022:1471
04:00.0 0300: 1002:687f (rev c3)
04:00.1 0403: 1002:aaf8
05:00.0 0380: 1002:67a0
07:00.0 0604: 1a03:1150 (rev 03)
08:00.0 0300: 1a03:2000 (rev 30)
09:00.0 0200: 8086:1521 (rev 01)
09:00.1 0200: 8086:1521 (rev 01)
ff:0b.0 0880: 8086:6f81 (rev 01)
ff:0b.1 1101: 8086:6f36 (rev 01)
ff:0b.2 1101: 8086:6f37 (rev 01)
ff:0b.3 0880: 8086:6f76 (rev 01)
ff:0c.0 0880: 8086:6fe0 (rev 01)
ff:0c.1 0880: 8086:6fe1 (rev 01)
ff:0c.2 0880: 8086:6fe2 (rev 01)
ff:0c.3 0880: 8086:6fe3 (rev 01)
ff:0c.4 0880: 8086:6fe4 (rev 01)
ff:0c.5 0880: 8086:6fe5 (rev 01)
ff:0c.6 0880: 8086:6fe6 (rev 01)
ff:0c.7 0880: 8086:6fe7 (rev 01)
ff:0f.0 0880: 8086:6ff8 (rev 01)
ff:0f.1 0880: 8086:6ff9 (rev 01)
ff:0f.4 0880: 8086:6ffc (rev 01)
ff:0f.5 0880: 8086:6ffd (rev 01)
ff:0f.6 0880: 8086:6ffe (rev 01)
ff:10.0 0880: 8086:6f1d (rev 01)
ff:10.1 1101: 8086:6f34 (rev 01)
ff:10.5 0880: 8086:6f1e (rev 01)
ff:10.6 1101: 8086:6f7d (rev 01)
ff:10.7 0880: 8086:6f1f (rev 01)
ff:12.0 0880: 8086:6fa0 (rev 01)
ff:12.1 1101: 8086:6f30 (rev 01)
ff:13.0 0880: 8086:6fa8 (rev 01)
ff:13.1 0880: 8086:6f71 (rev 01)
ff:13.2 0880: 8086:6faa (rev 01)
ff:13.3 0880: 8086:6fab (rev 01)
ff:13.4 0880: 8086:6fac (rev 01)
ff:13.5 0880: 8086:6fad (rev 01)
ff:13.6 0880: 8086:6fae (rev 01)
ff:13.7 0880: 8086:6faf (rev 01)
ff:14.0 0880: 8086:6fb0 (rev 01)
ff:14.1 0880: 8086:6fb1 (rev 01)
ff:14.2 0880: 8086:6fb2 (rev 01)
ff:14.3 0880: 8086:6fb3 (rev 01)
ff:14.4 0880: 8086:6fbc (rev 01)
ff:14.5 0880: 8086:6fbd (rev 01)
ff:14.6 0880: 8086:6fbe (rev 01)
ff:14.7 0880: 8086:6fbf (rev 01)
ff:15.0 0880: 8086:6fb4 (rev 01)
ff:15.1 0880: 8086:6fb5 (rev 01)
ff:15.2 0880: 8086:6fb6 (rev 01)
ff:15.3 0880: 8086:6fb7 (rev 01)
ff:16.0 0880: 8086:6f68 (rev 01)
ff:16.6 0880: 8086:6f6e (rev 01)
ff:16.7 0880: 8086:6f6f (rev 01)
ff:17.0 0880: 8086:6fd0 (rev 01)
ff:17.4 0880: 8086:6fb8 (rev 01)
ff:17.5 0880: 8086:6fb9 (rev 01)
ff:17.6 0880: 8086:6fba (rev 01)
ff:17.7 0880: 8086:6fbb (rev 01)
ff:1e.0 0880: 8086:6f98 (rev 01)
ff:1e.1 0880: 8086:6f99 (rev 01)
ff:1e.2 0880: 8086:6f9a (rev 01)
ff:1e.3 0880: 8086:6fc0 (rev 01)
ff:1e.4 0880: 8086:6f9c (rev 01)
ff:1f.0 0880: 8086:6f88 (rev 01)
ff:1f.2 0880: 8086:6f8a (rev 01)
lspci -t
-+-[0000:ff]-+-0b.0
 |           +-0b.1
 |           +-0b.2
 |           +-0b.3
 |           +-0c.0
 |           +-0c.1
 |           +-0c.2
 |           +-0c.3
 |           +-0c.4
 |           +-0c.5
 |           +-0c.6
 |           +-0c.7
 |           +-0f.0
 |           +-0f.1
 |           +-0f.4
 |           +-0f.5
 |           +-0f.6
 |           +-10.0
 |           +-10.1
 |           +-10.5
 |           +-10.6
 |           +-10.7
 |           +-12.0
 |           +-12.1
 |           +-13.0
 |           +-13.1
 |           +-13.2
 |           +-13.3
 |           +-13.4
 |           +-13.5
 |           +-13.6
 |           +-13.7
 |           +-14.0
 |           +-14.1
 |           +-14.2
 |           +-14.3
 |           +-14.4
 |           +-14.5
 |           +-14.6
 |           +-14.7
 |           +-15.0
 |           +-15.1
 |           +-15.2
 |           +-15.3
 |           +-16.0
 |           +-16.6
 |           +-16.7
 |           +-17.0
 |           +-17.4
 |           +-17.5
 |           +-17.6
 |           +-17.7
 |           +-1e.0
 |           +-1e.1
 |           +-1e.2
 |           +-1e.3
 |           +-1e.4
 |           +-1f.0
 |           \-1f.2
 \-[0000:00]-+-00.0
             +-01.0-[01]--
             +-02.0-[02-04]----00.0-[03-04]----00.0-[04]--+-00.0
             |                                            \-00.1
             +-03.0-[05]----00.0
             +-04.0
             +-04.1
             +-04.2
             +-04.3
             +-04.4
             +-04.5
             +-04.6
             +-04.7
             +-05.0
             +-05.1
             +-05.2
             +-05.4
             +-11.0
             +-11.4
             +-14.0
             +-16.0
             +-16.1
             +-1a.0
             +-1c.0-[06]--
             +-1c.2-[07-08]----00.0-[08]----00.0
             +-1c.4-[09]--+-00.0
             |            \-00.1
             +-1d.0
             +-1f.0
             +-1f.2
             \-1f.3
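
For anyone reading the dump above: the `alias:` lines from `modinfo` and the `vendor:device` pairs from `lspci -n` can be cross-checked to see which devices the listed amdgpu module declares it can bind to. The following is only a rough sketch. The function names are made up for illustration, and it assumes the upper 16 bits of the vendor and device fields in each alias are zero, as they are in every alias shown here. A matching alias does not prove the driver initialized the device.

```python
import re

# Alias lines look like: pci:v00001002d0000687Fsv*sd*bc*sc*i*
# Assumption: the upper 16 bits of vendor and device are always zero.
ALIAS_RE = re.compile(r"pci:v0000([0-9A-Fa-f]{4})d0000([0-9A-Fa-f]{4})sv")

# lspci -n lines look like: 04:00.0 0300: 1002:687f (rev c3)
LSPCI_RE = re.compile(
    r"^(\S+)\s+([0-9a-f]{4}):\s+([0-9a-f]{4}):([0-9a-f]{4})", re.IGNORECASE
)


def alias_ids(modinfo_text):
    """Collect (vendor, device) pairs, lower-cased, from modinfo alias lines."""
    return {(v.lower(), d.lower()) for v, d in ALIAS_RE.findall(modinfo_text)}


def claimed_devices(lspci_text, ids):
    """Return (slot, 'vendor:device') for lspci entries whose IDs are in ids."""
    matches = []
    for line in lspci_text.splitlines():
        m = LSPCI_RE.match(line.strip())
        if m and (m.group(3).lower(), m.group(4).lower()) in ids:
            matches.append((m.group(1), f"{m.group(3)}:{m.group(4)}"))
    return matches


# Small excerpts copied from the output above.
modinfo_sample = """\
alias:          pci:v00001002d0000687Fsv*sd*bc*sc*i*
alias:          pci:v00001002d000067A0sv*sd*bc*sc*i*
"""
lspci_sample = """\
04:00.0 0300: 1002:687f (rev c3)
04:00.1 0403: 1002:aaf8
05:00.0 0380: 1002:67a0
08:00.0 0300: 1a03:2000 (rev 30)
"""

print(claimed_devices(lspci_sample, alias_ids(modinfo_sample)))
# prints [('04:00.0', '1002:687f'), ('05:00.0', '1002:67a0')]
```

With these excerpts, the two AMD display devices at 04:00.0 and 05:00.0 match an alias, while the audio function at 04:00.1 and the non-AMD device at 08:00.0 do not.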

Why is HSA_ISA_INFO_WORKGROUP_MAX_SIZE hardcoded to 1024?

As I understand it, hcc can use at most 1024 threads per workgroup, because libhsa_runtime.so returns 1024 for HSA_ISA_INFO_WORKGROUP_MAX_SIZE.
What puzzles me is that the maximum number of wavefronts is 40 and the wavefront size is 64, which means HSA_ISA_INFO_WORKGROUP_MAX_SIZE could actually be 40 * 64 = 2560.
The OpenCL API limits this to 256 threads and hcc to 1024. That still does not seem right to me, since the value could actually be 2560, right?
Why does ROCR choose 1024 instead of 2560, which is the hardware's maximum thread count?
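
The arithmetic in the question can be written out as a short sketch. All four figures are the ones quoted above; nothing here is queried from the runtime, and the helper name `waves_needed` is made up for illustration.

```python
# Figures quoted in the question above; none are read from the runtime.
WAVEFRONT_SIZE = 64            # work-items per wavefront
MAX_WAVEFRONTS = 40            # wavefront limit cited in the question
REPORTED_WORKGROUP_MAX = 1024  # value returned for HSA_ISA_INFO_WORKGROUP_MAX_SIZE
OPENCL_WORKGROUP_MAX = 256     # limit the question cites for OpenCL


def waves_needed(workgroup_size, wavefront_size=WAVEFRONT_SIZE):
    """Number of wavefronts a workgroup of this size occupies (rounded up)."""
    return -(-workgroup_size // wavefront_size)


hardware_threads = MAX_WAVEFRONTS * WAVEFRONT_SIZE
print(hardware_threads)                      # 2560
print(waves_needed(REPORTED_WORKGROUP_MAX))  # 16
print(waves_needed(OPENCL_WORKGROUP_MAX))    # 4
```

Under these numbers a 1024-thread workgroup occupies 16 of the 40 wavefront slots, which is the gap the question is asking about.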

How to test ROCR-Runtime?

Hi, I’m trying to install the ROCm stack on Arch Linux (I know, it’s not officially supported), but I cannot get a version of HIP that passes its tests. On the same machine, if I compile HIP in an Ubuntu Docker container, it works as expected.

Thus, I want to check all components.
How do I test that my compiled ROCR-Runtime on Arch Linux is OK?
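
One very small check, short of a real test suite, is to load the runtime and call `hsa_init()` followed by `hsa_shut_down()`. The sketch below does this through Python's `ctypes`. It assumes the library is installed as `libhsa-runtime64.so.1` (the name that appears in the reports further down) and is on the loader path. The function name `hsa_smoke_test` is hypothetical. A passing result only shows that the runtime loads and initializes, not that kernels run correctly.

```python
import ctypes

HSA_STATUS_SUCCESS = 0  # value of HSA_STATUS_SUCCESS in hsa.h


def hsa_smoke_test(library="libhsa-runtime64.so.1"):
    """Load the runtime, call hsa_init() and hsa_shut_down(), report the result."""
    try:
        hsa = ctypes.CDLL(library)
    except OSError as exc:
        return f"load failed: {exc}"

    # Both functions take no arguments and return an hsa_status_t enum.
    hsa.hsa_init.restype = ctypes.c_int
    hsa.hsa_shut_down.restype = ctypes.c_int

    status = hsa.hsa_init()
    if status != HSA_STATUS_SUCCESS:
        return f"hsa_init failed with status {status:#x}"

    status = hsa.hsa_shut_down()
    if status != HSA_STATUS_SUCCESS:
        return f"hsa_shut_down failed with status {status:#x}"
    return "ok"


if __name__ == "__main__":
    print(hsa_smoke_test())
```

If the library lives somewhere unusual, pass its full path as the `library` argument.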

ROCR-Runtime alongside libcuda.so

I was experiencing an interesting problem:

Linking an executable with libhsa-runtime64.so (1.6.4) and libcuda.so (384.90) results in the following:

*** Error in `./vadd': munmap_chunk(): invalid pointer: 0x00000000010848e8 ***
======= Backtrace: =========
/lib/x86_64-linux-gnu/libc.so.6(+0x777e5)[0x7f293bac57e5]
/lib/x86_64-linux-gnu/libc.so.6(cfree+0x1a8)[0x7f293bad2698]
/usr/lib/nvidia-384/libnvidia-fatbinaryloader.so.384.90(+0x1ea0a)[0x7f293b1b3a0a]
/usr/lib/nvidia-384/libnvidia-fatbinaryloader.so.384.90(elf_end+0x9)[0x7f293b1a50a9]
/opt/rocm/hsa/lib/libhsa-runtime64.so.1(+0x58cf7)[0x7f29530cfcf7]
/opt/rocm/hsa/lib/libhsa-runtime64.so.1(+0x58eb9)[0x7f29530cfeb9]
/opt/rocm/hsa/lib/libhsa-runtime64.so.1(+0x74bd8)[0x7f29530ebbd8]
/opt/rocm/hsa/lib/libhsa-runtime64.so.1(+0x7b5f4)[0x7f29530f25f4]
/opt/rocm/hsa/lib/libhsa-runtime64.so.1(+0x3107e)[0x7f29530a807e]
/opt/rocm/hip/lib/cmake/hip/../../../../lib/libmcwamp_hsa.so(_ZN6Kalmar9HSADevice18IsCompatibleKernelEPvS1_+0x9b)[0x7f2939c299ab]
/opt/rocm/hip/lib/cmake/hip/../../../../lib/libhip_hcc.so(_ZN6Kalmar5CLAMP22DetermineAndGetProgramEPNS_11KalmarQueueEPmPPv+0x43b)[0x7f2952dd35bb]
/opt/rocm/hip/lib/cmake/hip/../../../../lib/libhip_hcc.so(_ZN6Kalmar15KalmarBootstrapC2Ev+0x120)[0x7f2952dd3a80]
/opt/rocm/hip/lib/cmake/hip/../../../../lib/libhip_hcc.so(__hcc_shared_library_init+0x29)[0x7f2952dd3939]
/lib64/ld-linux-x86-64.so.2(+0x106ba)[0x7f2954aac6ba]
/lib64/ld-linux-x86-64.so.2(+0x107cb)[0x7f2954aac7cb]
/lib64/ld-linux-x86-64.so.2(+0xc6a)[0x7f2954a9cc6a]

or it just segfaults.
The issue depends on the link order of libhsa-runtime64.so and libcuda.so: if libcuda.so is loaded first, the problem arises; if it is loaded after libhsa-runtime64.so, the problem disappears.
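
The backtrace above shows a frame in libhsa-runtime64.so.1 calling into `elf_end` inside libnvidia-fatbinaryloader.so. That would fit a symbol clash that depends on link order, but this reading is an inference from the trace rather than a confirmed diagnosis. The sketch below picks out such library boundaries from a glibc-style backtrace. The helper name `cross_library_calls` is hypothetical, and the input is a shortened copy of the frames above.

```python
import re

# Frames copied from the backtrace above (innermost first), shortened to the
# ones around the library boundary.
BACKTRACE = """\
/lib/x86_64-linux-gnu/libc.so.6(cfree+0x1a8)[0x7f293bad2698]
/usr/lib/nvidia-384/libnvidia-fatbinaryloader.so.384.90(+0x1ea0a)[0x7f293b1b3a0a]
/usr/lib/nvidia-384/libnvidia-fatbinaryloader.so.384.90(elf_end+0x9)[0x7f293b1a50a9]
/opt/rocm/hsa/lib/libhsa-runtime64.so.1(+0x58cf7)[0x7f29530cfcf7]
/opt/rocm/hsa/lib/libhsa-runtime64.so.1(+0x58eb9)[0x7f29530cfeb9]
"""

FRAME = re.compile(
    r"^(?P<path>[^(]+)\((?P<symbol>[^+)]*)\+0x[0-9a-f]+\)\[0x[0-9a-f]+\]$"
)


def cross_library_calls(backtrace):
    """Return (caller_lib, callee_lib, callee_symbol) for each library boundary."""
    frames = []
    for line in backtrace.splitlines():
        match = FRAME.match(line.strip())
        if match:
            lib = match["path"].rsplit("/", 1)[-1]
            frames.append((lib, match["symbol"] or None))
    calls = []
    # glibc prints the innermost frame first, so frame i was called by frame i + 1.
    for (callee, symbol), (caller, _) in zip(frames, frames[1:]):
        if callee != caller:
            calls.append((caller, callee, symbol))
    return calls


for caller, callee, symbol in cross_library_calls(BACKTRACE):
    print(f"{caller} -> {callee} ({symbol or 'unnamed'})")
```

For the frames above this lists two boundaries: the NVIDIA loader calling `cfree` in libc, and libhsa-runtime64.so.1 calling `elf_end` in the NVIDIA loader.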

clinfo causes segfault in /usr/lib64/libhsa-ext-image64.so.1

Running clinfo -l results in a segfault inside /usr/lib64/libhsa-ext-image64.so.1:

(gdb) run -l
Starting program: /usr/bin/clinfo -l
[Thread debugging using libthread_db enabled]
Using host libthread_db library "/usr/lib64/libthread_db.so.1".
[New Thread 0x7fffef4d7700 (LWP 3596)]
[New Thread 0x7fffeeb81700 (LWP 3597)]
[New Thread 0x7fffee380700 (LWP 3598)]
[New Thread 0x7fffedb7f700 (LWP 3599)]
[New Thread 0x7fffed37e700 (LWP 3600)]
[New Thread 0x7fffecb7d700 (LWP 3601)]
[New Thread 0x7fffd7fff700 (LWP 3602)]
[New Thread 0x7fffd77fe700 (LWP 3603)]
[New Thread 0x7fffd6ffd700 (LWP 3604)]
[New Thread 0x7fffd67fc700 (LWP 3605)]
[New Thread 0x7fffd5ffb700 (LWP 3606)]
[New Thread 0x7fffd57fa700 (LWP 3607)]
[New Thread 0x7fffd4ff9700 (LWP 3608)]
[New Thread 0x7fffb3fff700 (LWP 3609)]
[New Thread 0x7fffb37fe700 (LWP 3610)]
[New Thread 0x7fffb2ffd700 (LWP 3611)]
[New Thread 0x7fffb27fc700 (LWP 3612)]
[New Thread 0x7fffb1ffb700 (LWP 3613)]
[New Thread 0x7fffb17fa700 (LWP 3614)]
[New Thread 0x7fffb0ff9700 (LWP 3615)]
[New Thread 0x7fff93fff700 (LWP 3616)]
[New Thread 0x7fff937fe700 (LWP 3617)]
[New Thread 0x7fff92ffd700 (LWP 3618)]
[New Thread 0x7fff927fc700 (LWP 3619)]
[New Thread 0x7fff91ffb700 (LWP 3620)]
[New Thread 0x7fff917fa700 (LWP 3621)]
[New Thread 0x7fff90ff9700 (LWP 3622)]
[New Thread 0x7fff73fff700 (LWP 3623)]
[New Thread 0x7fff737fe700 (LWP 3624)]
[New Thread 0x7fff72ffd700 (LWP 3625)]
[New Thread 0x7fff727fc700 (LWP 3626)]
[New Thread 0x7fff71ffb700 (LWP 3627)]
[New Thread 0x7fff717fa700 (LWP 3628)]
[New Thread 0x7fff70ff9700 (LWP 3629)]
[New Thread 0x7fff53fff700 (LWP 3630)]
[New Thread 0x7fff537fe700 (LWP 3631)]
[New Thread 0x7fff52ffd700 (LWP 3632)]
[New Thread 0x7fff527fc700 (LWP 3633)]
[New Thread 0x7fff51ffb700 (LWP 3634)]
[New Thread 0x7fff517fa700 (LWP 3635)]
[New Thread 0x7fff50ff9700 (LWP 3636)]
[New Thread 0x7fff33fff700 (LWP 3637)]
[New Thread 0x7fff337fe700 (LWP 3638)]
[New Thread 0x7fff32ffd700 (LWP 3639)]
[New Thread 0x7fff327fc700 (LWP 3640)]
[New Thread 0x7fff31ffb700 (LWP 3641)]
[Thread 0x7fff31ffb700 (LWP 3641) exited]
[Thread 0x7fff327fc700 (LWP 3640) exited]
[Thread 0x7fff32ffd700 (LWP 3639) exited]
[Thread 0x7fff337fe700 (LWP 3638) exited]
[Thread 0x7fff33fff700 (LWP 3637) exited]
[Thread 0x7fff50ff9700 (LWP 3636) exited]
[Thread 0x7fff517fa700 (LWP 3635) exited]
[Thread 0x7fff51ffb700 (LWP 3634) exited]
[Thread 0x7fff527fc700 (LWP 3633) exited]
[Thread 0x7fff52ffd700 (LWP 3632) exited]
[Thread 0x7fff537fe700 (LWP 3631) exited]
[Thread 0x7fff53fff700 (LWP 3630) exited]
[Thread 0x7fff70ff9700 (LWP 3629) exited]
[Thread 0x7fff717fa700 (LWP 3628) exited]
[Thread 0x7fff71ffb700 (LWP 3627) exited]
[Thread 0x7fff727fc700 (LWP 3626) exited]

Thread 1 "clinfo" hit Catchpoint 1 (exception thrown), __cxxabiv1::__cxa_throw (obj=0x5555555d9f30, tinfo=0x7ffff7d3fba0 <typeinfo for clover::error>, dest=0x7ffff7c6eeb0 <clover::error::~error()>) at /var/tmp/portage/sys-devel/gcc-9.3.0/work/gcc-9.3.0/libstdc++-v3/libsupc++/eh_throw.cc:78
78	/var/tmp/portage/sys-devel/gcc-9.3.0/work/gcc-9.3.0/libstdc++-v3/libsupc++/eh_throw.cc: No such file or directory.
#0  __cxxabiv1::__cxa_throw (obj=0x5555555d9f30, tinfo=0x7ffff7d3fba0 <typeinfo for clover::error>, dest=0x7ffff7c6eeb0 <clover::error::~error()>) at /var/tmp/portage/sys-devel/gcc-9.3.0/work/gcc-9.3.0/libstdc++-v3/libsupc++/eh_throw.cc:78
        globals = <optimized out>
        header = <optimized out>
#1  0x00007ffff7c69fd4 in clover::device::device(clover::platform&, pipe_loader_device*) [clone .cold] () at /usr/lib/gcc/x86_64-pc-linux-gnu/9.3.0/include/g++-v9/ext/new_allocator.h:89
No symbol table info available.
#2  0x00007ffff7c8ec78 in clover::create<clover::device, clover::platform&, pipe_loader_device*&> () at ../mesa-20.0.4/src/gallium/state_trackers/clover/util/pointer.hpp:229
        ref = <optimized out>
        ref = <optimized out>
#3  clover::platform::platform (this=0x7ffff7d44960 <(anonymous namespace)::_clover_platform>) at ../mesa-20.0.4/src/gallium/state_trackers/clover/core/platform.cpp:36
        ldev = 0x5555555d7e50
        __for_range = <optimized out>
        __for_begin = <optimized out>
        __for_end = <optimized out>
        n = <optimized out>
        ldevs = <optimized out>
        n = <optimized out>
        ldevs = <optimized out>
        ldev = <optimized out>
        __for_range = <optimized out>
        __for_begin = <optimized out>
        __for_end = <optimized out>
#4  0x00007ffff7c6e066 in _GLOBAL__sub_I_platform.cpp () at ../mesa-20.0.4/src/gallium/state_trackers/clover/api/device.cpp:416
No symbol table info available.
#5  0x00007ffff7fe258a in call_init.part () from /lib64/ld-linux-x86-64.so.2
No symbol table info available.
#6  0x00007ffff7fe26a1 in _dl_init () from /lib64/ld-linux-x86-64.so.2
No symbol table info available.
#7  0x00007ffff7fe663b in dl_open_worker () from /lib64/ld-linux-x86-64.so.2
No symbol table info available.
#8  0x00007ffff7ea3011 in _dl_catch_exception () from /usr/lib64/libc.so.6
No symbol table info available.
#9  0x00007ffff7fe5eea in _dl_open () from /lib64/ld-linux-x86-64.so.2
No symbol table info available.
#10 0x00007ffff7f30288 in ?? () from /usr/lib64/libdl.so.2
No symbol table info available.
#11 0x00007ffff7ea3011 in _dl_catch_exception () from /usr/lib64/libc.so.6
No symbol table info available.
#12 0x00007ffff7ea30af in _dl_catch_error () from /usr/lib64/libc.so.6
No symbol table info available.
#13 0x00007ffff7f309d9 in ?? () from /usr/lib64/libdl.so.2
No symbol table info available.
#14 0x00007ffff7f30316 in dlopen () from /usr/lib64/libdl.so.2
No symbol table info available.
#15 0x00007ffff7f3956d in ?? () from /usr/lib64/libOpenCL.so.1
No symbol table info available.
#16 0x00007ffff7f39a48 in ?? () from /usr/lib64/libOpenCL.so.1
No symbol table info available.
#17 0x00007ffff7d59537 in __pthread_once_slow () from /usr/lib64/libpthread.so.0
No symbol table info available.
#18 0x00007ffff7f38d10 in ?? () from /usr/lib64/libOpenCL.so.1
No symbol table info available.
#19 0x00007ffff7f3b524 in clGetPlatformIDs () from /usr/lib64/libOpenCL.so.1
No symbol table info available.
#20 0x000055555555a050 in main (argc=<optimized out>, argv=<optimized out>) at src/clinfo.c:3190
        p = <optimized out>
        err = <optimized out>
        a = <optimized out>
        output = {mode = CLINFO_HUMAN, cond = COND_PROP_CHECK, brief = 1, detailed = 0, offline = 0, check_size = 0}
        plist = {num_platforms = 0, ndevs_total = 0, alloc_devs = 0, max_plat_version = 0, max_devs = 8388608, max_sname_len = 0, platform = 0x0, all_devs = 0x0, dev_offset = 0x0, pdata = 0x0, platform_checks = 0x0}
        __func__ = "main"
[New Thread 0x7fff527fc700 (LWP 3642)]
LoadLib(libhsa-ext-finalize64.so.1) failed: libhsa-ext-finalize64.so.1: cannot open shared object file: No such file or directory
LoadLib(libhsa-amd-aqlprofile64.so) failed: libhsa-amd-aqlprofile64.so: cannot open shared object file: No such file or directory

Thread 1 "clinfo" received signal SIGSEGV, Segmentation fault.
0x00007fffb04eadb0 in ?? () from /usr/lib64/libhsa-ext-image64.so.1
(gdb) bt full
#0  0x00007fffb04eadb0 in ?? () from /usr/lib64/libhsa-ext-image64.so.1
No symbol table info available.
#1  0x00007fffef914833 in hsa_amd_image_get_info_max_dim () from /usr/lib64/libhsa-runtime64.so.1
No symbol table info available.
#2  0x00007fffef8d3d14 in ?? () from /usr/lib64/libhsa-runtime64.so.1
No symbol table info available.
#3  0x00007fffef8fee8c in ?? () from /usr/lib64/libhsa-runtime64.so.1
No symbol table info available.
#4  0x00007fffef94436f in hsa_agent_get_info () from /usr/lib64/libhsa-runtime64.so.1
No symbol table info available.
#5  0x00007fffefab99d3 in ?? () from /usr/lib64/libamdocl64.so
No symbol table info available.
#6  0x00007fffefaba7d3 in ?? () from /usr/lib64/libamdocl64.so
No symbol table info available.
#7  0x00007fffefabbc9f in ?? () from /usr/lib64/libamdocl64.so
No symbol table info available.
#8  0x00007fffefa6d96f in ?? () from /usr/lib64/libamdocl64.so
No symbol table info available.
#9  0x00007fffefa91a8e in ?? () from /usr/lib64/libamdocl64.so
No symbol table info available.
#10 0x00007fffefaf4e75 in ?? () from /usr/lib64/libamdocl64.so
No symbol table info available.
#11 0x00007ffff7d59537 in __pthread_once_slow () from /usr/lib64/libpthread.so.0
No symbol table info available.
#12 0x00007fffefaf4f8c in clIcdGetPlatformIDsKHR () from /usr/lib64/libamdocl64.so
No symbol table info available.
#13 0x00007ffff7f39cf6 in ?? () from /usr/lib64/libOpenCL.so.1
No symbol table info available.
#14 0x00007ffff7d59537 in __pthread_once_slow () from /usr/lib64/libpthread.so.0
No symbol table info available.
#15 0x00007ffff7f38d10 in ?? () from /usr/lib64/libOpenCL.so.1
No symbol table info available.
#16 0x00007ffff7f3b524 in clGetPlatformIDs () from /usr/lib64/libOpenCL.so.1
No symbol table info available.
#17 0x000055555555a050 in main (argc=<optimized out>, argv=<optimized out>) at src/clinfo.c:3190
        p = <optimized out>
        err = <optimized out>
        a = <optimized out>
        output = {mode = CLINFO_HUMAN, cond = COND_PROP_CHECK, brief = 1, detailed = 0, offline = 0, check_size = 0}
        plist = {num_platforms = 0, ndevs_total = 0, alloc_devs = 0, max_plat_version = 0, max_devs = 8388608, max_sname_len = 0, platform = 0x0, all_devs = 0x0, dev_offset = 0x0, pdata = 0x0, platform_checks = 0x0}
        __func__ = "main"

The kernel module then hits a NULL pointer dereference, a kernel workqueue locks up, and eventually the whole system becomes unusable, because this also affects disk I/O:

[Fri Apr 10 13:46:00 2020] [drm] PCIE GART of 256M enabled (table at 0x000000F400000000).
[Fri Apr 10 13:46:00 2020] [drm] UVD and UVD ENC initialized successfully.
[Fri Apr 10 13:46:00 2020] [drm] VCE initialized successfully.
[Fri Apr 10 13:46:00 2020] [drm] Cannot find any crtc or sizes
[Fri Apr 10 13:46:27 2020] [drm] PCIE GART of 256M enabled (table at 0x000000F400000000).
[Fri Apr 10 13:46:28 2020] [drm] UVD and UVD ENC initialized successfully.
[Fri Apr 10 13:46:28 2020] [drm] VCE initialized successfully.
[Fri Apr 10 13:46:28 2020] [drm] Cannot find any crtc or sizes
[Fri Apr 10 13:46:28 2020] Alloc host visible vram on small bar is not allowed
[Fri Apr 10 13:46:28 2020] clinfo[3152]: segfault at 10 ip 00007f47082eadb0 sp 00007ffe6d369140 error 4 in libhsa-ext-image64.so.1.1.30100[7f47082b4000+96000]
[Fri Apr 10 13:46:28 2020] Code: 0c 04 00 00 00 45 31 f6 31 ed e9 cb fe ff ff 0f 1f 00 49 89 cd c7 44 24 0c 06 00 00 00 45 31 f6 31 ed e9 b3 fe ff ff 0f 1f 00 <49> 8b 44 24 10 49 8d 4c 24 08 48 85 c0 0f 84 8d 00 00 00 48 89 ca
[Fri Apr 10 13:46:29 2020] Evicting PASID 0x8019 queues
[Fri Apr 10 13:46:29 2020] Evicting PASID 0x8019 queues
[Fri Apr 10 13:47:03 2020] [drm] PCIE GART of 256M enabled (table at 0x000000F400000000).
[Fri Apr 10 13:47:03 2020] [drm] UVD and UVD ENC initialized successfully.
[Fri Apr 10 13:47:03 2020] [drm] VCE initialized successfully.
[Fri Apr 10 13:47:03 2020] [drm] Cannot find any crtc or sizes
[Fri Apr 10 13:47:03 2020] Alloc host visible vram on small bar is not allowed
[Fri Apr 10 13:47:03 2020] clinfo[3438]: segfault at 10 ip 00007fc1902eadb0 sp 00007ffdd5932c90 error 4 in libhsa-ext-image64.so.1.1.30100[7fc1902b4000+96000]
[Fri Apr 10 13:47:03 2020] Code: 0c 04 00 00 00 45 31 f6 31 ed e9 cb fe ff ff 0f 1f 00 49 89 cd c7 44 24 0c 06 00 00 00 45 31 f6 31 ed e9 b3 fe ff ff 0f 1f 00 <49> 8b 44 24 10 49 8d 4c 24 08 48 85 c0 0f 84 8d 00 00 00 48 89 ca
[Fri Apr 10 13:47:04 2020] Evicting PASID 0x801a queues
[Fri Apr 10 13:47:04 2020] Evicting PASID 0x801a queues
[Fri Apr 10 13:47:16 2020] [drm] PCIE GART of 256M enabled (table at 0x000000F400000000).
[Fri Apr 10 13:47:16 2020] [drm] UVD and UVD ENC initialized successfully.
[Fri Apr 10 13:47:17 2020] [drm] VCE initialized successfully.
[Fri Apr 10 13:47:17 2020] [drm] Cannot find any crtc or sizes
[Fri Apr 10 13:47:18 2020] Alloc host visible vram on small bar is not allowed
[Fri Apr 10 13:47:24 2020] Evicting PASID 0x801a queues
[Fri Apr 10 13:47:24 2020] Evicting PASID 0x801a queues
[Fri Apr 10 14:44:18 2020] [drm] schedsdma0 is not ready, skipping
[Fri Apr 10 14:44:18 2020] [drm] schedsdma1 is not ready, skipping
[Fri Apr 10 14:44:18 2020] BUG: kernel NULL pointer dereference, address: 0000000000000008
[Fri Apr 10 14:44:18 2020] #PF: supervisor read access in kernel mode
[Fri Apr 10 14:44:18 2020] #PF: error_code(0x0000) - not-present page
[Fri Apr 10 14:44:18 2020] PGD 2774067 P4D 2774067 PUD 0
[Fri Apr 10 14:44:18 2020] Oops: 0000 [#1] PREEMPT SMP NOPTI
[Fri Apr 10 14:44:18 2020] CPU: 2 PID: 11067 Comm: kworker/2:0 Tainted: G                T 5.5.14 #2
[Fri Apr 10 14:44:18 2020] Hardware name: System manufacturer System Product Name/ROG STRIX B350-F GAMING, BIOS 5220 09/12/2019
[Fri Apr 10 14:44:18 2020] Workqueue: kfd_process_wq kfd_process_wq_release [amdgpu]
[Fri Apr 10 14:44:18 2020] RIP: 0010:amdgpu_vm_sdma_commit+0x55/0x140 [amdgpu]
[Fri Apr 10 14:44:18 2020] Code: 7f 10 00 4c 8b a2 88 01 00 00 4c 8b a8 80 00 00 00 4c 8d b0 b8 00 00 00 75 07 4c 8d b0 68 01 00 00 49 8b 46 10 41 8b 54 24 08 <48> 8b 40 08 48 8d 78 88 85 d2 0f 84 c5 00 00 00 48 8b 40 90 4c 89
[Fri Apr 10 14:44:18 2020] RSP: 0018:ffffb118c9a1fc00 EFLAGS: 00010246
[Fri Apr 10 14:44:18 2020] RAX: 0000000000000000 RBX: ffffb118c9a1fc40 RCX: 00000000000013fe
[Fri Apr 10 14:44:18 2020] RDX: 0000000000000008 RSI: ffffb118c9a1fcd8 RDI: ffffb118c9a1fc40
[Fri Apr 10 14:44:18 2020] RBP: ffffb118c9a1fcd8 R08: 0000000000001000 R09: 0000000000000000
[Fri Apr 10 14:44:18 2020] R10: 0000000000000003 R11: 00000000000003fe R12: ffff97e01ef275f8
[Fri Apr 10 14:44:18 2020] R13: ffff97e18a341c00 R14: ffff97e1d36c1968 R15: ffff97e285d5f280
[Fri Apr 10 14:44:18 2020] FS:  0000000000000000(0000) GS:ffff97e2b0680000(0000) knlGS:0000000000000000
[Fri Apr 10 14:44:18 2020] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[Fri Apr 10 14:44:18 2020] CR2: 0000000000000008 CR3: 000000025d8d6000 CR4: 00000000003406e0
[Fri Apr 10 14:44:18 2020] Call Trace:
[Fri Apr 10 14:44:18 2020]  amdgpu_vm_bo_update_mapping+0xdd/0xf0 [amdgpu]
[Fri Apr 10 14:44:18 2020]  amdgpu_vm_clear_freed+0xbd/0x200 [amdgpu]
[Fri Apr 10 14:44:18 2020]  amdgpu_amdkfd_gpuvm_unmap_memory_from_gpu+0x11d/0x1b0 [amdgpu]
[Fri Apr 10 14:44:18 2020]  kfd_process_device_free_bos+0x77/0xd0 [amdgpu]
[Fri Apr 10 14:44:18 2020]  kfd_process_wq_release+0x6d/0x130 [amdgpu]
[Fri Apr 10 14:44:18 2020]  process_one_work+0x1dc/0x3c0
[Fri Apr 10 14:44:18 2020]  worker_thread+0x45/0x3c0
[Fri Apr 10 14:44:18 2020]  kthread+0xf6/0x130
[Fri Apr 10 14:44:18 2020]  ? process_one_work+0x3c0/0x3c0
[Fri Apr 10 14:44:18 2020]  ? kthread_park+0x90/0x90
[Fri Apr 10 14:44:18 2020]  ret_from_fork+0x22/0x40
[Fri Apr 10 14:44:18 2020] Modules linked in: squashfs loop fuse nft_masq nf_conntrack_netbios_ns nf_conntrack_broadcast xt_CHECKSUM xt_MASQUERADE xt_conntrack ipt_REJECT xt_tcpudp nf_nat_tftp tun bridge stp llc nft_objref nf_conntrack_tftp nf_tables_set nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4 nf_reject_ipv6 nft_reject nft_ct nft_chain_nat cpufreq_conservative nf_tables ebtable_nat ebtable_broute ip6table_nat ip6table_mangle ip6table_raw ip6table_security iptable_nat nf_nat iptable_mangle iptable_raw iptable_security nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 ip_set nfnetlink ebtable_filter ebtables ip6table_filter ip6_tables iptable_filter ip_tables x_tables bpfilter bluetooth ecdh_generic ecc crc16 amdgpu snd_usb_audio edac_mce_amd kvm_amd ccp snd_usbmidi_lib snd_rawmidi rng_core snd_hda_codec_realtek snd_hda_codec_generic snd_seq_device uvcvideo ledtrig_audio kvm snd_hda_codec_hdmi gspca_vc032x snd_hda_intel gspca_main sp5100_tco videobuf2_vmalloc
[Fri Apr 10 14:44:18 2020]  eeepc_wmi pcspkr asus_wmi videobuf2_memops snd_intel_dspcfg videobuf2_v4l2 irqbypass battery videobuf2_common snd_hda_codec k10temp joydev i2c_piix4 sparse_keymap snd_hda_core gpu_sched rfkill snd_hwdep ttm wmi_bmof input_leds videodev snd_pcm mousedev snd_timer snd mc drm_kms_helper soundcore syscopyarea sysfillrect sysimgblt fb_sys_fops gpio_amdpt evdev mac_hid acpi_cpufreq nls_iso8859_1 nls_cp437 vfat fat sch_fq_codel drm sctp lm92 btrfs blake2b_generic xor raid6_pq dm_cache_smq dm_cache dm_persistent_data dm_bio_prison dm_bufio libcrc32c crc32c_generic ax88179_178a usbnet mii usbmouse hid_steam hid_generic usbkbd usbhid hid sd_mod uas usb_storage crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel ahci libahci igb aesni_intel libata crypto_simd cryptd glue_helper i2c_algo_bit dca scsi_mod xhci_pci xhci_hcd wmi video backlight pinctrl_amd dm_mirror dm_region_hash dm_log dm_mod
[Fri Apr 10 14:44:18 2020] CR2: 0000000000000008
[Fri Apr 10 14:44:18 2020] ---[ end trace 5e9d8ca00e8ec1d1 ]---
[Fri Apr 10 14:44:18 2020] [drm] PCIE GART of 256M enabled (table at 0x000000F400000000).
[Fri Apr 10 14:44:18 2020] RIP: 0010:amdgpu_vm_sdma_commit+0x55/0x140 [amdgpu]
[Fri Apr 10 14:44:18 2020] Code: 7f 10 00 4c 8b a2 88 01 00 00 4c 8b a8 80 00 00 00 4c 8d b0 b8 00 00 00 75 07 4c 8d b0 68 01 00 00 49 8b 46 10 41 8b 54 24 08 <48> 8b 40 08 48 8d 78 88 85 d2 0f 84 c5 00 00 00 48 8b 40 90 4c 89
[Fri Apr 10 14:44:18 2020] RSP: 0018:ffffb118c9a1fc00 EFLAGS: 00010246
[Fri Apr 10 14:44:18 2020] RAX: 0000000000000000 RBX: ffffb118c9a1fc40 RCX: 00000000000013fe
[Fri Apr 10 14:44:18 2020] RDX: 0000000000000008 RSI: ffffb118c9a1fcd8 RDI: ffffb118c9a1fc40
[Fri Apr 10 14:44:18 2020] RBP: ffffb118c9a1fcd8 R08: 0000000000001000 R09: 0000000000000000
[Fri Apr 10 14:44:18 2020] R10: 0000000000000003 R11: 00000000000003fe R12: ffff97e01ef275f8
[Fri Apr 10 14:44:18 2020] R13: ffff97e18a341c00 R14: ffff97e1d36c1968 R15: ffff97e285d5f280
[Fri Apr 10 14:44:18 2020] FS:  0000000000000000(0000) GS:ffff97e2b0680000(0000) knlGS:0000000000000000
[Fri Apr 10 14:44:18 2020] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[Fri Apr 10 14:44:18 2020] CR2: 0000000000000008 CR3: 000000025d8d6000 CR4: 00000000003406e0
[Fri Apr 10 14:44:18 2020] [drm] UVD and UVD ENC initialized successfully.
[Fri Apr 10 14:44:18 2020] [drm] VCE initialized successfully.
[Fri Apr 10 14:44:18 2020] [drm] Cannot find any crtc or sizes
[Fri Apr 10 14:44:53 2020] BUG: workqueue lockup - pool cpus=2 node=0 flags=0x0 nice=0 stuck for 35s!
[Fri Apr 10 14:44:53 2020] Showing busy workqueues and worker pools:
[Fri Apr 10 14:44:53 2020] workqueue events: flags=0x0
[Fri Apr 10 14:44:53 2020]   pwq 4: cpus=2 node=0 flags=0x0 nice=0 active=16/256 refcnt=17
[Fri Apr 10 14:44:53 2020]     pending: efivar_update_sysfs_entries, console_callback, free_work, psi_avgs_work, psi_avgs_work, psi_avgs_work, psi_avgs_work, psi_avgs_work, psi_avgs_work, psi_avgs_work, ttm_bo_delayed_workqueue [ttm], psi_avgs_work, psi_avgs_work, psi_avgs_work, psi_avgs_work, psi_avgs_work
[Fri Apr 10 14:44:53 2020] workqueue events_unbound: flags=0x2
[Fri Apr 10 14:44:53 2020]   pwq 64: cpus=0-31 flags=0x4 nice=0 active=1/512 refcnt=3
[Fri Apr 10 14:44:53 2020]     in-flight: 9452:commit_work [drm_kms_helper]
[Fri Apr 10 14:44:53 2020] workqueue mm_percpu_wq: flags=0x8
[Fri Apr 10 14:44:53 2020]   pwq 4: cpus=2 node=0 flags=0x0 nice=0 active=1/256 refcnt=2
[Fri Apr 10 14:44:53 2020]     pending: vmstat_update
[Fri Apr 10 14:44:53 2020] workqueue dm-cache: flags=0x8
[Fri Apr 10 14:44:53 2020]   pwq 4: cpus=2 node=0 flags=0x0 nice=0 active=1/256 refcnt=2
[Fri Apr 10 14:44:53 2020]     pending: process_deferred_bios [dm_cache]
[Fri Apr 10 14:44:53 2020] workqueue kfd_process_wq: flags=0x0
[Fri Apr 10 14:44:53 2020]   pwq 4: cpus=2 node=0 flags=0x0 nice=0 active=1/256 refcnt=2
[Fri Apr 10 14:44:53 2020]     in-flight: 11067:kfd_process_wq_release [amdgpu]
[Fri Apr 10 14:44:53 2020] pool 4: cpus=2 node=0 flags=0x0 nice=0 hung=35s workers=3 idle: 13285 11402
[Fri Apr 10 14:44:53 2020] pool 64: cpus=0-31 flags=0x4 nice=0 hung=0s workers=14 idle: 9356 197 12870 10662 512 10553 7711 12711 550 511 13203 9842 7731
[Fri Apr 10 14:44:59 2020] nf_conntrack: default automatic helper assignment has been turned off for security reasons and CT-based  firewall rule not found. Use the iptables CT target to attach helpers instead.
[Fri Apr 10 14:45:24 2020] BUG: workqueue lockup - pool cpus=2 node=0 flags=0x0 nice=0 stuck for 66s!
[Fri Apr 10 14:45:24 2020] Showing busy workqueues and worker pools:
[Fri Apr 10 14:45:24 2020] workqueue events: flags=0x0
[Fri Apr 10 14:45:24 2020]   pwq 4: cpus=2 node=0 flags=0x0 nice=0 active=17/256 refcnt=18
[Fri Apr 10 14:45:24 2020]     pending: efivar_update_sysfs_entries, console_callback, free_work, psi_avgs_work, psi_avgs_work, psi_avgs_work, psi_avgs_work, psi_avgs_work, psi_avgs_work, psi_avgs_work, ttm_bo_delayed_workqueue [ttm], psi_avgs_work, psi_avgs_work, psi_avgs_work, psi_avgs_work, psi_avgs_work, psi_avgs_work
[Fri Apr 10 14:45:24 2020] workqueue events_unbound: flags=0x2
[Fri Apr 10 14:45:24 2020]   pwq 64: cpus=0-31 flags=0x4 nice=0 active=1/512 refcnt=3
[Fri Apr 10 14:45:24 2020]     in-flight: 512:commit_work [drm_kms_helper]
[Fri Apr 10 14:45:24 2020] workqueue mm_percpu_wq: flags=0x8
[Fri Apr 10 14:45:24 2020]   pwq 4: cpus=2 node=0 flags=0x0 nice=0 active=1/256 refcnt=2
[Fri Apr 10 14:45:24 2020]     pending: vmstat_update
[Fri Apr 10 14:45:24 2020] workqueue dm-cache: flags=0x8
[Fri Apr 10 14:45:24 2020]   pwq 4: cpus=2 node=0 flags=0x0 nice=0 active=1/256 refcnt=2
[Fri Apr 10 14:45:24 2020]     pending: process_deferred_bios [dm_cache]
[Fri Apr 10 14:45:24 2020] workqueue kfd_process_wq: flags=0x0
[Fri Apr 10 14:45:24 2020]   pwq 4: cpus=2 node=0 flags=0x0 nice=0 active=1/256 refcnt=2
[Fri Apr 10 14:45:24 2020]     in-flight: 11067:kfd_process_wq_release [amdgpu]
[Fri Apr 10 14:45:24 2020] pool 4: cpus=2 node=0 flags=0x0 nice=0 hung=66s workers=3 idle: 13285 11402
[Fri Apr 10 14:45:24 2020] pool 64: cpus=0-31 flags=0x4 nice=0 hung=0s workers=14 idle: 10662 9356 7711 10553 12870 12711 9452 197 550 511 13203 9842 7731
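
The two user-space `segfault` lines in the log above can be reduced to an offset inside the library mapping, which makes it easier to see whether both crashes hit the same instruction. The sketch below subtracts the mapping start (the value in square brackets) from the instruction pointer. The function name `fault_offsets` is hypothetical, and the result is an offset within that mapping, which is not necessarily a file offset.

```python
import re

# The two user-space segfault lines from the kernel log above, timestamps removed.
LOG = """\
clinfo[3152]: segfault at 10 ip 00007f47082eadb0 sp 00007ffe6d369140 error 4 in libhsa-ext-image64.so.1.1.30100[7f47082b4000+96000]
clinfo[3438]: segfault at 10 ip 00007fc1902eadb0 sp 00007ffdd5932c90 error 4 in libhsa-ext-image64.so.1.1.30100[7fc1902b4000+96000]
"""

SEGFAULT = re.compile(
    r"segfault at (?P<addr>[0-9a-f]+) ip (?P<ip>[0-9a-f]+) .* in "
    r"(?P<lib>\S+)\[(?P<base>[0-9a-f]+)\+(?P<size>[0-9a-f]+)\]"
)


def fault_offsets(log):
    """Return (library, faulting address, ip offset within the mapping) per line."""
    results = []
    for match in SEGFAULT.finditer(log):
        offset = int(match["ip"], 16) - int(match["base"], 16)
        results.append((match["lib"], int(match["addr"], 16), hex(offset)))
    return results


for lib, addr, offset in fault_offsets(LOG):
    print(lib, hex(addr), offset)
```

Both lines come out at offset 0x36db0 with faulting address 0x10, so the two runs crashed at the same instruction while reading from a very low address.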

My system is running these versions:

Portage 2.3.98 (python 3.6.10-final-0, default/linux/amd64/17.1/desktop/plasma/systemd, gcc-9.3.0, glibc-2.30-r7, 5.5.14 x86_64)
=================================================================
                         System Settings
=================================================================
System uname: Linux-5.5.14-x86_64-AMD_Ryzen_5_2400G_with_Radeon_Vega_Graphics-with-gentoo-2.7
KiB Mem:    14128300 total,    667604 free
KiB Swap:          0 total,         0 free
Timestamp of repository gentoo: Fri, 10 Apr 2020 05:15:01 +0000
Head commit of repository gentoo: 143dda37b357fa949abd681742f465d8cb5f981e
Head commit of repository flatpak-overlay: 2bcc4b030ff8288683533a84f15777680c8c883e

Head commit of repository local: 9778e39ceebd0f17a7aafdf548ad31a135e3daff

sh bash 5.0_p16
ld GNU gold (Gentoo 2.34 p1 2.34.0) 1.16
ccache version 3.7.9 [disabled]
app-shells/bash:          5.0_p16::gentoo
dev-java/java-config:     2.2.0-r4::gentoo
dev-lang/perl:            5.30.1::gentoo
dev-lang/python:          2.7.17-r2::gentoo, 3.6.10-r1::gentoo, 3.7.7-r1::gentoo, 3.8.2-r1::gentoo, 3.9.0_alpha5::gentoo
dev-util/ccache:          3.7.9::gentoo
dev-util/cmake:           3.17.0::gentoo
dev-util/pkgconfig:       0.29.2::gentoo
sys-apps/baselayout:      2.7::gentoo
sys-apps/sandbox:         2.18::gentoo
sys-devel/autoconf:       2.13-r1::gentoo, 2.69-r5::gentoo
sys-devel/automake:       1.13.4-r2::gentoo, 1.16.2::gentoo
sys-devel/binutils:       2.34::gentoo
sys-devel/gcc:            9.3.0::gentoo
sys-devel/gcc-config:     2.2.1::gentoo
sys-devel/libtool:        2.4.6-r6::gentoo
sys-devel/make:           4.3::gentoo
sys-kernel/linux-headers: 5.6::gentoo (virtual/os-headers)
sys-libs/glibc:           2.30-r7::gentoo
Repositories:

gentoo
    location: /var/cache/portage/gentoo
    sync-type: rsync
    sync-uri: rsync://rsync.de.gentoo.org/gentoo-portage
    priority: -1000
    sync-rsync-verify-metamanifest: yes
    sync-rsync-verify-max-age: 24
    sync-rsync-verify-jobs: 1
    sync-rsync-extra-opts: 

flatpak-overlay
    location: /var/db/repos/flatpak-overlay
    sync-type: git
    sync-uri: https://github.com/fosero/flatpak-overlay.git
    masters: gentoo

local
    location: /var/cache/portage/local
    sync-type: git
    sync-uri: https://github.com/devurandom/gentoo-overlay.git
    masters: gentoo
    priority: 1000

ACCEPT_KEYWORDS="amd64 ~amd64"
ACCEPT_LICENSE="@FREE"
CBUILD="x86_64-pc-linux-gnu"
CFLAGS="-pipe -O2 -march=znver1"
CHOST="x86_64-pc-linux-gnu"
CONFIG_PROTECT="/etc /etc/grs/systems.conf /usr/lib64/libreoffice/program/sofficerc /usr/share/config /usr/share/gnupg/qualified.txt /usr/share/maven-bin-3.6/conf"
CONFIG_PROTECT_MASK="/etc/ca-certificates.conf /etc/dconf /etc/env.d /etc/fonts/fonts.conf /etc/gconf /etc/gentoo-release /etc/revdep-rebuild /etc/sandbox.d /etc/terminfo /etc/texmf/language.dat.d /etc/texmf/language.def.d /etc/texmf/updmap.d /etc/texmf/web2c"
CXXFLAGS="-pipe -O2 -march=znver1"
DISTDIR="/var/cache/portage/distfiles"
EMERGE_DEFAULT_OPTS="--nospinner"
ENV_UNSET="DBUS_SESSION_BUS_ADDRESS DISPLAY GOBIN PERL5LIB PERL5OPT PERLPREFIX PERL_CORE PERL_MB_OPT PERL_MM_OPT XAUTHORITY XDG_CACHE_HOME XDG_CONFIG_HOME XDG_DATA_HOME XDG_RUNTIME_DIR"
FCFLAGS="-O2 -pipe"
FEATURES="assume-digests binpkg-docompress binpkg-dostrip binpkg-logs buildsyspkg cgroup compressdebug config-protect-if-modified distlocks ebuild-locks fakeroot fixlafiles ipc-sandbox merge-sync mount-sandbox multilib-strict network-sandbox news parallel-fetch parallel-install pid-sandbox preserve-libs protect-owned qa-unresolved-soname-deps sandbox sfperms strict unknown-features-warn unmerge-logs unmerge-orphans userfetch userpriv usersandbox usersync xattr"
FFLAGS="-O2 -pipe"
GENTOO_MIRRORS="http://ftp.spline.inf.fu-berlin.de/mirrors/gentoo/ http://ftp-stud.hs-esslingen.de/pub/Mirrors/gentoo/ http://distfiles.gentoo.org"
LANG="en_US.UTF-8"
LDFLAGS="-Wl,-O1 -Wl,--as-needed -Wl,--hash-style=gnu"
MAKEOPTS="-j6 -l4"
PKGDIR="/var/cache/portage/packages"
PORTAGE_CONFIGROOT="/"
PORTAGE_RSYNC_OPTS="--recursive --links --safe-links --perms --times --omit-dir-times --compress --force --whole-file --delete --stats --human-readable --timeout=180 --exclude=/distfiles --exclude=/local --exclude=/packages --exclude=/.git"
PORTAGE_TMPDIR="/tmp"
USE="7z 7zip X a52 aac aacplus aacs acl acpi activities aio alsa amd64 appindicator appstream archive audit avahi ayatana bdplus berkdb blake2 bluetooth bluray branding brotli bs2b btrfs bzip2 cairo caps cdda cddb cdio cdr celt chromaprint cjk clang cli clipboard color-management colord conntrack crypt cups d3d9 dav1d dbus declarative device-mapper dirac djvu dri drm dts dvb dvd dvdr ed25519 editorconfig egl elf emboss encode epub evdev exif faudio fax fbcon fdk ffmpeg fftw filecaps firefox firewalld fish-completion fits flac fontconfig fontforge fortran fribidi gamepad gbm gdal gdbm geoclue geolocation gif git gles2 gmp gnome-online-accounts gnupg google googledrive gpg gps graphicsmagick gstreamer gtk gtk3 gzip harfbuzz hdf5 heif http2 ibus iconv icu idn imlib inotify introspection ipv6 jemalloc jpeg jpeg2k json kde kipi kms kwallet ladspa latex lcms libatomic libglvnd libidn2 libinput libnotify libproxy libsecret libsoxr libtirpc libvirt lm-sensors lrz lv2 lvm lz4 lzma lzo mad mariadb markdown mbim mercurial mjpeg mng mobi modemmanager modplug mp3 mp4 mpeg mplayer mpris mtp multilib mysql ncurses netlink networkmanager nls nptl numa office ofx ogg openal opencl opencv openexr opengl openh264 openmax openmp opus pam pango pcap pch pcre pcre2 pdf pgo phonon pixman pkcs11 pkcs7 plasma pm-utils png policykit postscript ppds prison pulseaudio pwquality python qml qrcode qt5 raw rdp readline redfish samba sasl scanner schroedinger screencast sctp sdl sdl2 seccomp semantic-desktop share smartcard snappy sparse speech speex spell spice ssl startup-notification steamruntime stemmer svg systemd systemtap tbb tcpd teamd telepathy tga theora threads thunderbolt tiff timezone tmux truetype tslib udev udisks uinput unicode unwind upnp upnp-av upower usb utempter v4l v4l2 vaapi vdpau vkd3d vorbis vpx vulkan wasm wavpack wayland webchannel webengine webp widgets wmf woff2 wps x264 x265 xattr xcb xcomposite xinerama xkb xml xmp xrandr xscreensaver xv xvid xwayland xxhash xz yaml 
zeroconf zeromq zimg zlib zstd" ABI_X86="64" ADA_TARGET="gnat_2018" ALSA_CARDS="hda-intel" APACHE2_MODULES="authn_core authz_core socache_shmcb unixd actions alias auth_basic authn_alias authn_anon authn_dbm authn_default authn_file authz_dbm authz_default authz_groupfile authz_host authz_owner authz_user autoindex cache cgi cgid dav dav_fs dav_lock deflate dir disk_cache env expires ext_filter file_cache filter headers include info log_config logio mem_cache mime mime_magic negotiation rewrite setenvif speling status unique_id userdir usertrack vhost_alias" CALLIGRA_FEATURES="karbon sheets words" CAMERAS="ptp2" COLLECTD_PLUGINS="df interface irq load memory rrdtool swap syslog" CPU_FLAGS_X86="aes avx avx2 f16c fma3 mmx mmxext pclmul popcnt sha sse sse2 sse3 sse4_1 sse4_2 sse4a ssse3" ELIBC="glibc" ENLIGHTENMENT_MODULES="*" GPSD_PROTOCOLS="ashtech aivdm earthmate evermore fv18 garmin garmintxt gpsclock greis isync itrax mtk3301 nmea ntrip navcom oceanserver oldstyle oncore rtcm104v2 rtcm104v3 sirf skytraq superstar2 timing tsip tripmate tnt ublox ubx" GRUB_PLATFORMS="efi-64" INPUT_DEVICES="joystick libinput" KERNEL="linux" L10N="de de-DE en en-GB ar fa tr ja ko zh zh-CN zh-TW" LCD_DEVICES="bayrad cfontz cfontz633 glk hd44780 lb216 lcdm001 mtxorb ncurses text" LIBREOFFICE_EXTENSIONS="nlpsolver scripting-javascript wiki-publisher" LIRC_DEVICES="devinput" LLVM_TARGETS="AMDGPU BPF RISCV WebAssembly" LUA_TARGET="lua5-2" OFFICE_IMPLEMENTATION="libreoffice" PHP_TARGETS="php7-2" POSTGRES_TARGETS="postgres10 postgres11" PYTHON_SINGLE_TARGET="python3_6" PYTHON_TARGETS="python2_7 python3_6 pypy pypy3" QEMU_SOFTMMU_TARGETS="riscv32 riscv64 x86_64" QEMU_USER_TARGETS="riscv32 riscv64" RUBY_TARGETS="ruby24 ruby25" USERLAND="GNU" VIDEO_CARDS="amdgpu virgl" XTABLES_ADDONS="quota2 psd pknock lscan length2 ipv4options ipset ipp2p iface geoip fuzzy condition tee tarpit sysrq steal rawnat logmark ipmark dhcpmac delude chaos account"
Unset:  CC, CPPFLAGS, CTARGET, CXX, INSTALL_MASK, LC_ALL, LINGUAS, PORTAGE_BINHOST, PORTAGE_BUNZIP2_COMMAND, PORTAGE_COMPRESS, PORTAGE_COMPRESS_FLAGS, PORTAGE_RSYNC_EXTRA_OPTS

=================================================================
                        Package Settings
=================================================================

sys-devel/llvm-roc-3.3.0::gentoo was built with the following:
USE="-debug" ABI_X86="(64)"


dev-util/rocm-cmake-3.3.0::gentoo was built with the following:
USE="" ABI_X86="(64)"


dev-libs/roct-thunk-interface-3.3.0::gentoo was built with the following:
USE="" ABI_X86="(64)"


dev-libs/rocr-runtime-3.3.0::gentoo was built with the following:
USE="-non-free" ABI_X86="(64)"


dev-libs/rocm-opencl-runtime-3.3.0::gentoo was built with the following:
USE="" ABI_X86="(64)"


dev-libs/rocm-device-libs-3.3.0::gentoo was built with the following:
USE="" ABI_X86="(64)"


dev-libs/rocm-comgr-3.3.0::gentoo was built with the following:
USE="" ABI_X86="(64)"


dev-libs/hsa-ext-rocr-1.1.30100.0::gentoo was built with the following:
USE="-deprecated" ABI_X86="(64)"


dev-util/clinfo-2.2.18.04.06-r1::gentoo was built with the following:
USE="" ABI_X86="(64)"
CFLAGS="-pipe -O2 -march=znver1 -g"
CXXFLAGS="-pipe -O2 -march=znver1 -g"
FEATURES="assume-digests binpkg-docompress binpkg-dostrip binpkg-logs buildsyspkg cgroup compressdebug config-protect-if-modified distlocks ebuild-locks fakeroot fixlafiles ipc-sandbox merge-sync mount-sandbox multilib-strict network-sandbox news parallel-fetch parallel-install pid-sandbox preserve-libs protect-owned qa-unresolved-soname-deps sandbox sfperms splitdebug strict unknown-features-warn unmerge-logs unmerge-orphans userfetch userpriv usersandbox usersync xattr"

My hardware is:

computer
    description: Desktop Computer
    product: System Product Name (SKU)
    vendor: System manufacturer
    version: System Version
    serial: [REMOVED]
    width: 4294967295 bits
    capabilities: smbios-3.1 dmi-3.1 smp vsyscall32
    configuration: boot=normal chassis=desktop family=To be filled by O.E.M. sku=SKU uuid=[REMOVED]
  *-core
       description: Motherboard
       product: ROG STRIX B350-F GAMING
       vendor: ASUSTeK COMPUTER INC.
       physical id: 0
       version: Rev X.0x
       serial: [REMOVED]
       slot: Default string
     *-firmware
          description: BIOS
          vendor: American Megatrends Inc.
          physical id: 0
          version: 5220
          date: 09/12/2019
          size: 64KiB
          capacity: 15MiB
          capabilities: pci apm upgrade shadowing cdboot bootselect socketedrom edd int13floppy1200 int13floppy720 int13floppy2880 int5printscreen int9keyboard int14serial int17printer acpi usb biosbootspecification uefi
     *-memory
          description: System Memory
          physical id: 27
          slot: System board or motherboard
          size: 16GiB
        *-bank:0
             description: [empty]
             product: Unknown
             vendor: Unknown
             physical id: 0
             serial: [REMOVED]
             slot: DIMM_A1
        *-bank:1
             description: DIMM DDR4 Synchronous Unbuffered (Unregistered) 2666 MHz (0.4 ns)
             product: 9965684-013.A00G
             vendor: Kingston
             physical id: 1
             serial: [REMOVED]
             slot: DIMM_A2
             size: 8GiB
             width: 64 bits
             clock: 2666MHz (0.4ns)
        *-bank:2
             description: [empty]
             product: Unknown
             vendor: Unknown
             physical id: 2
             serial: [REMOVED]
             slot: DIMM_B1
        *-bank:3
             description: DIMM DDR4 Synchronous Unbuffered (Unregistered) 2666 MHz (0.4 ns)
             product: 9965684-013.A00G
             vendor: Kingston
             physical id: 3
             serial: [REMOVED]
             slot: DIMM_B2
             size: 8GiB
             width: 64 bits
             clock: 2666MHz (0.4ns)
     *-cache:0
          description: L1 cache
          physical id: 29
          slot: L1 - Cache
          size: 384KiB
          capacity: 384KiB
          clock: 1GHz (1.0ns)
          capabilities: pipeline-burst internal write-back unified
          configuration: level=1
     *-cache:1
          description: L2 cache
          physical id: 2a
          slot: L2 - Cache
          size: 2MiB
          capacity: 2MiB
          clock: 1GHz (1.0ns)
          capabilities: pipeline-burst internal write-back unified
          configuration: level=2
     *-cache:2
          description: L3 cache
          physical id: 2b
          slot: L3 - Cache
          size: 4MiB
          capacity: 4MiB
          clock: 1GHz (1.0ns)
          capabilities: pipeline-burst internal write-back unified
          configuration: level=3
     *-cpu
          description: CPU
          product: AMD Ryzen 5 2400G with Radeon Vega Graphics
          vendor: Advanced Micro Devices [AMD]
          physical id: 2c
          bus info: cpu@0
          version: AMD Ryzen 5 2400G with Radeon Vega Graphics
          serial: [REMOVED]
          slot: AM4
          size: 2566MHz
          capacity: 3900MHz
          width: 64 bits
          clock: 100MHz
          capabilities: x86-64 fpu fpu_exception wp vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp constant_tsc rep_good nopl nonstop_tsc cpuid extd_apicid aperfmperf pni pclmulqdq monitor ssse3 fma cx16 sse4_1 sse4_2 movbe popcnt aes xsave avx f16c rdrand lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw skinit wdt tce topoext perfctr_core perfctr_nb bpext perfctr_llc mwaitx cpb hw_pstate sme ssbd sev ibpb vmmcall fsgsbase bmi1 avx2 smep bmi2 rdseed adx smap clflushopt sha_ni xsaveopt xsavec xgetbv1 xsaves clzero irperf xsaveerptr arat npt lbrv svm_lock nrip_save tsc_scale vmcb_clean flushbyasid decodeassists pausefilter pfthreshold avic v_vmsave_vmload vgif overflow_recov succor smca cpufreq
          configuration: cores=4 enabledcores=4 threads=8
     *-pci:0
          description: Host bridge
          product: Raven/Raven2 Root Complex
          vendor: Advanced Micro Devices, Inc. [AMD]
          physical id: 100
          bus info: pci@0000:00:00.0
          version: 00
          width: 32 bits
          clock: 33MHz
        *-generic UNCLAIMED
             description: IOMMU
             product: Raven/Raven2 IOMMU
             vendor: Advanced Micro Devices, Inc. [AMD]
             physical id: 0.2
             bus info: pci@0000:00:00.2
             version: 00
             width: 32 bits
             clock: 33MHz
             capabilities: msi ht bus_master cap_list
             configuration: latency=0
        *-pci:0
             description: PCI bridge
             product: Raven/Raven2 PCIe GPP Bridge [6:0]
             vendor: Advanced Micro Devices, Inc. [AMD]
             physical id: 1.1
             bus info: pci@0000:00:01.1
             version: 00
             width: 32 bits
             clock: 33MHz
             capabilities: pci pm pciexpress msi ht normal_decode bus_master cap_list
             configuration: driver=pcieport
             resources: irq:26 ioport:e000(size=4096) memory:fcf00000-fcffffff ioport:e0000000(size=270532608)
           *-display
                description: VGA compatible controller
                product: Baffin [Radeon RX 550 640SP / RX 560/560X]
                vendor: Advanced Micro Devices, Inc. [AMD/ATI]
                physical id: 0
                bus info: pci@0000:01:00.0
                version: cf
                width: 64 bits
                clock: 33MHz
                capabilities: pm pciexpress msi vga_controller bus_master cap_list rom
                configuration: driver=amdgpu latency=0
                resources: irq:106 memory:e0000000-efffffff memory:f0000000-f01fffff ioport:e000(size=256) memory:fcf00000-fcf3ffff memory:fcf40000-fcf5ffff
        *-pci:1
             description: PCI bridge
             product: Raven/Raven2 PCIe GPP Bridge [6:0]
             vendor: Advanced Micro Devices, Inc. [AMD]
             physical id: 1.2
             bus info: pci@0000:00:01.2
             version: 00
             width: 32 bits
             clock: 33MHz
             capabilities: pci pm pciexpress msi ht normal_decode bus_master cap_list
             configuration: driver=pcieport
             resources: irq:27 ioport:d000(size=4096) memory:fcb00000-fccfffff
           *-usb
                description: USB controller
                product: 300 Series Chipset USB 3.1 xHCI Controller
                vendor: Advanced Micro Devices, Inc. [AMD]
                physical id: 0
                bus info: pci@0000:02:00.0
                version: 02
                width: 64 bits
                clock: 33MHz
                capabilities: msi pm pciexpress xhci bus_master cap_list
                configuration: driver=xhci_hcd latency=0
                resources: irq:83 memory:fcca0000-fcca7fff
              *-usbhost:0
                   product: xHCI Host Controller
                   vendor: Linux 5.5.14 xhci-hcd
                   physical id: 0
                   bus info: usb@1
                   logical name: usb1
                   version: 5.05
                   capabilities: usb-2.00
                   configuration: driver=hub slots=10 speed=480Mbit/s
              *-usbhost:1
                   product: xHCI Host Controller
                   vendor: Linux 5.5.14 xhci-hcd
                   physical id: 1
                   bus info: usb@2
                   logical name: usb2
                   version: 5.05
                   capabilities: usb-3.10
                   configuration: driver=hub slots=4 speed=10000Mbit/s
                 *-usb
                      description: Mass storage device
                      product: Expansion
                      vendor: Seagate
                      physical id: 4
                      bus info: usb@2:4
                      version: 0.00
                      serial: [REMOVED]
                      capabilities: usb-3.00 scsi
                      configuration: driver=uas maxpower=144mA speed=5000Mbit/s
           *-storage
                description: SATA controller
                product: 300 Series Chipset SATA Controller
                vendor: Advanced Micro Devices, Inc. [AMD]
                physical id: 0.1
                bus info: pci@0000:02:00.1
                version: 02
                width: 32 bits
                clock: 33MHz
                capabilities: storage msi pm pciexpress ahci_1.0 bus_master cap_list rom
                configuration: driver=ahci latency=0
                resources: irq:80 memory:fcc80000-fcc9ffff memory:fcc00000-fcc7ffff
           *-pci
                description: PCI bridge
                product: Advanced Micro Devices, Inc. [AMD]
                vendor: Advanced Micro Devices, Inc. [AMD]
                physical id: 0.2
                bus info: pci@0000:02:00.2
                version: 02
                width: 32 bits
                clock: 33MHz
                capabilities: pci msi pm pciexpress normal_decode bus_master cap_list
                configuration: driver=pcieport
                resources: irq:32 ioport:d000(size=4096) memory:fcb00000-fcbfffff
              *-pci:0
                   description: PCI bridge
                   product: 300 Series Chipset PCIe Port
                   vendor: Advanced Micro Devices, Inc. [AMD]
                   physical id: 0
                   bus info: pci@0000:03:00.0
                   version: 02
                   width: 32 bits
                   clock: 33MHz
                   capabilities: pci msi pm pciexpress normal_decode bus_master cap_list
                   configuration: driver=pcieport
                   resources: irq:34 ioport:d000(size=4096) memory:fcb00000-fcbfffff
                 *-network
                      description: Ethernet interface
                      product: I211 Gigabit Network Connection
                      vendor: Intel Corporation
                      physical id: 0
                      bus info: pci@0000:04:00.0
                      logical name: enp4s0
                      version: 03
                      serial: [REMOVED]
                      size: 1Gbit/s
                      capacity: 1Gbit/s
                      width: 32 bits
                      clock: 33MHz
                      capabilities: pm msi msix pciexpress bus_master cap_list ethernet physical tp 10bt 10bt-fd 100bt 100bt-fd 1000bt-fd autonegotiation
                      configuration: autonegotiation=on broadcast=yes driver=igb driverversion=5.6.0-k duplex=full firmware=0. 6-1 ip=[REMOVED] latency=0 link=yes multicast=yes port=twisted pair speed=1Gbit/s
                      resources: irq:33 memory:fcb00000-fcb1ffff ioport:d000(size=32) memory:fcb20000-fcb23fff
              *-pci:1
                   description: PCI bridge
                   product: 300 Series Chipset PCIe Port
                   vendor: Advanced Micro Devices, Inc. [AMD]
                   physical id: 1
                   bus info: pci@0000:03:01.0
                   version: 02
                   width: 32 bits
                   clock: 33MHz
                   capabilities: pci msi pm pciexpress normal_decode bus_master cap_list
                   configuration: driver=pcieport
                   resources: irq:36
              *-pci:2
                   description: PCI bridge
                   product: 300 Series Chipset PCIe Port
                   vendor: Advanced Micro Devices, Inc. [AMD]
                   physical id: 4
                   bus info: pci@0000:03:04.0
                   version: 02
                   width: 32 bits
                   clock: 33MHz
                   capabilities: pci msi pm pciexpress normal_decode bus_master cap_list
                   configuration: driver=pcieport
                   resources: irq:37
              *-pci:3
                   description: PCI bridge
                   product: 300 Series Chipset PCIe Port
                   vendor: Advanced Micro Devices, Inc. [AMD]
                   physical id: 6
                   bus info: pci@0000:03:06.0
                   version: 02
                   width: 32 bits
                   clock: 33MHz
                   capabilities: pci msi pm pciexpress normal_decode bus_master cap_list
                   configuration: driver=pcieport
                   resources: irq:38
              *-pci:4
                   description: PCI bridge
                   product: 300 Series Chipset PCIe Port
                   vendor: Advanced Micro Devices, Inc. [AMD]
                   physical id: 7
                   bus info: pci@0000:03:07.0
                   version: 02
                   width: 32 bits
                   clock: 33MHz
                   capabilities: pci msi pm pciexpress normal_decode bus_master cap_list
                   configuration: driver=pcieport
                   resources: irq:40
        *-pci:2
             description: PCI bridge
             product: Raven/Raven2 PCIe GPP Bridge [6:0]
             vendor: Advanced Micro Devices, Inc. [AMD]
             physical id: 1.6
             bus info: pci@0000:00:01.6
             version: 00
             width: 32 bits
             clock: 33MHz
             capabilities: pci pm pciexpress msi ht normal_decode bus_master cap_list
             configuration: driver=pcieport
             resources: irq:28 memory:fce00000-fcefffff
           *-storage
                description: Non-Volatile memory controller
                product: NVMe SSD Controller SM981/PM981/PM983
                vendor: Samsung Electronics Co Ltd
                physical id: 0
                bus info: pci@0000:09:00.0
                version: 00
                width: 64 bits
                clock: 33MHz
                capabilities: storage pm msi pciexpress msix nvm_express bus_master cap_list
                configuration: driver=nvme latency=0
                resources: irq:41 memory:fce00000-fce03fff
        *-pci:3
             description: PCI bridge
             product: Raven/Raven2 Internal PCIe GPP Bridge 0 to Bus A
             vendor: Advanced Micro Devices, Inc. [AMD]
             physical id: 8.1
             bus info: pci@0000:00:08.1
             version: 00
             width: 32 bits
             clock: 33MHz
             capabilities: pci pm pciexpress msi normal_decode bus_master cap_list
             configuration: driver=pcieport
             resources: irq:29 ioport:c000(size=4096) memory:fc700000-fcafffff ioport:c0000000(size=270532608)
           *-display
                description: VGA compatible controller
                product: Raven Ridge [Radeon Vega Series / Radeon Vega Mobile Series]
                vendor: Advanced Micro Devices, Inc. [AMD/ATI]
                physical id: 0
                bus info: pci@0000:0a:00.0
                version: c6
                width: 64 bits
                clock: 33MHz
                capabilities: pm pciexpress msi msix vga_controller bus_master cap_list
                configuration: driver=amdgpu latency=0
                resources: irq:93 memory:c0000000-cfffffff memory:d0000000-d01fffff ioport:c000(size=256) memory:fca00000-fca7ffff
           *-multimedia:0
                description: Audio device
                product: Raven/Raven2/Fenghuang HDMI/DP Audio Controller
                vendor: Advanced Micro Devices, Inc. [AMD/ATI]
                physical id: 0.1
                bus info: pci@0000:0a:00.1
                version: 00
                width: 32 bits
                clock: 33MHz
                capabilities: pm pciexpress msi bus_master cap_list
                configuration: driver=snd_hda_intel latency=0
                resources: irq:103 memory:fca88000-fca8bfff
           *-generic UNCLAIMED
                description: Encryption controller
                product: Family 17h (Models 10h-1fh) Platform Security Processor
                vendor: Advanced Micro Devices, Inc. [AMD]
                physical id: 0.2
                bus info: pci@0000:0a:00.2
                version: 00
                width: 32 bits
                clock: 33MHz
                capabilities: pm pciexpress msi msix cap_list
                configuration: latency=0
                resources: memory:fc900000-fc9fffff memory:fca8c000-fca8dfff
           *-usb:0
                description: USB controller
                product: Raven USB 3.1
                vendor: Advanced Micro Devices, Inc. [AMD]
                physical id: 0.3
                bus info: pci@0000:0a:00.3
                version: 00
                width: 64 bits
                clock: 33MHz
                capabilities: pm pciexpress msi msix xhci bus_master cap_list
                configuration: driver=xhci_hcd latency=0
                resources: irq:84 memory:fc800000-fc8fffff
              *-usbhost:0
                   product: xHCI Host Controller
                   vendor: Linux 5.5.14 xhci-hcd
                   physical id: 0
                   bus info: usb@3
                   logical name: usb3
                   version: 5.05
                   capabilities: usb-2.00
                   configuration: driver=hub slots=4 speed=480Mbit/s
                 *-usb
                      description: Keyboard
                      product: Steam Controller
                      vendor: Valve Software
                      physical id: 3
                      bus info: usb@3:3
                      version: 0.01
                      capabilities: usb-2.00
                      configuration: driver=usbhid maxpower=100mA speed=12Mbit/s
              *-usbhost:1
                   product: xHCI Host Controller
                   vendor: Linux 5.5.14 xhci-hcd
                   physical id: 1
                   bus info: usb@4
                   logical name: usb4
                   version: 5.05
                   capabilities: usb-3.10
                   configuration: driver=hub slots=4 speed=10000Mbit/s
           *-usb:1
                description: USB controller
                product: Raven USB 3.1
                vendor: Advanced Micro Devices, Inc. [AMD]
                physical id: 0.4
                bus info: pci@0000:0a:00.4
                version: 00
                width: 64 bits
                clock: 33MHz
                capabilities: pm pciexpress msi msix xhci bus_master cap_list
                configuration: driver=xhci_hcd latency=0
                resources: irq:93 memory:fc700000-fc7fffff
              *-usbhost:0
                   product: xHCI Host Controller
                   vendor: Linux 5.5.14 xhci-hcd
                   physical id: 0
                   bus info: usb@5
                   logical name: usb5
                   version: 5.05
                   capabilities: usb-2.00
                   configuration: driver=hub slots=2 speed=480Mbit/s
                 *-usb
                      description: USB hub
                      product: USB2.0 Hub
                      vendor: VIA Labs, Inc.
                      physical id: 1
                      bus info: usb@5:1
                      version: 90.11
                      capabilities: usb-2.10
                      configuration: driver=hub slots=4 speed=480Mbit/s
                    *-usb:0
                         description: Mouse
                         product: Comfort Mouse 6000
                         vendor: Microsoft
                         physical id: 2
                         bus info: usb@5:1.2
                         version: 1.44
                         capabilities: usb-2.00
                         configuration: driver=usbhid maxpower=100mA speed=2Mbit/s
                    *-usb:1
                         description: Video
                         product: HD Pro Webcam C920
                         vendor: Logitech, Inc.
                         physical id: 3
                         bus info: usb@5:1.3
                         version: 0.19
                         serial: [REMOVED]
                         capabilities: usb-2.00
                         configuration: driver=snd-usb-audio maxpower=500mA speed=480Mbit/s
                    *-usb:2
                         description: Keyboard
                         product: Dell USB Keyboard
                         vendor: Dell
                         physical id: 4
                         bus info: usb@5:1.4
                         version: 3.06
                         capabilities: usb-1.10
                         configuration: driver=usbhid maxpower=70mA speed=2Mbit/s
              *-usbhost:1
                   product: xHCI Host Controller
                   vendor: Linux 5.5.14 xhci-hcd
                   physical id: 1
                   bus info: usb@6
                   logical name: usb6
                   version: 5.05
                   capabilities: usb-3.10
                   configuration: driver=hub slots=1 speed=10000Mbit/s
                 *-usb
                      description: USB hub
                      product: USB3.0 Hub
                      vendor: VIA Labs, Inc.
                      physical id: 1
                      bus info: usb@6:1
                      version: 90.11
                      capabilities: usb-3.00
                      configuration: driver=hub slots=4 speed=5000Mbit/s
                    *-usb
                         description: Generic USB device
                         product: AX88179
                         vendor: ASIX Elec. Corp.
                         physical id: 1
                         bus info: usb@6:1.1
                         version: 1.00
                         serial: [REMOVED]
                         capabilities: usb-3.00
                         configuration: driver=ax88179_178a maxpower=496mA speed=5000Mbit/s
           *-multimedia:1
                description: Audio device
                product: Family 17h (Models 10h-1fh) HD Audio Controller
                vendor: Advanced Micro Devices, Inc. [AMD]
                physical id: 0.6
                bus info: pci@0000:0a:00.6
                version: 00
                width: 32 bits
                clock: 33MHz
                capabilities: pm pciexpress msi bus_master cap_list
                configuration: driver=snd_hda_intel latency=0
                resources: irq:104 memory:fca80000-fca87fff
        *-pci:4
             description: PCI bridge
             product: Raven/Raven2 Internal PCIe GPP Bridge 0 to Bus B
             vendor: Advanced Micro Devices, Inc. [AMD]
             physical id: 8.2
             bus info: pci@0000:00:08.2
             version: 00
             width: 32 bits
             clock: 33MHz
             capabilities: pci pm pciexpress msi normal_decode bus_master cap_list
             configuration: driver=pcieport
             resources: irq:30 memory:fcd00000-fcdfffff
           *-storage
                description: SATA controller
                product: FCH SATA Controller [AHCI mode]
                vendor: Advanced Micro Devices, Inc. [AMD]
                physical id: 0
                bus info: pci@0000:0b:00.0
                version: 61
                width: 32 bits
                clock: 33MHz
                capabilities: storage pm pciexpress msi ahci_1.0 bus_master cap_list
                configuration: driver=ahci latency=0
                resources: irq:82 memory:fcd00000-fcd007ff
        *-serial
             description: SMBus
             product: FCH SMBus Controller
             vendor: Advanced Micro Devices, Inc. [AMD]
             physical id: 14
             bus info: pci@0000:00:14.0
             version: 61
             width: 32 bits
             clock: 66MHz
             configuration: driver=piix4_smbus latency=0
             resources: irq:0
        *-isa
             description: ISA bridge
             product: FCH LPC Bridge
             vendor: Advanced Micro Devices, Inc. [AMD]
             physical id: 14.3
             bus info: pci@0000:00:14.3
             version: 51
             width: 32 bits
             clock: 66MHz
             capabilities: isa bus_master
             configuration: latency=0
     *-pci:1
          description: Host bridge
          product: Family 17h (Models 00h-1fh) PCIe Dummy Host Bridge
          vendor: Advanced Micro Devices, Inc. [AMD]
          physical id: 101
          bus info: pci@0000:00:01.0
          version: 00
          width: 32 bits
          clock: 33MHz
     *-pci:2
          description: Host bridge
          product: Family 17h (Models 00h-1fh) PCIe Dummy Host Bridge
          vendor: Advanced Micro Devices, Inc. [AMD]
          physical id: 102
          bus info: pci@0000:00:08.0
          version: 00
          width: 32 bits
          clock: 33MHz
     *-pci:3
          description: Host bridge
          product: Raven/Raven2 Device 24: Function 0
          vendor: Advanced Micro Devices, Inc. [AMD]
          physical id: 103
          bus info: pci@0000:00:18.0
          version: 00
          width: 32 bits
          clock: 33MHz
     *-pci:4
          description: Host bridge
          product: Raven/Raven2 Device 24: Function 1
          vendor: Advanced Micro Devices, Inc. [AMD]
          physical id: 104
          bus info: pci@0000:00:18.1
          version: 00
          width: 32 bits
          clock: 33MHz
     *-pci:5
          description: Host bridge
          product: Raven/Raven2 Device 24: Function 2
          vendor: Advanced Micro Devices, Inc. [AMD]
          physical id: 105
          bus info: pci@0000:00:18.2
          version: 00
          width: 32 bits
          clock: 33MHz
     *-pci:6
          description: Host bridge
          product: Raven/Raven2 Device 24: Function 3
          vendor: Advanced Micro Devices, Inc. [AMD]
          physical id: 106
          bus info: pci@0000:00:18.3
          version: 00
          width: 32 bits
          clock: 33MHz
          configuration: driver=k10temp
          resources: irq:0
     *-pci:7
          description: Host bridge
          product: Raven/Raven2 Device 24: Function 4
          vendor: Advanced Micro Devices, Inc. [AMD]
          physical id: 107
          bus info: pci@0000:00:18.4
          version: 00
          width: 32 bits
          clock: 33MHz
     *-pci:8
          description: Host bridge
          product: Raven/Raven2 Device 24: Function 5
          vendor: Advanced Micro Devices, Inc. [AMD]
          physical id: 108
          bus info: pci@0000:00:18.5
          version: 00
          width: 32 bits
          clock: 33MHz
     *-pci:9
          description: Host bridge
          product: Raven/Raven2 Device 24: Function 6
          vendor: Advanced Micro Devices, Inc. [AMD]
          physical id: 109
          bus info: pci@0000:00:18.6
          version: 00
          width: 32 bits
          clock: 33MHz
     *-pci:10
          description: Host bridge
          product: Raven/Raven2 Device 24: Function 7
          vendor: Advanced Micro Devices, Inc. [AMD]
          physical id: 10a
          bus info: pci@0000:00:18.7
          version: 00
          width: 32 bits
          clock: 33MHz
  *-network:0
       description: Ethernet interface
       physical id: 1
       bus info: usb@6:1.1
       logical name: enp10s0f4u1u1
       serial: [REMOVED]
       size: 1Gbit/s
       capacity: 1Gbit/s
       capabilities: ethernet physical tp mii 10bt 10bt-fd 100bt 100bt-fd 1000bt 1000bt-fd autonegotiation
       configuration: autonegotiation=on broadcast=yes driver=ax88179_178a duplex=full ip=[REMOVED] link=yes multicast=yes port=MII speed=1Gbit/s
  *-network:1
       description: Ethernet interface
       physical id: 2
       logical name: virbr0
       serial: [REMOVED]
       capabilities: ethernet physical
       configuration: autonegotiation=off broadcast=yes driver=bridge driverversion=2.3 firmware=N/A ip=[REMOVED] link=no multicast=yes
  *-network:2 DISABLED
       description: Ethernet interface
       physical id: 3
       logical name: virbr0-nic
       serial: [REMOVED]
       size: 10Mbit/s
       capabilities: ethernet physical
       configuration: autonegotiation=off broadcast=yes driver=tun driverversion=1.6 duplex=full link=no multicast=yes port=twisted pair speed=10Mbit/s

Are Stoney Ridge APUs supported?

I get a crash when trying to run the MatrixMultiplication example (from rocm-gpudebugsdk_1.5.270-g3e9d2df_amd64.deb and rocm-gdb_1.5.270-gc4fb045_amd64.deb):

$ cd /opt/rocm/gpudebugsdk/samples/MatrixMultiplication && /opt/rocm/bin/rocm-gdb MatrixMul
GNU gdb (GDB) 7.11
Copyright (C) 2016 Free Software Foundation, Inc.
License GPLv3+: GNU GPL version 3 or later <http://gnu.org/licenses/gpl.html>
This is free software: you are free to change and redistribute it.
There is NO WARRANTY, to the extent permitted by law.  Type "show copying"
and "show warranty" for details.
AMD rocm-gdb 1.5.270
This GDB was configured as "x86_64-pc-linux-gnu".
Type "show configuration" for configuration details.
For bug reporting instructions, please see:
<http://www.gnu.org/software/gdb/bugs/>.
Find the GDB manual and other documentation resources online at:
<http://www.gnu.org/software/gdb/documentation/>.
For help, type "help".
For ROCm GPU debugging specific help, type "help rocm".
Type "apropos word" to search for commands related to "word"...
ROCm Configure Steps Done
....Certain GDB signals have been changed
Reading symbols from MatrixMul...done.
(ROCm-gdb) r
Starting program: /opt/rocm/gpudebugsdk/samples/MatrixMultiplication/MatrixMul 
[Thread debugging using libthread_db enabled]
Using host libthread_db library "/lib/x86_64-linux-gnu/libthread_db.so.1".
Initializing HSA runtime...

Program received signal SIGSEGV, Segmentation fault.
0x00007ffff7b4bfa5 in ?? () from /opt/rocm/hsa/lib/libhsa-runtime64.so.1
(ROCm-gdb) bt
#0  0x00007ffff7b4bfa5 in ?? () from /opt/rocm/hsa/lib/libhsa-runtime64.so.1
#1  0x00007ffff7b57ba3 in ?? () from /opt/rocm/hsa/lib/libhsa-runtime64.so.1
#2  0x00007ffff7b57c14 in ?? () from /opt/rocm/hsa/lib/libhsa-runtime64.so.1
#3  0x00007ffff7b701ce in ?? () from /opt/rocm/hsa/lib/libhsa-runtime64.so.1
#4  0x00007ffff7b58b0a in ?? () from /opt/rocm/hsa/lib/libhsa-runtime64.so.1
#5  0x0000000000403635 in AMDT::HSAResourceManager::InitRuntime (verbosePrint=true, gpuIndex=0) at ../Common/HSAResourceManager.cpp:80
#6  0x000000000040df84 in RunTest (doVerify=false) at MatrixMul.cpp:76
#7  0x000000000040deeb in main (argc=1, argv=0x7fffffffdb08) at MatrixMul.cpp:65

APU model: AMD A4-9120. More info:

$ uname -r
4.11.0-kfd-compute-rocm-rel-1.6-180

$ cat /sys/class/kfd/kfd/topology/nodes/0/properties
cpu_cores_count 2
simd_count 40
mem_banks_count 1
caches_count 4
io_links_count 0
cpu_core_id_base 16
simd_id_base 2147483648
max_waves_per_simd 40
lds_size_in_kb 64
gds_size_in_kb 0
wave_front_size 64
array_count 1
simd_arrays_per_engine 0
cu_per_simd_array 10
simd_per_cu 4
max_slots_scratch_cu 32
vendor_id 0
device_id 0
location_id 0
max_engine_clk_ccompute 2200

$ dmesg | grep IOMMU
[    1.269895] AMD-Vi: IOMMU performance counters supported
[    1.271662] AMD-Vi: Found IOMMU at 0000:00:00.2 cap 0x40
[    1.628151] AMD IOMMUv2 driver by Joerg Roedel <[email protected]>

$ dmesg | grep CRAT
[    0.000000] ACPI: CRAT 0x00000000E7BD9000 0002E8 (v01 HPQOEM INSYDE   00000001 HP   00040000)
[    1.633109] Parsing CRAT table with 1 nodes

Thanks!

ROCM 1.6 Assertion failure on Bristol Ridge APU: ROCR-Runtime/src/core/runtime/runtime.cpp:162: void core::Runtime::RegisterAgent(core::Agent*): Assertion `system_regions_fine_.size() > 0' failed.

The full stack trace is below. It looks like the root cause is that the kernel reports mem_banks_count as 0:

user@host ]$ cat /sys/devices/virtual/kfd/kfd/topology/nodes/0/properties
cpu_cores_count 4
simd_count 40
mem_banks_count 0
caches_count 8
io_links_count 0
cpu_core_id_base 16
simd_id_base 2147483648
max_waves_per_simd 40
lds_size_in_kb 64
gds_size_in_kb 0
wave_front_size 64
array_count 1
simd_arrays_per_engine 0
cu_per_simd_array 10
simd_per_cu 4
max_slots_scratch_cu 32
vendor_id 0
device_id 0
location_id 0
max_engine_clk_ccompute 3100

src/core/runtime/runtime.cpp:162: void core::Runtime::RegisterAgent(core::Agent*): Assertion `system_regions_fine_.size() > 0' failed.

#4 0x00007ffff7b24782 in core::Runtime::RegisterAgent (this=0x6179b0, agent=0x61d720)
at src/core/runtime/runtime.cpp:162
#5 0x00007ffff7b0e43b in amd::DiscoverCpu (node_id=0, node_prop=...) at src/core/runtime/amd_topology.cpp:68
#6 0x00007ffff7b0e83d in amd::BuildTopology () at src/core/runtime/amd_topology.cpp:175
#7 0x00007ffff7b0e8cb in amd::Load () at src/core/runtime/amd_topology.cpp:191
#8 0x00007ffff7b27909 in core::Runtime::Load (this=0x6179b0) at src/core/runtime/runtime.cpp:992
#9 0x00007ffff7b24381 in core::Runtime::Acquire () at src/core/runtime/runtime.cpp:113
#10 0x00007ffff7b11a8b in HSA::hsa_init () at src/core/runtime/hsa.cpp:194
#11 0x00007ffff7b39537 in hsa_init () at src/core/common/hsa_table_interface.cpp:61
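
A minimal sketch of how the properties file above could be checked before calling hsa_init. It assumes, following the reporter's guess, that a node with CPU cores and a mem_banks_count of 0 is the condition behind the assertion. The function names are hypothetical and not part of ROCR.

```python
def parse_kfd_properties(text):
    """Parse 'key value' lines from a KFD topology properties file into a dict of ints."""
    props = {}
    for line in text.splitlines():
        parts = line.split()
        if len(parts) == 2 and parts[1].isdigit():
            props[parts[0]] = int(parts[1])
    return props


def missing_system_memory(props):
    """True for a node that has CPU cores but reports no memory banks (assumed trigger)."""
    return props.get("cpu_cores_count", 0) > 0 and props.get("mem_banks_count", 0) == 0


# Excerpt of the properties reported above for the Bristol Ridge node.
SAMPLE = """\
cpu_cores_count 4
simd_count 40
mem_banks_count 0
caches_count 8
"""

props = parse_kfd_properties(SAMPLE)
if missing_system_memory(props):
    print("node has CPU cores but mem_banks_count is 0")
```

On a real system the text would come from the properties file shown in the cat command rather than from the embedded sample.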

Missing library libhsa-ext-finalize64

Description

When I run a ROCm-related program such as rocminfo or hipcc, I get the following warning:

$ rocminfo > /dev/null
LoadLib(libhsa-ext-finalize64.so.1) failed: libhsa-ext-finalize64.so.1: cannot open shared object file: No such file or directory

$ hipcc --version > /dev/null
LoadLib(libhsa-ext-finalize64.so.1) failed: libhsa-ext-finalize64.so.1: cannot open shared object file: No such file or directory

The rocminfo command runs fine, see its output

System

Arch Linux (Kernel 5.6.2) with Vega 56, compiled ROCm 3.3.0 from source

$ lspci | grep VGA
1f:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Vega 10 XL/XT [Radeon RX Vega 56/64] (rev c3)
$ lsmod | grep amd
edac_mce_amd           32768  0
kvm_amd               114688  0
kvm                   827392  1 kvm_amd
ccp                   114688  1 kvm_amd
gpio_amdpt             20480  0
pinctrl_amd            32768  0
amdgpu               5496832  3
gpu_sched              40960  1 amdgpu
i2c_algo_bit           16384  1 amdgpu
ttm                   122880  1 amdgpu
drm_kms_helper        241664  1 amdgpu
drm                   565248  7 gpu_sched,drm_kms_helper,amdgpu,ttm
$ ls -l /dev/ | grep kfd
crw-rw-rw-  1 root render    240,   0  8. Apr 08:33 kfd

This is my ROCm directory
Output of strace for rocminfo here
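
A small sketch of how one might confirm which HSA libraries the dynamic loader can actually resolve, independent of rocminfo. It only uses Python's ctypes; the helper name try_load is hypothetical, and the two library names are the ones that appear in the messages on this page.

```python
import ctypes


def try_load(soname):
    """Attempt to load a shared library by name; return (loaded, error_message)."""
    try:
        ctypes.CDLL(soname)
        return True, ""
    except OSError as exc:
        # The loader reports the same kind of message as the LoadLib warning.
        return False, str(exc)


for name in ("libhsa-runtime64.so.1", "libhsa-ext-finalize64.so.1"):
    ok, err = try_load(name)
    print(f"{name}: {'found' if ok else 'not loadable: ' + err}")
```

If a library lives outside the default search path (for example under /opt/rocm), the result depends on LD_LIBRARY_PATH and the ldconfig cache, just as it does for rocminfo.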

Memory access fault by GPU node-1 RX 480 - Ubuntu 18.x

Hi, I'm testing out various ready-made software packages for neural style transfer that use TensorFlow. I'm trying to get this one up and running on my AMD RX 480: https://github.com/cysmith/neural-style-tf

However, I keep running into this error:

Memory access fault by GPU node-1 (Agent handle: 0x559d51a74e00) on address 0x5e2206000. Reason: Page not present or supervisor privilege.

My system is pretty basic:

Ubuntu 18.x Desktop
ROCm 1.9.2

Here is the output from running the script that transfers a demo style to an image:

bash stylize_image.sh ./image_input/lion.jpg ./styles/kandinsky.jpg

Rendering stylized image. This may take a while...
2.7.15rc1 (default, Nov 12 2018, 14:31:15) 
[GCC 7.3.0]
WARNING:tensorflow:From /home/gateway/.local/lib/python2.7/site-packages/tensorflow/python/ops/distributions/distribution.py:265: __init__ (from tensorflow.python.ops.distributions.distribution) is deprecated and will be removed after 2019-01-01.
Instructions for updating:
The TensorFlow Distributions library has moved to TensorFlow Probability (https://github.com/tensorflow/probability). You should update all references to use `tfp.distributions` instead of `tf.distributions`.
WARNING:tensorflow:From /home/gateway/.local/lib/python2.7/site-packages/tensorflow/python/ops/distributions/bernoulli.py:169: __init__ (from tensorflow.python.ops.distributions.kullback_leibler) is deprecated and will be removed after 2019-01-01.
Instructions for updating:
The TensorFlow Distributions library has moved to TensorFlow Probability (https://github.com/tensorflow/probability). You should update all references to use `tfp.distributions` instead of `tf.distributions`.

---- RENDERING SINGLE IMAGE ----

2018-12-22 10:39:13.204832: I tensorflow/core/platform/cpu_feature_guard.cc:141] Your CPU supports instructions that this TensorFlow binary was not compiled to use: SSE4.1 SSE4.2
2018-12-22 10:39:13.205173: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1530] Found device 0 with properties: 
name: Ellesmere [Radeon RX 470/480]
AMDGPU ISA: gfx803
memoryClockRate (GHz) 1.29
pciBusID 0000:01:00.0
Total memory: 4.00GiB
Free memory: 3.75GiB
2018-12-22 10:39:13.205189: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1641] Adding visible gpu devices: 0
2018-12-22 10:39:13.205206: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1051] Device interconnect StreamExecutor with strength 1 edge matrix:
2018-12-22 10:39:13.205212: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1057]      0 
2018-12-22 10:39:13.205217: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1070] 0:   N 
2018-12-22 10:39:13.205249: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1189] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 3540 MB memory) -> physical GPU (device: 0, name: Ellesmere [Radeon RX 470/480], pci bus id: 0000:01:00.0)

BUILDING VGG-19 NETWORK
loading model weights...
constructing layers...
LAYER GROUP 1
--conv1_1 | shape=(1, 512, 512, 64) | weights_shape=(3, 3, 3, 64)
--relu1_1 | shape=(1, 512, 512, 64) | bias_shape=(64,)
--conv1_2 | shape=(1, 512, 512, 64) | weights_shape=(3, 3, 64, 64)
--relu1_2 | shape=(1, 512, 512, 64) | bias_shape=(64,)
--pool1   | shape=(1, 256, 256, 64)
LAYER GROUP 2
--conv2_1 | shape=(1, 256, 256, 128) | weights_shape=(3, 3, 64, 128)
--relu2_1 | shape=(1, 256, 256, 128) | bias_shape=(128,)
--conv2_2 | shape=(1, 256, 256, 128) | weights_shape=(3, 3, 128, 128)
--relu2_2 | shape=(1, 256, 256, 128) | bias_shape=(128,)
--pool2   | shape=(1, 128, 128, 128)
LAYER GROUP 3
--conv3_1 | shape=(1, 128, 128, 256) | weights_shape=(3, 3, 128, 256)
--relu3_1 | shape=(1, 128, 128, 256) | bias_shape=(256,)
--conv3_2 | shape=(1, 128, 128, 256) | weights_shape=(3, 3, 256, 256)
--relu3_2 | shape=(1, 128, 128, 256) | bias_shape=(256,)
--conv3_3 | shape=(1, 128, 128, 256) | weights_shape=(3, 3, 256, 256)
--relu3_3 | shape=(1, 128, 128, 256) | bias_shape=(256,)
--conv3_4 | shape=(1, 128, 128, 256) | weights_shape=(3, 3, 256, 256)
--relu3_4 | shape=(1, 128, 128, 256) | bias_shape=(256,)
--pool3   | shape=(1, 64, 64, 256)
LAYER GROUP 4
--conv4_1 | shape=(1, 64, 64, 512) | weights_shape=(3, 3, 256, 512)
--relu4_1 | shape=(1, 64, 64, 512) | bias_shape=(512,)
--conv4_2 | shape=(1, 64, 64, 512) | weights_shape=(3, 3, 512, 512)
--relu4_2 | shape=(1, 64, 64, 512) | bias_shape=(512,)
--conv4_3 | shape=(1, 64, 64, 512) | weights_shape=(3, 3, 512, 512)
--relu4_3 | shape=(1, 64, 64, 512) | bias_shape=(512,)
--conv4_4 | shape=(1, 64, 64, 512) | weights_shape=(3, 3, 512, 512)
--relu4_4 | shape=(1, 64, 64, 512) | bias_shape=(512,)
--pool4   | shape=(1, 32, 32, 512)
LAYER GROUP 5
--conv5_1 | shape=(1, 32, 32, 512) | weights_shape=(3, 3, 512, 512)
--relu5_1 | shape=(1, 32, 32, 512) | bias_shape=(512,)
--conv5_2 | shape=(1, 32, 32, 512) | weights_shape=(3, 3, 512, 512)
--relu5_2 | shape=(1, 32, 32, 512) | bias_shape=(512,)
--conv5_3 | shape=(1, 32, 32, 512) | weights_shape=(3, 3, 512, 512)
--relu5_3 | shape=(1, 32, 32, 512) | bias_shape=(512,)
--conv5_4 | shape=(1, 32, 32, 512) | weights_shape=(3, 3, 512, 512)
--relu5_4 | shape=(1, 32, 32, 512) | bias_shape=(512,)
--pool5   | shape=(1, 16, 16, 512)
Memory access fault by GPU node-1 (Agent handle: 0x559701510fa0) on address 0x5e2203000. Reason: Page not present or supervisor privilege.
stylize_image.sh: line 44: 19003 Aborted                 (core dumped) python neural_style.py --content_img "${content_filename}" --content_img_dir "${content_dir}" --style_imgs "${style_filename}" --style_imgs_dir "${style_dir}" --device "${device}" --verbose

Please note that I'm just getting into programming and testing with neural styles, and I had a bunch of RX 480 cards lying around to test with.

Any suggestions?

Install dependency problem

The newest ROCR update, which ships two packages instead of one, causes an install problem for me:
hsa-rocr-dev depends on hsakmt-roct-dev (= 2.0.0); however: Package hsakmt-roct-dev is not installed.

I do have the latest ROCK (1.0 release) installed, which should have included ROCT.

hsa_amd_agent_memory_pool_get_info segfault for self

When using hc::accelerator::get_peers(), I get a segmentation fault (see backtrace.txt).

The problem appears to originate from here:

  const core::Runtime::LinkInfo link_info =
      core::Runtime::runtime_singleton_->GetLinkInfo(node_id_from, node_id_to);

src/core/runtime/amd_memory_region.cpp:374

const Runtime::LinkInfo& Runtime::GetLinkInfo(uint32_t node_id_from,
                                              uint32_t node_id_to) {
  return link_matrix_[GetIndexLinkInfo(node_id_from, node_id_to)];

src/core/runtime/runtime.cpp:266

(gdb) print core::Runtime::runtime_singleton_->GetIndexLinkInfo(0,0)
$3 = 0
(gdb) print core::Runtime::runtime_singleton_->link_matrix_ 
$4 = std::vector of length 0, capacity 0

`sample/vector_copy` fails: 'Create the program failed.'

I am trying to get HSA/ROC working on an A10-7700K. Building and installing ROCK and ROCT works.
With a stock 4.10 kernel, initializing the HSA runtime fails. With the patched ROCK kernel, things fail a bit later:

 # ./vector_copy
Initializing the hsa runtime succeeded.
Checking finalizer 1.0 extension support succeeded.
Generating function table for finalizer succeeded.
Getting a gpu agent succeeded.
Querying the agent name succeeded.
The agent name is gfx700.
Querying the agent maximum queue size succeeded.
The maximum queue size is 131072.
Creating the queue succeeded.
"Obtaining machine model" succeeded.
"Getting agent profile" succeeded.
Create the program failed.

strace says:

write(1, "Creating the queue succeeded.\n", 30Creating the queue succeeded.
) = 30
write(1, "\"Obtaining machine model\" succee"..., 37"Obtaining machine model" succeeded.
) = 37
write(1, "\"Getting agent profile\" succeede"..., 35"Getting agent profile" succeeded.
) = 35
open("vector_copy_full.brig", O_RDONLY) = 5
fstat(5, {st_mode=S_IFREG|0644, st_size=3456, ...}) = 0
fstat(5, {st_mode=S_IFREG|0644, st_size=3456, ...}) = 0
lseek(5, 0, SEEK_SET)                   = 0
read(5, "HSA BRIG\1\0\0\0\0\0\0\0\200\r\0\0\0\0\0\0\0\0\0\0\0\0\0\0"..., 3456) = 3456
lseek(5, 3456, SEEK_SET)                = 3456
close(5)                                = 0
write(1, "Create the program failed.\n", 27Create the program failed.
) = 27
exit_group(1)                           = ?
+++ exited with 1 +++

Compilation errors

A few things are undeclared:

[ 96%] Building CXX object CMakeFiles/hsa-runtime64.dir/libamdhsacode/amd_hsa_code.cpp.o
/tmp/rocm/runtime/src/core/runtime/runtime.cpp: In member function ‘hsa_status_t core::Runtime::IPCCreate(void*, size_t, hsa_amd_ipc_memory_t*)’:
/tmp/rocm/runtime/src/core/runtime/runtime.cpp:720:56: error: ‘HsaSharedMemoryHandle’ was not declared in this scope
   static_assert(sizeof(hsa_amd_ipc_memory_t) == sizeof(HsaSharedMemoryHandle),
                                                        ^~~~~~~~~~~~~~~~~~~~~
/tmp/rocm/runtime/src/core/runtime/runtime.cpp:722:58: error: expected primary-expression before ‘)’ token
   if (hsaKmtShareMemory(ptr, len, (HsaSharedMemoryHandle*)handle) == HSAKMT_STATUS_SUCCESS)
                                                          ^
/tmp/rocm/runtime/src/core/runtime/runtime.cpp:722:65: error: ‘hsaKmtShareMemory’ was not declared in this scope
   if (hsaKmtShareMemory(ptr, len, (HsaSharedMemoryHandle*)handle) == HSAKMT_STATUS_SUCCESS)
                                                                 ^
/tmp/rocm/runtime/src/core/runtime/runtime.cpp: In member function ‘hsa_status_t core::Runtime::IPCAttach(const hsa_amd_ipc_memory_t*, size_t, uint32_t, core::Agent**, void**)’:
/tmp/rocm/runtime/src/core/runtime/runtime.cpp:735:59: error: expected ‘>’ before ‘HsaSharedMemoryHandle’
     if (hsaKmtRegisterSharedHandle(reinterpret_cast<const HsaSharedMemoryHandle*>(handle),
                                                           ^~~~~~~~~~~~~~~~~~~~~
/tmp/rocm/runtime/src/core/runtime/runtime.cpp:735:59: error: expected ‘(’ before ‘HsaSharedMemoryHandle’
/tmp/rocm/runtime/src/core/runtime/runtime.cpp:735:59: error: ‘HsaSharedMemoryHandle’ was not declared in this scope
/tmp/rocm/runtime/src/core/runtime/runtime.cpp:735:81: error: expected primary-expression before ‘>’ token
     if (hsaKmtRegisterSharedHandle(reinterpret_cast<const HsaSharedMemoryHandle*>(handle),
                                                                                 ^
/tmp/rocm/runtime/src/core/runtime/runtime.cpp:736:89: error: ‘hsaKmtRegisterSharedHandle’ was not declared in this scope
                                    &importAddress, &importSize) != HSAKMT_STATUS_SUCCESS)
                                                                                         ^
/tmp/rocm/runtime/src/core/runtime/runtime.cpp:737:7: error: expected ‘)’ before ‘return’
       return HSA_STATUS_ERROR_INVALID_ARGUMENT;
       ^~~~~~
/tmp/rocm/runtime/src/core/runtime/runtime.cpp:760:64: error: expected ‘>’ before ‘HsaSharedMemoryHandle’
   if (hsaKmtRegisterSharedHandleToNodes(reinterpret_cast<const HsaSharedMemoryHandle*>(handle),
                                                                ^~~~~~~~~~~~~~~~~~~~~
/tmp/rocm/runtime/src/core/runtime/runtime.cpp:760:64: error: expected ‘(’ before ‘HsaSharedMemoryHandle’
/tmp/rocm/runtime/src/core/runtime/runtime.cpp:760:64: error: ‘HsaSharedMemoryHandle’ was not declared in this scope
/tmp/rocm/runtime/src/core/runtime/runtime.cpp:760:86: error: expected primary-expression before ‘>’ token
   if (hsaKmtRegisterSharedHandleToNodes(reinterpret_cast<const HsaSharedMemoryHandle*>(handle),
                                                                                      ^
/tmp/rocm/runtime/src/core/runtime/runtime.cpp:762:72: error: ‘hsaKmtRegisterSharedHandleToNodes’ was not declared in this scope
                                         nodes) != HSAKMT_STATUS_SUCCESS)
                                                                        ^
/tmp/rocm/runtime/src/core/runtime/runtime.cpp:763:5: error: expected ‘)’ before ‘return’
     return HSA_STATUS_ERROR_INVALID_ARGUMENT;
     ^~~~~~
/tmp/rocm/runtime/src/core/runtime/runtime.cpp: In member function ‘hsa_status_t core::Runtime::IPCCreate(void*, size_t, hsa_amd_ipc_memory_t*)’:
/tmp/rocm/runtime/src/core/runtime/runtime.cpp:726:1: error: control reaches end of non-void function [-Werror=return-type]
 }
 ^
cc1plus: all warnings being treated as errors

hsainfo utility

It would be nice to have an hsainfo utility that shows information about the HSA setup on the current system: whether HSA is functional on my platform, what features it provides, and, if it is not functional, why not.

In sample/vector_copy I get "Getting a gpu agent failed.". By hacking in a few printfs I already figured out that there seem to be no HSA devices available, but I am so far unable to figure out why that is - my kernel seems to detect everything just fine:

# uname -p
AMD A10-7800 Radeon R7, 12 Compute Cores 4C+8G
# dmesg | tail
[   75.584986] Found CRAT image with size=1440
[   75.584990] Parsing CRAT table with 1 nodes
[   75.584992] Found CU entry in CRAT table with proximity_domain=0 caps=0
[   75.584993] CU CPU: cores=4 id_base=16
[   75.584995] Found CU entry in CRAT table with proximity_domain=0 caps=0
[   75.584996] CU GPU: simds=32 id_base=-2147483648
[   75.584997] Found memory entry in CRAT table with proximity_domain=0
[   75.584999] Found memory entry in CRAT table with proximity_domain=0
[   75.585000] Found memory entry in CRAT table with proximity_domain=0
[   75.585001] Found memory entry in CRAT table with proximity_domain=0
[   75.585002] Found cache entry in CRAT table with processor_id=16
[   75.585003] Found cache entry in CRAT table with processor_id=16
[   75.585004] Found cache entry in CRAT table with processor_id=16
[   75.585005] Found cache entry in CRAT table with processor_id=17
[   75.585006] Found cache entry in CRAT table with processor_id=18
[   75.585006] Found cache entry in CRAT table with processor_id=18
[   75.585007] Found cache entry in CRAT table with processor_id=18
[   75.585008] Found cache entry in CRAT table with processor_id=19
[   75.585009] Found TLB entry in CRAT table (not processing)
[   75.585010] Found TLB entry in CRAT table (not processing)
[   75.585010] Found TLB entry in CRAT table (not processing)
[   75.585011] Found TLB entry in CRAT table (not processing)
[   75.585012] Found TLB entry in CRAT table (not processing)
[   75.585013] Found TLB entry in CRAT table (not processing)
[   75.585013] Found TLB entry in CRAT table (not processing)
[   75.585014] Found TLB entry in CRAT table (not processing)
[   75.585015] Found TLB entry in CRAT table (not processing)
[   75.585016] Found TLB entry in CRAT table (not processing)
[   75.585017] Creating topology SYSFS entries
[   75.585046] Finished initializing topology ret=0
[   75.587499] kfd kfd: Initialized module
# cat /sys/class/kfd/kfd/topology/nodes/0/properties
cpu_cores_count 4
simd_count 32
mem_banks_count 2
caches_count 3
io_links_count 0
cpu_core_id_base 16
simd_id_base 2147483648
max_waves_per_simd 40
lds_size_in_kb 64
gds_size_in_kb 0
wave_front_size 64
array_count 2
simd_arrays_per_engine 0
cu_per_simd_array 16
simd_per_cu 4
max_slots_scratch_cu 128
vendor_id 0
device_id 0
location_id 0
max_engine_clk_ccompute 3500
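
As a rough illustration of the kind of checks such a utility could start with, here is a sketch that looks at the /dev/kfd device node and the sysfs topology path shown above. The function name kfd_report and the wording of the findings are hypothetical. It does not call the HSA runtime, so it cannot explain why the runtime itself finds no GPU agent.

```python
import os
from pathlib import Path


def kfd_report(dev="/dev/kfd", topology="/sys/class/kfd/kfd/topology/nodes"):
    """Return human-readable findings about the KFD device node and topology nodes."""
    findings = []
    if not os.path.exists(dev):
        findings.append(f"{dev} missing: is the kfd module loaded?")
    elif not os.access(dev, os.R_OK | os.W_OK):
        findings.append(f"{dev} exists but is not readable and writable by this user")
    else:
        findings.append(f"{dev} is accessible")

    root = Path(topology)
    nodes = sorted(root.glob("*/properties")) if root.is_dir() else []
    if not nodes:
        findings.append(f"no topology nodes under {topology}")
    for path in nodes:
        try:
            lines = path.read_text().splitlines()
        except OSError as exc:
            findings.append(f"node {path.parent.name}: cannot read properties ({exc})")
            continue
        # Each line is 'key value'; keep both as strings for display.
        props = dict(l.split()[:2] for l in lines if len(l.split()) >= 2)
        findings.append(
            f"node {path.parent.name}: cpu_cores={props.get('cpu_cores_count', '?')} "
            f"simd={props.get('simd_count', '?')} "
            f"mem_banks={props.get('mem_banks_count', '?')}"
        )
    return findings


for line in kfd_report():
    print(line)
```

Both paths are parameters so the function can be pointed at a copy of the sysfs tree taken from another machine.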

Cannot build Runtime - sp3 missing and not found in build instructions

"The core runtime requires the sp3.a library to be able to compile on x86_64 architectures. The binaries for the sp3.a library can be found on the amd-codexl-analyzer GitHub repository:"

/home/spstarr/tmp/amd/ROCR-Runtime/src/core/runtime/amd_gpu_agent.cpp:59:27: fatal error: utils/sp3/sp3.h: No such file or directory
#include "utils/sp3/sp3.h"
^
However, the GitHub repo named in the build instructions does not contain the sp3 libraries or include headers.

Google's history shows https://github.com/RadeonOpenCompute/ROCR-Runtime/blob/master/src/utils/sp3/sp3.h but this file no longer exists.

Can you please tell me where it can be found now?

Thanks.

.deb build with cpack doesn't create "hsa" directory in /opt/rocm/include

When building this package from source on Ubuntu 20.04 (gcc 9.3.0, cmake 3.16.3, ROCm tag roc-3.3.0 for the source clone), after cloning all ROCm-related packages with repo as described in the documentation, I can successfully package a .deb by following these steps (roughly):

  • repo init + repo sync
  • build and install hsakmt-roct + hsakmt-roct-dev to /opt/rocm
  • configure an out-of-source ROCR-Runtime build using cmake -DCPACK_PACKAGING_INSTALL_PREFIX=/opt/rocm -DCMAKE_INSTALL_PREFIX=/opt/rocm -DCMAKE_BUILD_TYPE=Release ../src
  • make (succeeds)
  • cpack -G DEB (succeeds)

Trying to install the resulting package, however, fails:

root@rocm:/rocm/ROCR-Runtime/build# dpkg -i /rocm/ROCR-Runtime/build/hsa-rocr-dev-1.1.9.0-local-build-1a56c09-Linux.deb
Selecting previously unselected package hsa-rocr-dev.
(Reading database ... 21066 files and directories currently installed.)
Preparing to unpack .../hsa-rocr-dev-1.1.9.0-local-build-1a56c09-Linux.deb ...
Unpacking hsa-rocr-dev (1.1.9.0-local-build-1a56c09) ...
dpkg: error processing archive /rocm/ROCR-Runtime/build/hsa-rocr-dev-1.1.9.0-local-build-1a56c09-Linux.deb (--install):
 unable to create '/opt/rocm/include/hsa/Brig.h.dpkg-new' (while processing './opt/rocm/include/hsa/Brig.h'): No such file or directory
Errors were encountered while processing:
 /rocm/ROCR-Runtime/build/hsa-rocr-dev-1.1.9.0-local-build-1a56c09-Linux.deb

However, if I manually create the directory /opt/rocm/include/hsa, the package installs successfully:

root@rocm:/rocm/ROCR-Runtime/build# mkdir /opt/rocm/include/hsa/
root@rocm:/rocm/ROCR-Runtime/build# dpkg -i /rocm/ROCR-Runtime/build/hsa-rocr-dev-1.1.9.0-local-build-1a56c09-Linux.deb
(Reading database ... 21066 files and directories currently installed.)
Preparing to unpack .../hsa-rocr-dev-1.1.9.0-local-build-1a56c09-Linux.deb ...
Unpacking hsa-rocr-dev (1.1.9.0-local-build-1a56c09) ...
Setting up hsa-rocr-dev (1.1.9.0-local-build-1a56c09) ...
Processing triggers for libc-bin (2.31-0ubuntu9) ...
root@rocm:/rocm/ROCR-Runtime/build# 

Should this directory be created automatically by the .deb, or am I building + installing packages in the wrong order?

libsp3.a for FreeBSD

I am in the process of porting rocr to FreeBSD. Everything compiles already in thunk and runtime (I will send patches after cleanup and testing), however linking the runtime fails due to the missing libsp3.a. Is there a chance to obtain either the library or, under NDA, the sources for compiling it myself for our ports tree?

Thanks so much for this project and your time!

Endianness unset on ppc64el systems

LITTLEENDIAN_CPU is not set on ppc64le systems. The following patch fixes the issue:

diff --git a/src/inc/hsa.h b/src/inc/hsa.h
index 0ed2b68..0342eea 100644
--- a/src/inc/hsa.h
+++ b/src/inc/hsa.h
@@ -80,7 +80,7 @@
 // Try to detect CPU endianness
 #if !defined(LITTLEENDIAN_CPU) && !defined(BIGENDIAN_CPU)
 #if defined(__i386__) || defined(__x86_64__) || defined(_M_IX86) || \
-    defined(_M_X64)
+    defined(_M_X64) || (defined(__PPC64__) && (__BYTE_ORDER__ == __ORDER_LITTLE_ENDIAN__))
 #define LITTLEENDIAN_CPU
 #endif
 #endif

rocminfo: static void amd::MemoryRegion::FreeKfdMemory(void*, size_t): Assertion `status == HSAKMT_STATUS_SUCCESS' failed.

System information

❯ inxi -GSC -xx
System:    Host: ernie Kernel: 5.7.7 x86_64 bits: 64 compiler: gcc v: 10.1.0 Desktop: N/A wm: kwin_x11 dm: SDDM 
           Distro: Gentoo Base System release 2.7 
CPU:       Topology: Quad Core model: AMD Ryzen 5 2400G with Radeon Vega Graphics bits: 64 type: MT MCP arch: Zen 
           L2 cache: 2048 KiB 
           flags: avx avx2 lm nx pae sse sse2 sse3 sse4_1 sse4_2 sse4a ssse3 svm bogomips: 57518 
           Speed: 1352 MHz min/max: 1600/3600 MHz Core speeds (MHz): 1: 1352 2: 1352 3: 2973 4: 1351 5: 1352 6: 1352 7: 2974 
           8: 1352 
Graphics:  Device-1: Advanced Micro Devices [AMD/ATI] Baffin [Radeon RX 550 640SP / RX 560/560X] vendor: ASUSTeK 
           driver: amdgpu v: kernel bus ID: 01:00.0 chip ID: 1002:67ff 
           Device-2: AMD Raven Ridge [Radeon Vega Series / Radeon Vega Mobile Series] vendor: ASUSTeK driver: amdgpu v: kernel 
           bus ID: 0a:00.0 chip ID: 1002:15dd 
           Display: server: X.Org 1.20.8 driver: amdgpu compositor: kwin_x11 resolution: 2560x1080~60Hz 
           OpenGL: renderer: AMD RAVEN (DRM 3.37.0 5.7.7 LLVM 10.0.0) v: 4.6 Mesa 20.1.3 direct render: Yes 

rocminfo is at version 3.5.0.

Problem

When running gdb rocminfo and typing run, I see:

ROCk module is loaded                                             
Able to open /dev/kfd read-write                                
[New Thread 0x7ffff779f700 (LWP 106384)]                        
LoadLib(libhsa-ext-image64.so.1) failed: libhsa-ext-image64.so.1: cannot open shared object file: No such file or directory
=====================                                               
HSA System Attributes                                             
=====================                                                                                                                                                                 
Runtime Version:         1.1                                          
System Timestamp Freq.:  1000.000000MHz                                                                                                                                               
Sig. Max Wait Duration:  18446744073709551615 (0xFFFFFFFFFFFFFFFF) (timestamp count)
Machine Model:           LARGE                                                                                                                                                        
System Endianness:       LITTLE                                     
                                                                                                                                                                                      
==========                                                          
HSA Agents                                                                                                                                                                                                                                                                                                                                                                  
==========                                                        
*******
Agent 1
*******                                                             
  Name:                    AMD Ryzen 5 2400G with Radeon Vega Graphics                                                                                                                                                                                                                                                                                                      
  Uuid:                    CPU-XX                                 
  Marketing Name:          AMD Ryzen 5 2400G with Radeon Vega Graphics                                                                                                                
  Vendor Name:             CPU                                    
  Feature:                 None specified                                                                                                                                             
  Profile:                 FULL_PROFILE                           
  Float Round Mode:        NEAR                                                                                                                                                       
  Max Queue Number:        0(0x0)                             
  Queue Min Size:          0(0x0)                                                                                                                                                     
  Queue Max Size:          0(0x0)                                 
  Queue Type:              MULTI                                                                                                                                                      
  Node:                    0                                      
  Device Type:             CPU
  Cache Info:
    L1:                      32(0x20) KB                          
  Chip ID:                 5597(0x15dd)                                                   
  Cacheline Size:          64(0x40)                             
  Max Clock Freq. (MHz):   3600                                                            
  BDFID:                   2560                                     
  Internal Node ID:        0                                                           
  Compute Unit:            8                                      
  SIMDs per CU:            4                                                    
  Shader Engines:          1                                        
  Shader Arrs. per Eng.:   1                                                                                                                                                          
  WatchPts on Addr. Ranges:4                                        
  Features:                None                                                                                                                                                                                                                                                                                                                                             
  Pool Info:                                                  
    Pool 1                                                                                                                                                                                                                                                                                                                                                                  
      Segment:                 GLOBAL; FLAGS: KERNARG, FINE GRAINED
      Size:                    16776832(0xfffe80) KB              
      Allocatable:             TRUE                               
      Alloc Granule:           4KB                                
      Alloc Alignment:         4KB                                
      Accessible by all:       TRUE                               
  ISA Info:                
    N/A                      
*******                  
Agent 2                  
*******                  
  Name:                    gfx902                             
  Uuid:                    GPU-XX                             
  Marketing Name:          AMD Ryzen 5 2400G with Radeon Vega Graphics
  Vendor Name:             AMD                                
  Feature:                 KERNEL_DISPATCH                    
  Profile:                 FULL_PROFILE                       
  Float Round Mode:        NEAR                               
  Max Queue Number:        128(0x80)                          
  Queue Min Size:          4096(0x1000)                       
  Queue Max Size:          131072(0x20000)                    
  Queue Type:              MULTI                              
  Node:                    0                                  
  Device Type:             GPU                                
  Cache Info:              
    L1:                      16(0x10) KB                        
  Chip ID:                 5597(0x15dd)                       
  Cacheline Size:          64(0x40)                           
  Max Clock Freq. (MHz):   1250                               
  BDFID:                   2560                               
  Internal Node ID:        0                                  
  Compute Unit:            11                                 
  SIMDs per CU:            4                                  
  Shader Engines:          1                                  
  Shader Arrs. per Eng.:   1                                  
  WatchPts on Addr. Ranges:4                                  
  Features:                KERNEL_DISPATCH 
  Fast F16 Operation:      FALSE                              
  Wavefront Size:          64(0x40)                           
  Workgroup Max Size:      1024(0x400)                        
  Workgroup Max Size per Dimension:
    x                        1024(0x400)                        
    y                        1024(0x400)                        
    z                        1024(0x400)                        
  Max Waves Per CU:        160(0xa0)                          
  Max Work-item Per CU:    10240(0x2800)                      
  Grid Max Size:           4294967295(0xffffffff)             
  Grid Max Size per Dimension:
    x                        4294967295(0xffffffff)             
    y                        4294967295(0xffffffff)             
    z                        4294967295(0xffffffff)             
  Max fbarriers/Workgrp:   32                                 
  Pool Info:               
    Pool 1                   
      Segment:                 GROUP                              
      Size:                    64(0x40) KB                        
      Allocatable:             FALSE                              
      Alloc Granule:           0KB                                
      Alloc Alignment:         0KB                                
      Accessible by all:       FALSE                              
  ISA Info:                
    ISA 1                    
      Name:                    amdgcn-amd-amdhsa--gfx902+xnack    
      Machine Models:          HSA_MACHINE_MODEL_LARGE            
      Profiles:                HSA_PROFILE_BASE                   
      Default Rounding Mode:   NEAR                               
      Default Rounding Mode:   NEAR                               
      Fast f16:                TRUE                               
      Workgroup Max Size:      1024(0x400)                        
      Workgroup Max Size per Dimension:
        x                        1024(0x400)                        
        y                        1024(0x400)                        
        z                        1024(0x400)                        
      Grid Max Size:           4294967295(0xffffffff)             
      Grid Max Size per Dimension:
        x                        4294967295(0xffffffff)             
        y                        4294967295(0xffffffff)             
        z                        4294967295(0xffffffff)             
      FBarrier Max Size:       32                                 
*******                  
Agent 3                  
*******                  
  Name:                    gfx803                             
  Uuid:                    GPU-XX                             
  Marketing Name:          Baffin [Radeon RX 550 640SP / RX 560/560X]
  Vendor Name:             AMD                                
  Feature:                 KERNEL_DISPATCH                    
  Profile:                 BASE_PROFILE                       
  Float Round Mode:        NEAR                               
  Max Queue Number:        128(0x80)                          
  Queue Min Size:          4096(0x1000)                       
  Queue Max Size:          131072(0x20000)                    
  Queue Type:              MULTI                              
  Node:                    1                                  
  Device Type:             GPU                                
  Cache Info:              
    L1:                      16(0x10) KB                        
  Chip ID:                 26623(0x67ff)                      
  Cacheline Size:          64(0x40)                           
  Max Clock Freq. (MHz):   1210                               
  BDFID:                   256                                
  Internal Node ID:        1                                  
  Compute Unit:            16                                 
  SIMDs per CU:            4                                  
  Shader Engines:          2                                  
  Shader Arrs. per Eng.:   1                                  
  WatchPts on Addr. Ranges:4                                  
  Features:                KERNEL_DISPATCH 
  Fast F16 Operation:      FALSE                              
  Wavefront Size:          64(0x40)                           
  Workgroup Max Size:      1024(0x400)                        
  Workgroup Max Size per Dimension:
    x                        1024(0x400)                        
    y                        1024(0x400)                        
    z                        1024(0x400)                        
  Max Waves Per CU:        40(0x28)                           
  Max Work-item Per CU:    2560(0xa00)                        
  Grid Max Size:           4294967295(0xffffffff)             
  Grid Max Size per Dimension:
    x                        4294967295(0xffffffff)             
    y                        4294967295(0xffffffff)             
    z                        4294967295(0xffffffff)             
  Max fbarriers/Workgrp:   32                                 
  Pool Info:               
    Pool 1                   
      Segment:                 GLOBAL; FLAGS: COARSE GRAINED      
      Size:                    2097152(0x200000) KB               
      Allocatable:             TRUE                               
      Alloc Granule:           4KB                                
      Alloc Alignment:         4KB                                
      Accessible by all:       FALSE                              
    Pool 2                   
      Segment:                 GROUP                              
      Size:                    64(0x40) KB                        
      Allocatable:             FALSE                              
      Alloc Granule:           0KB                                
      Alloc Alignment:         0KB                                
      Accessible by all:       FALSE                              
  ISA Info:                
    ISA 1                    
      Name:                    amdgcn-amd-amdhsa--gfx803          
      Machine Models:          HSA_MACHINE_MODEL_LARGE            
      Profiles:                HSA_PROFILE_BASE                   
      Default Rounding Mode:   NEAR                               
      Default Rounding Mode:   NEAR                               
      Fast f16:                TRUE                               
      Workgroup Max Size:      1024(0x400)                        
      Workgroup Max Size per Dimension:
        x                        1024(0x400)                        
        y                        1024(0x400)                        
        z                        1024(0x400)                        
      Grid Max Size:           4294967295(0xffffffff)             
      Grid Max Size per Dimension:
        x                        4294967295(0xffffffff)             
        y                        4294967295(0xffffffff)             
        z                        4294967295(0xffffffff)             
      FBarrier Max Size:       32                                 
*** Done ***             
rocminfo: /tmp/portage/dev-libs/rocr-runtime-3.5.0/work/ROCR-Runtime-rocm-3.5.0/src/core/runtime/amd_memory_region.cpp:72: static void amd::MemoryRegion::FreeKfdMemory(void*, size_t): Assertion `status == HSAKMT_STATUS_SUCCESS' failed.

Note the failed assertion.

The backtrace at that point is:

#0  0x00007ffff79abf91 in raise () from /usr/lib64/libc.so.6
No symbol table info available.
#1  0x00007ffff7995537 in abort () from /usr/lib64/libc.so.6
No symbol table info available.
#2  0x00007ffff799540f in ?? () from /usr/lib64/libc.so.6
No symbol table info available.
#3  0x00007ffff79a43e2 in __assert_fail () from /usr/lib64/libc.so.6
No symbol table info available.
#4  0x00007ffff7e257d9 in amd::MemoryRegion::FreeKfdMemory(void*, unsigned long) () from /usr/lib64/libhsa-runtime64.so.1
No symbol table info available.
#5  0x00007ffff7e2660d in amd::MemoryRegion::Free(void*, unsigned long) const () from /usr/lib64/libhsa-runtime64.so.1
No symbol table info available.
#6  0x00007ffff7e68f19 in core::Runtime::FreeMemory(void*) () from /usr/lib64/libhsa-runtime64.so.1
No symbol table info available.
#7  0x00007ffff7e68568 in core::Runtime::RegisterAgent(core::Agent*)::{lambda(void*)#2}::operator()(void*) const () from /usr/lib64/libhsa-runtime64.so.1
No symbol table info available.
#8  0x00007ffff7e70546 in void std::__invoke_impl<void, core::Runtime::RegisterAgent(core::Agent*)::{lambda(void*)#2}&, void*>(std::__invoke_other, core::Runtime::RegisterAgent(core::Agent*)::{lambda(void*)#2}&, void*&&) () from /usr/lib64/libhsa-runtime64.so.1
No symbol table info available.
#9  0x00007ffff7e7025c in std::enable_if<std::__and_<std::is_void<void>, std::__is_invocable<core::Runtime::RegisterAgent(core::Agent*)::{lambda(void*)#2}&, void*> >::value, std::is_void>::type std::__invoke_r<void, core::Runtime::RegisterAgent(core::Agent*)::{lambda(void*)#2}&, void*>(std::__is_invocable&&, (core::Runtime::RegisterAgent(core::Agent*)::{lambda(void*)#2}&)...) () from /usr/lib64/libhsa-runtime64.so.1
No symbol table info available.
#10 0x00007ffff7e6fd65 in std::_Function_handler<void (void*), core::Runtime::RegisterAgent(core::Agent*)::{lambda(void*)#2}>::_M_invoke(std::_Any_data const&, void*&&) () from /usr/lib64/libhsa-runtime64.so.1
No symbol table info available.
#11 0x00007ffff7df9087 in std::function<void (void*)>::operator()(void*) const () from /usr/lib64/libhsa-runtime64.so.1
No symbol table info available.
#12 0x00007ffff7e0e454 in amd::GpuAgent::ReleaseShader(void*, unsigned long) const () from /usr/lib64/libhsa-runtime64.so.1
No symbol table info available.
#13 0x00007ffff7e0d7cb in amd::GpuAgent::~GpuAgent() () from /usr/lib64/libhsa-runtime64.so.1
No symbol table info available.
#14 0x00007ffff7e0d960 in amd::GpuAgent::~GpuAgent() () from /usr/lib64/libhsa-runtime64.so.1
No symbol table info available.
#15 0x00007ffff7e76764 in void DeleteObject::operator()<core::Agent>(core::Agent const*) const () from /usr/lib64/libhsa-runtime64.so.1
No symbol table info available.
#16 0x00007ffff7e736a4 in DeleteObject std::for_each<__gnu_cxx::__normal_iterator<core::Agent**, std::vector<core::Agent*, std::allocator<core::Agent*> > >, DeleteObject>(__gnu_cxx::__normal_iterator<core::Agent**, std::vector<core::Agent*, std::allocator<core::Agent*> > >, __gnu_cxx::__normal_iterator<core::Agent**, std::vector<core::Agent*, std::allocator<core::Agent*> > >, DeleteObject) () from /usr/lib64/libhsa-runtime64.so.1
No symbol table info available.
#17 0x00007ffff7e6dc83 in core::Runtime::Unload() () from /usr/lib64/libhsa-runtime64.so.1
No symbol table info available.
#18 0x00007ffff7e683a3 in core::Runtime::Release() () from /usr/lib64/libhsa-runtime64.so.1
No symbol table info available.
#19 0x00007ffff7e40452 in HSA::hsa_shut_down() () from /usr/lib64/libhsa-runtime64.so.1
No symbol table info available.
#20 0x00007ffff7e8af92 in hsa_shut_down () from /usr/lib64/libhsa-runtime64.so.1
No symbol table info available.
#21 0x000055555555c931 in main (argc=1, argv=0x7fffffffd9f8) at /tmp/portage/dev-util/rocminfo-3.5.0/work/rocminfo-rocm-3.5.0/rocminfo.cc:1167
        err = HSA_STATUS_SUCCESS
        sys_info = {major = 1, minor = 1, timestamp_frequency = 1000000000, max_wait = 18446744073709551615, endianness = HSA_ENDIANNESS_LITTLE, machine_model = HSA_MACHINE_MODEL_LARGE}
        agent_ind = 3

Sadly, debug symbols are missing because ROCR-Runtime's build system seems to override CXXFLAGS and LDFLAGS when building shared libraries; cf. https://bugs.gentoo.org/729898

This is reproducible every time I run rocminfo.

Regression

I never got ROCm to work on this system. Still working on it. :)

Logs

dmesg prints during execution of rocminfo:

[Sat Jul 11 22:49:59 2020] Alloc host visible vram on small bar is not allowed
[Sat Jul 11 22:49:59 2020] Evicting PASID 0x8021 queues
[Sat Jul 11 22:49:59 2020] Evicting PASID 0x8021 queues

Other information

I also see exceptions and segfaults in Clover and ROCm's OpenCL implementation when executing clinfo.

I assume rocminfo is the more low-level command, so I guess first getting that to work without problems might help debugging the OpenCL problems.

HSA_STATUS_ERROR_VARIABLE_ALREADY_DEFINED

If an HSACO binary has been linked from multiple object files, and two of those objects happen to have a local symbol with the same name, then the loader will reject the code object with HSA_STATUS_ERROR_VARIABLE_ALREADY_DEFINED.

This behaviour is a problem because it's common for runtime libraries to include common names in many files, and because at least GCC uses the same names for labels associated with switch conversion and the like. (The recently released CodeBench Lite compiler for GCN uses name mangling to work around this, for now, but that's not an acceptable solution for upstream GCC.)

I found the issues in GCC, but this testcase, loaderbug.zip, demonstrates the problem using only tools included with a ROCm install. I've tested it with the latest Ubuntu packages (hsa-rocr-dev: 1.1.9-45-ge88639f).

Please run ./build.sh, substituting the correct -mcpu flag for your system, then run ./a.out to demonstrate the error.

The testcase contains two instances of the local symbol var, which should not conflict but do. If you exclude the second object file from the link command, the testcase runs to completion with no errors.

The launch.c code is derived from the old "vector_copy" sample code, and does nothing interesting; I include it only to make the testcase easier to try.

Compilation terminated with "fatal error: hsakmt.h: No such file or directory"

I'm unable to build ROCR-Runtime from source. I've also looked at #47.
My build steps are:

Download per instructions here:
https://github.com/RadeonOpenCompute/ROCm#downloading-the-rocm-source-code

Then

cd  ~/ROCm/ROCR-Runtime/src
mkdir build
cd build
cmake ..
make

Error follows:

-rw-rw-r--  1 prj47-rack-76 prj47-rack-76 45307 Feb 28 19:34 Makefile
prj47-rack-76@prj47-rack-76:~/ROCm/ROCR-Runtime/src/build$ make
Scanning dependencies of target hsa-link
[  0%] Built target hsa-link
Scanning dependencies of target hsa-runtime64
[  2%] Building CXX object CMakeFiles/hsa-runtime64.dir/core/util/lnx/os_linux.cpp.o
[  5%] Building CXX object CMakeFiles/hsa-runtime64.dir/core/util/small_heap.cpp.o
[  8%] Building CXX object CMakeFiles/hsa-runtime64.dir/core/util/timer.cpp.o
[ 11%] Building CXX object CMakeFiles/hsa-runtime64.dir/core/runtime/amd_blit_kernel.cpp.o
In file included from /home/prj47-rack-76/ROCm/ROCR-Runtime/src/core/inc/agent.h:54:0,
                 from /home/prj47-rack-76/ROCm/ROCR-Runtime/src/core/inc/blit.h:48,
                 from /home/prj47-rack-76/ROCm/ROCR-Runtime/src/core/inc/amd_blit_kernel.h:50,
                 from /home/prj47-rack-76/ROCm/ROCR-Runtime/src/core/runtime/amd_blit_kernel.cpp:43:
/home/prj47-rack-76/ROCm/ROCR-Runtime/src/core/inc/queue.h:57:20: fatal error: hsakmt.h: No such file or directory
compilation terminated.
CMakeFiles/hsa-runtime64.dir/build.make:134: recipe for target 'CMakeFiles/hsa-runtime64.dir/core/runtime/amd_blit_kernel.cpp.o' failed
make[2]: *** [CMakeFiles/hsa-runtime64.dir/core/runtime/amd_blit_kernel.cpp.o] Error 1
CMakeFiles/Makefile2:104: recipe for target 'CMakeFiles/hsa-runtime64.dir/all' failed
make[1]: *** [CMakeFiles/hsa-runtime64.dir/all] Error 2
Makefile:149: recipe for target 'all' failed
make: *** [all] Error 2
prj47-rack-76@prj47-rack-76:~/ROCm/ROCR-Runtime/src/build$

Support for no exceptions

Google would like to build the ROCR-Runtime internally, but we do not use C++ exceptions. Would it be possible to support this case through some #if __cpp_exceptions macros?
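As a rough illustration of what such a guard could look like, here is a minimal sketch using the standard __cpp_exceptions feature-test macro. The Status enum and ResizeBuffer helper are hypothetical names invented for this example and are not taken from ROCR-Runtime.

```cpp
#include <cassert>
#include <cstddef>
#include <new>
#include <vector>

// Hypothetical status codes, for illustration only (not ROCR's own).
enum class Status { kSuccess, kOutOfMemory };

// Hypothetical helper: resizes a buffer and reports failure as a status code.
// When exceptions are enabled, std::bad_alloc is caught and translated.
// When built with exceptions disabled, __cpp_exceptions is not defined, the
// try/catch is compiled out, and an allocation failure terminates the process.
inline Status ResizeBuffer(std::vector<int>& buffer, std::size_t count) {
#if defined(__cpp_exceptions)
  try {
    buffer.resize(count);
  } catch (const std::bad_alloc&) {
    return Status::kOutOfMemory;
  }
#else
  buffer.resize(count);
#endif
  assert(buffer.size() == count);  // Postcondition on the success path.
  return Status::kSuccess;
}
```

The same source then compiles both with and without exception support; only the failure behaviour differs between the two builds.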

Device attributes

I upgraded the ROCR runtime recently. With the previous version, the vector copy sample reported the agent name as Spectre; now it reports Kaveri. I know that the Kaveri APU contains the Spectre GPU, and applications execute correctly, but I am unsure whether everything is working as it should. Could anyone confirm whether the runtime should return Spectre or Kaveri?

error: clearing an object of non-trivial type; use assignment or value-initialization instead

I get this error while compiling from git.

/home/imyxh/git/ROCm/ROCR-Runtime/src/core/runtime/runtime.cpp: In member function ‘void core::Runtime::SetLinkCount(size_t)’:
/home/imyxh/git/ROCm/ROCR-Runtime/src/core/runtime/runtime.cpp:260:71: error: ‘void* memset(void*, int, size_t)’ clearing an object of non-trivial type ‘__gnu_cxx::__alloc_traits<std::allocator<core::Runtime::LinkInfo>, core::Runtime::LinkInfo>::value_type’ {aka ‘struct core::Runtime::LinkInfo’}; use assignment or value-initialization instead [-Werror=class-memaccess]
          link_matrix_.size() * sizeof(hsa_amd_memory_pool_link_info_t));
                                                                       ^
In file included from /home/imyxh/git/ROCm/ROCR-Runtime/src/core/runtime/runtime.cpp:43:
/home/imyxh/git/ROCm/ROCR-Runtime/src/core/inc/runtime.h:96:10: note: ‘__gnu_cxx::__alloc_traits<std::allocator<core::Runtime::LinkInfo>, core::Runtime::LinkInfo>::value_type’ {aka ‘struct core::Runtime::LinkInfo’} declared here
   struct LinkInfo {
          ^~~~~~~~
cc1plus: all warnings being treated as errors
make[2]: *** [CMakeFiles/hsa-runtime64.dir/build.make:349: CMakeFiles/hsa-runtime64.dir/core/runtime/runtime.cpp.o] Error 1
make[1]: *** [CMakeFiles/Makefile2:73: CMakeFiles/hsa-runtime64.dir/all] Error 2
make: *** [Makefile:152: all] Error 2
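
The compiler message itself suggests the usual remedy: assign value-initialized objects instead of calling memset on a non-trivial type. A minimal sketch of that pattern follows. The LinkInfo struct here is a hypothetical stand-in with made-up fields, not the definition from runtime.h, and ResetLinks is an illustrative helper rather than the project's actual fix.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for core::Runtime::LinkInfo. The user-provided
// constructor makes the type non-trivial, which is what triggers
// -Wclass-memaccess when memset is applied to it.
struct LinkInfo {
  LinkInfo() : num_hop(0), min_bandwidth(0) {}
  std::uint32_t num_hop;
  std::uint32_t min_bandwidth;
};

// Resets the table to `count` default-constructed entries by assignment,
// rather than by calling memset on the vector's storage.
inline void ResetLinks(std::vector<LinkInfo>& links, std::size_t count) {
  links.assign(count, LinkInfo());
  assert(links.size() == count);
}
```

Because the reset goes through the constructor, it stays correct if the struct later gains members that must not be zero-filled byte by byte.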

hsa_executable_get_symbol_by_name is broken for V3 code metadata?

I am the maintainer of Julia's AMDGPU computing stack, and was recently upgrading our ROCR-Runtime wrapper package HSARuntime.jl to support builds of Julia that use LLVM >= 7. Those versions of LLVM switch the default code metadata format from V2 to V3, and the two formats take separate code paths within ROCR. During this upgrade I ran into an issue where hsa_executable_get_symbol_by_name fails to query a symbol in an executable using V3 metadata; the specific error is HSA_STATUS_ERROR_INVALID_SYMBOL_NAME. By examining my emitted binary with readelf, I saw the two symbols that I should expect to see according to the LLVM AMDGPU User Guide, symbol_name and symbol_name.kd, with the correct ELF type and section. However, passing either symbol_name or symbol_name.kd to the above function failed.

I was eventually able to create a workaround for this issue by iterating all agent symbols with hsa_executable_iterate_agent_symbols and selecting the first kernel symbol found (which currently is fine, since our stack's compiler only emits one agent kernel per binary), after seeing a similar approach used by HIP. While I'm okay with leaving this workaround in place for now, I'd like to ask if this behavior is intended, or is it a bug in ROCR that isn't tested for?

Thanks!

Question: Finalizer Extension in ROCm 1.9

Hi,

With ROCm 1.9, the package "hsa-ext-rocr-dev" no longer contains the finalizer extension (libhsa-ext-finalize64.so), and the vector copy sample was removed from the runtime.
Did you remove this extension completely or was it moved to a different package?

Regards
Michael

Page not present or supervisor privilege

Since ROCm 2.0 we see roughly the same failure as #50, but in rocPRIM and only on older GPUs (specifically S9300, Fiji architecture). Ubuntu 16.04.5, kernel 4.13. It works perfectly on a newer GPU.

Memory access fault by GPU node-2 (Agent handle: 0x1edfe60) on address 0x1a0490d000. Reason: Page not present or supervisor privilege. Aborted (core dumped)

The fault occurs irregularly, which makes it difficult to debug. Since it worked before and still works on newer GPUs, we are asking for help pinpointing the problem. We are filing the issue here in ROCR-Runtime because it looks like a runtime error. It could also be a problem with the compiler or with rocPRIM, but understanding why it is not ROCr would help narrow it down.

To replicate:

  • run rocPRIM's tests on a Fiji GPU several times

core::Signal::WaitAny never gets the signal

I am trying to run OpenCL applications on the APU that comes with Ryzen 3400G (i.e., gfx902+xnack). Sometimes it works and returns correct results, but sometimes it blocks forever. I traced it with gdb and found that the main thread is blocked in clCreateCommandQueue(), waiting for events:

#1  do_futex_wait (sem=sem@entry=0x627148, abstime=0x0) at sem_waitcommon.c:111
#2  0x00007ffff6f6c8d4 in __new_sem_wait_slow (sem=0x627148, abstime=0x0) at sem_waitcommon.c:181
#3  0x00007ffff6f6c97a in __new_sem_wait (sem=<optimized out>) at sem_wait.c:29
#4  0x00007ffff688df60 in amd::Semaphore::wait() () from /opt/rocm/lib/../opencl/lib/libamdocl64.so
#5  0x00007ffff688dd4a in amd::Monitor::wait() () from /opt/rocm/lib/../opencl/lib/libamdocl64.so
#6  0x00007ffff68a3bb4 in amd::HostQueue::HostQueue(amd::Context&, amd::Device&, unsigned long, unsigned int, amd::CommandQueue::Priority) ()
   from /opt/rocm/lib/../opencl/lib/libamdocl64.so
#7  0x00007ffff687fa56 in clCreateCommandQueueWithProperties () from /opt/rocm/lib/../opencl/lib/libamdocl64.so
#8  0x00007ffff687fd28 in clCreateCommandQueue () from /opt/rocm/lib/../opencl/lib/libamdocl64.so
#9  0x0000000000401193 in main ()

It is waiting for updates from another thread, which is thread 2 in gdb. The trace of thread 2 is like this:

#0  0x00007ffff7276f47 in ioctl () at ../sysdeps/unix/syscall-template.S:84
#1  0x00007ffff6330654 in kmtIoctl () from /usr/local/lib/libhsakmt.so.1
#2  0x00007ffff632a84f in hsaKmtWaitOnMultipleEvents () from /usr/local/lib/libhsakmt.so.1
#3  0x00007ffff65aa2a3 in core::Signal::WaitAny(unsigned int, hsa_signal_s const*, hsa_signal_condition_t const*, long const*, unsigned long, hsa_wait_state_t, long*) ()
   from /opt/rocm/lib/../opencl/lib/../../lib/libhsa-runtime64.so.1
#4  0x00007ffff6591516 in AMD::hsa_amd_signal_wait_any(unsigned int, hsa_signal_s*, hsa_signal_condition_t*, long*, unsigned long, hsa_wait_state_t, long*) ()
   from /opt/rocm/lib/../opencl/lib/../../lib/libhsa-runtime64.so.1
#5  0x00007ffff65a256a in core::Runtime::AsyncEventsLoop(void*) () from /opt/rocm/lib/../opencl/lib/../../lib/libhsa-runtime64.so.1
#6  0x00007ffff6561217 in os::ThreadTrampoline(void*) () from /opt/rocm/lib/../opencl/lib/../../lib/libhsa-runtime64.so.1
#7  0x00007ffff6f646ba in start_thread (arg=0x7ffff5adf700) at pthread_create.c:333
#8  0x00007ffff728141d in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:109

If I run with HSA_ENABLE_INTERRUPT=0, I get the following instead:

#0  0x00007ffff66e6232 in core::Signal::WaitAny(unsigned int, hsa_signal_s const*, hsa_signal_condition_t const*, long const*, unsigned long, hsa_wait_state_t, long*) ()
   from /opt/rocm/lib/libhsa-runtime64.so.1
#1  0x00007ffff66cd516 in AMD::hsa_amd_signal_wait_any(unsigned int, hsa_signal_s*, hsa_signal_condition_t*, long*, unsigned long, hsa_wait_state_t, long*) ()
   from /opt/rocm/lib/libhsa-runtime64.so.1
#2  0x00007ffff66de56a in core::Runtime::AsyncEventsLoop(void*) () from /opt/rocm/lib/libhsa-runtime64.so.1
#3  0x00007ffff669d217 in os::ThreadTrampoline(void*) () from /opt/rocm/lib/libhsa-runtime64.so.1
#4  0x00007ffff79ab6ba in start_thread (arg=0x7ffeef514700) at pthread_create.c:333
#5  0x00007ffff6c5841d in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:109

Some printf debugging shows that the program is stuck in an infinite loop because it never receives the signal.

Originally I thought it was a problem in ROCT, so I posted an issue here: ROCm/ROCT-Thunk-Interface#56. Following the suggestion there, I ran with HSA_ENABLE_SDMA=0, but it did not change anything.

I am quite sure that it is not directly related to OpenCL. I also tried to run HIP applications (compiled with an unofficial hipcc hacked by me) and a similar problem happens. The thread launching kernels gets blocked here:

#0  0x00007ffff6c3b827 in sched_yield () at ../sysdeps/unix/syscall-template.S:84
#1  0x00007ffff66b2c65 in amd::AqlQueue::ExecutePM4(unsigned int*, unsigned long) () from /opt/rocm/lib/libhsa-runtime64.so.1
#2  0x00007ffff66a87e8 in amd::GpuAgent::InvalidateCodeCaches() () from /opt/rocm/lib/libhsa-runtime64.so.1
#3  0x00007ffff66b50be in amd::LoaderContext::SegmentAlloc(amdgpu_hsa_elf_segment_t, hsa_agent_s, unsigned long, unsigned long, bool) ()
   from /opt/rocm/lib/libhsa-runtime64.so.1
#4  0x00007ffff66eed15 in amd::hsa::loader::ExecutableImpl::LoadSegmentsV2(hsa_agent_s, amd::hsa::code::AmdHsaCode const*) () from /opt/rocm/lib/libhsa-runtime64.so.1
#5  0x00007ffff66ef40e in amd::hsa::loader::ExecutableImpl::LoadSegments(hsa_agent_s, amd::hsa::code::AmdHsaCode const*, unsigned int) ()
   from /opt/rocm/lib/libhsa-runtime64.so.1
#6  0x00007ffff66f24f3 in amd::hsa::loader::ExecutableImpl::LoadCodeObject(hsa_agent_s, hsa_code_object_s, unsigned long, char const*, hsa_loaded_code_object_s*) ()
   from /opt/rocm/lib/libhsa-runtime64.so.1
#7  0x00007ffff66c73e7 in HSA::hsa_executable_load_agent_code_object(hsa_executable_s, hsa_agent_s, hsa_code_object_reader_s, char const*, hsa_loaded_code_object_s*) ()
   from /opt/rocm/lib/libhsa-runtime64.so.1
#8  0x00007ffff742f845 in roc::LightningProgram::setKernels(amd::option::Options*, void*, unsigned long) () from /opt/rocm-3.5.1/hip/lib/libamdhip64.so.3
#9  0x00007ffff73d8d45 in device::Program::linkImplLC(amd::option::Options*) () from /opt/rocm-3.5.1/hip/lib/libamdhip64.so.3
#10 0x00007ffff73d9975 in device::Program::build(std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&, char const*, amd::option::Options*) () from /opt/rocm-3.5.1/hip/lib/libamdhip64.so.3
#11 0x00007ffff73ec5fc in amd::Program::build(std::vector<amd::Device*, std::allocator<amd::Device*> > const&, char const*, void (*)(_cl_program*, void*), void*, bool) ()
   from /opt/rocm-3.5.1/hip/lib/libamdhip64.so.3
#12 0x00007ffff7377e15 in PlatformState::getFunc(void const*, int) () from /opt/rocm-3.5.1/hip/lib/libamdhip64.so.3
#13 0x00007ffff7380653 in hipLaunchKernel () from /opt/rocm-3.5.1/hip/lib/libamdhip64.so.3
#14 0x0000000000400f8e in main ()

Thread 2 in gdb shows the same trace as OpenCL.

When I run HIP applications, execution sometimes gets past this point but then blocks at hipDeviceSynchronize() instead. Thread 2 shows the exact same trace. However, if the first hipDeviceSynchronize() completes, all later hipDeviceSynchronize() calls complete as well.

To conclude,

  1. Thread 2 is waiting for a signal, but it never receives it
  2. The main thread gets blocked only because it does not get an update from thread 2
  3. The same problem can happen with OpenCL or HIP, when creating a command queue, launching a kernel, or synchronizing the device
  4. The problem is random. Sometimes the program (both OpenCL and HIP) finishes and returns correct results
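
One generic way to turn a silent hang like this into something diagnosable is to bound the wait with a deadline. The sketch below illustrates that idea with a plain std::atomic; it does not use the HSA API. WaitForValue is a hypothetical helper written for this example and is not how ROCR implements core::Signal::WaitAny.

```cpp
#include <atomic>
#include <cassert>
#include <chrono>
#include <thread>

// Hypothetical helper, not part of ROCR: polls `signal` until it equals
// `expected` or until `timeout` elapses. Returns true if the value was
// observed, false if the deadline passed first.
inline bool WaitForValue(const std::atomic<long>& signal, long expected,
                         std::chrono::milliseconds timeout) {
  assert(timeout.count() >= 0);
  const auto deadline = std::chrono::steady_clock::now() + timeout;
  while (signal.load(std::memory_order_acquire) != expected) {
    if (std::chrono::steady_clock::now() >= deadline) {
      return false;  // The value never arrived within the time budget.
    }
    std::this_thread::yield();  // Let other threads run between polls.
  }
  return true;
}
```

A caller can log or dump state when the function returns false, instead of blocking forever as in the traces above.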

Any help in fixing the bug is appreciated. Thank you.

SIGSEGV, Segmentation fault from /opt/rocm/hsa/lib/libhsa-runtime64.so.1

Hi,

Recently I have had difficulties with external programs:

prompt:~ 0#  clinfo 
Segmentation fault
prompt:~ 139# gdb clinfo 
GNU gdb (GDB; openSUSE Tumbleweed) 8.3
Copyright (C) 2019 Free Software Foundation, Inc.
License GPLv3+: GNU GPL version 3 or later <http://gnu.org/licenses/gpl.html>
This is free software: you are free to change and redistribute it.
There is NO WARRANTY, to the extent permitted by law.
Type "show copying" and "show warranty" for details.
This GDB was configured as "x86_64-suse-linux".
Type "show configuration" for configuration details.
For bug reporting instructions, please see:
<http://bugs.opensuse.org/>.
Find the GDB manual and other documentation resources online at:
    <http://www.gnu.org/software/gdb/documentation/>.

For help, type "help".
Type "apropos word" to search for commands related to "word"...
Reading symbols from clinfo...
Missing separate debuginfo for /usr/bin/clinfo
Try: zypper install -C "debuginfo(build-id)=552db0e18d0b01ce77fea81502069ff4470c6f84"
(No debugging symbols found in clinfo)
(gdb) r
Starting program: /usr/bin/clinfo 
[..]
[New Thread 0x7ffff1cfc700 (LWP 13343)]
[New Thread 0x7ffff14fb700 (LWP 13344)]
[New Thread 0x7ffff0cfa700 (LWP 13345)]
[New Thread 0x7ffee3fff700 (LWP 13346)]
[New Thread 0x7ffee37fe700 (LWP 13347)]
[New Thread 0x7ffee2ffd700 (LWP 13348)]
[New Thread 0x7ffee27fc700 (LWP 13349)]
[New Thread 0x7ffee1ffb700 (LWP 13350)]

Thread 1 "clinfo" received signal SIGSEGV, Segmentation fault.
0x00007ffff2cab8d5 in std::_Function_handler<core::Queue* (), amd::GpuAgent::InitDma()::{lambda()#1}>::_M_invoke(std::_Any_data const&) () from /opt/rocm/hsa/lib/libhsa-runtime64.so.1

The same happens with hashcat, for example.

prompt:~ 0#  rpm -qf /opt/rocm/hsa/lib/libhsa-runtime64.so.1
hsa-rocr-dev-1.1.9_99_g835b876a-1.x86_64
prompt:~ 0#  uname -rsp  
Linux 5.2.8-1-default x86_64
prompt:~ 0#  grep -w NAME /etc/os-release 
NAME="openSUSE Tumbleweed"
prompt:~ 0# cat /proc/cpuinfo  | grep 'model name' | uniq
model name      : AMD Ryzen 5 2400G with Radeon Vega Graphics
prompt:~ 0#

Stock AMDGPU driver is loaded. It did work with kernel 5.1.x and the previous set of RPMs from this project.

The code at https://github.com/RadeonOpenCompute/ROCR-Runtime/blob/8ea15e12ee4760dc6ec394841a5de8e8b9e8c845/src/core/runtime/amd_gpu_agent.cpp#L570 didn't help me.

Any advice?

Thanks, Dirk

Compile fail on Ubuntu

When building ROCR on Ubuntu 16.04/18.04, I get many errors about hsa.h symbols not being declared or defined:

yuq@yuq-Aspire-4738G:~/workspace/projects/amd/rocm/ROCR-Runtime/src/build$ cmake -D CMAKE_PREFIX_PATH=/opt/rocm ..
-- The C compiler identification is GNU 7.3.0
-- The CXX compiler identification is GNU 7.3.0
-- Check for working C compiler: /usr/bin/cc
-- Check for working C compiler: /usr/bin/cc -- works
-- Detecting C compiler ABI info
-- Detecting C compiler ABI info - done
-- Detecting C compile features
-- Detecting C compile features - done
-- Check for working CXX compiler: /usr/bin/c++
-- Check for working CXX compiler: /usr/bin/c++ -- works
-- Detecting CXX compiler ABI info
-- Detecting CXX compiler ABI info - done
-- Detecting CXX compile features
-- Detecting CXX compile features - done
-- Found LibElf: /usr/lib/x86_64-linux-gnu/libelf.so
-- Performing Test ELF_GETSHDRSTRNDX
-- Performing Test ELF_GETSHDRSTRNDX - Success
-- Looking for __NR_memfd_create
-- Looking for __NR_memfd_create - found
-- Configuring done
-- Generating done
-- Build files have been written to: /home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/build
yuq@yuq-Aspire-4738G:~/workspace/projects/amd/rocm/ROCR-Runtime/src/build$ make
Scanning dependencies of target hsa-runtime64
[  2%] Building CXX object CMakeFiles/hsa-runtime64.dir/core/util/lnx/os_linux.cpp.o
[  5%] Building CXX object CMakeFiles/hsa-runtime64.dir/core/util/small_heap.cpp.o
[  8%] Building CXX object CMakeFiles/hsa-runtime64.dir/core/util/timer.cpp.o
[ 11%] Building CXX object CMakeFiles/hsa-runtime64.dir/core/runtime/amd_blit_kernel.cpp.o
In file included from /home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/agent.h:53:0,
                 from /home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/blit.h:48,
                 from /home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/amd_blit_kernel.h:50,
                 from /home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/runtime/amd_blit_kernel.cpp:43:
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/isa.h:63:10: error: ‘hsa_wavefront_t’ does not name a type; did you mean ‘hsa_agent_t’?
   static hsa_wavefront_t Handle(const Wavefront *object) {
          ^~~~~~~~~~~~~~~
          hsa_agent_t
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/isa.h:68:34: error: ‘hsa_wavefront_t’ does not name a type; did you mean ‘hsa_agent_t’?
   static Wavefront *Object(const hsa_wavefront_t &handle) {
                                  ^~~~~~~~~~~~~~~
                                  hsa_agent_t
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/isa.h:74:22: error: ‘hsa_wavefront_info_t’ does not name a type; did you mean ‘hsa_agent_info_t’?
   bool GetInfo(const hsa_wavefront_info_t &attribute, void *value) const;
                      ^~~~~~~~~~~~~~~~~~~~
                      hsa_agent_info_t
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/isa.h: In static member function ‘static core::Wavefront* core::Wavefront::Object(const int&)’:
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/isa.h:69:70: error: request for member ‘handle’ in ‘handle’, which is of non-class type ‘const int’
     Wavefront *object = amd::hsa::common::ObjectAt<Wavefront>(handle.handle);
                                                                      ^~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/isa.h: At global scope:
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/isa.h:178:3: error: ‘hsa_round_method_t’ does not name a type; did you mean ‘hsa_code_symbol_t’?
   hsa_round_method_t GetRoundMethod(
   ^~~~~~~~~~~~~~~~~~
   hsa_code_symbol_t
In file included from /home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/agent.h:54:0,
                 from /home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/blit.h:48,
                 from /home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/amd_blit_kernel.h:50,
                 from /home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/runtime/amd_blit_kernel.cpp:43:
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/queue.h: In member function ‘std::__cxx11::string core::AqlPacket::string() const’:
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/queue.h:90:53: error: ‘HSA_PACKET_HEADER_SCACQUIRE_FENCE_SCOPE’ was not declared in this scope
            << "\nacquire: " << ((dispatch.header >> HSA_PACKET_HEADER_SCACQUIRE_FENCE_SCOPE) &
                                                     ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/queue.h:90:53: note: suggested alternative: ‘HSA_PACKET_HEADER_ACQUIRE_FENCE_SCOPE’
            << "\nacquire: " << ((dispatch.header >> HSA_PACKET_HEADER_SCACQUIRE_FENCE_SCOPE) &
                                                     ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
                                                     HSA_PACKET_HEADER_ACQUIRE_FENCE_SCOPE
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/queue.h:91:40: error: ‘HSA_PACKET_HEADER_WIDTH_SCACQUIRE_FENCE_SCOPE’ was not declared in this scope
                                 ((1 << HSA_PACKET_HEADER_WIDTH_SCACQUIRE_FENCE_SCOPE) - 1))
                                        ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/queue.h:91:40: note: suggested alternative: ‘HSA_PACKET_HEADER_WIDTH_ACQUIRE_FENCE_SCOPE’
                                 ((1 << HSA_PACKET_HEADER_WIDTH_SCACQUIRE_FENCE_SCOPE) - 1))
                                        ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
                                        HSA_PACKET_HEADER_WIDTH_ACQUIRE_FENCE_SCOPE
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/queue.h:92:53: error: ‘HSA_PACKET_HEADER_SCRELEASE_FENCE_SCOPE’ was not declared in this scope
            << "\nrelease: " << ((dispatch.header >> HSA_PACKET_HEADER_SCRELEASE_FENCE_SCOPE) &
                                                     ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/queue.h:92:53: note: suggested alternative: ‘HSA_PACKET_HEADER_RELEASE_FENCE_SCOPE’
            << "\nrelease: " << ((dispatch.header >> HSA_PACKET_HEADER_SCRELEASE_FENCE_SCOPE) &
                                                     ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
                                                     HSA_PACKET_HEADER_RELEASE_FENCE_SCOPE
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/queue.h:93:40: error: ‘HSA_PACKET_HEADER_WIDTH_SCRELEASE_FENCE_SCOPE’ was not declared in this scope
                                 ((1 << HSA_PACKET_HEADER_WIDTH_SCRELEASE_FENCE_SCOPE) - 1));
                                        ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/queue.h:93:40: note: suggested alternative: ‘HSA_PACKET_HEADER_WIDTH_RELEASE_FENCE_SCOPE’
                                 ((1 << HSA_PACKET_HEADER_WIDTH_SCRELEASE_FENCE_SCOPE) - 1));
                                        ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
                                        HSA_PACKET_HEADER_WIDTH_RELEASE_FENCE_SCOPE
In file included from /home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/blit.h:48:0,
                 from /home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/amd_blit_kernel.h:50,
                 from /home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/runtime/amd_blit_kernel.cpp:43:
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/agent.h: At global scope:
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/agent.h:193:61: error: expected ‘,’ or ‘...’ before ‘(’ token
   virtual hsa_status_t IterateCache(hsa_status_t (*callback)(hsa_cache_t cache, void* data),
                                                             ^
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/agent.h:215:49: error: ‘hsa_queue_type32_t’ has not been declared
   virtual hsa_status_t QueueCreate(size_t size, hsa_queue_type32_t queue_type,
                                                 ^~~~~~~~~~~~~~~~~~
In file included from /home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_api_trace_int.h:46:0,
                 from /home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_ext_interface.h:49,
                 from /home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/runtime.h:51,
                 from /home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/amd_gpu_agent.h:53,
                 from /home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/runtime/amd_blit_kernel.cpp:49:
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:91:46: error: ‘hsa_queue_type32_t’ has not been declared
     hsa_agent_t agent_handle, uint32_t size, hsa_queue_type32_t type,
                                              ^~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:192:12: error: ‘hsa_queue_load_read_index_scacquire’ was not declared in this scope
   decltype(hsa_queue_load_read_index_scacquire)* hsa_queue_load_read_index_scacquire_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:192:12: note: suggested alternative: ‘hsa_queue_load_read_index_acquire’
   decltype(hsa_queue_load_read_index_scacquire)* hsa_queue_load_read_index_scacquire_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_queue_load_read_index_acquire
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:192:12: error: ‘hsa_queue_load_read_index_scacquire’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:192:12: note: suggested alternative: ‘hsa_queue_load_read_index_acquire’
   decltype(hsa_queue_load_read_index_scacquire)* hsa_queue_load_read_index_scacquire_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_queue_load_read_index_acquire
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:194:12: error: ‘hsa_queue_load_write_index_scacquire’ was not declared in this scope
   decltype(hsa_queue_load_write_index_scacquire)* hsa_queue_load_write_index_scacquire_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:194:12: note: suggested alternative: ‘hsa_queue_load_write_index_acquire’
   decltype(hsa_queue_load_write_index_scacquire)* hsa_queue_load_write_index_scacquire_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_queue_load_write_index_acquire
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:194:12: error: ‘hsa_queue_load_write_index_scacquire’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:194:12: note: suggested alternative: ‘hsa_queue_load_write_index_acquire’
   decltype(hsa_queue_load_write_index_scacquire)* hsa_queue_load_write_index_scacquire_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_queue_load_write_index_acquire
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:197:12: error: ‘hsa_queue_store_write_index_screlease’ was not declared in this scope
   decltype(hsa_queue_store_write_index_screlease)* hsa_queue_store_write_index_screlease_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:197:12: note: suggested alternative: ‘hsa_queue_store_write_index_release’
   decltype(hsa_queue_store_write_index_screlease)* hsa_queue_store_write_index_screlease_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_queue_store_write_index_release
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:197:12: error: ‘hsa_queue_store_write_index_screlease’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:197:12: note: suggested alternative: ‘hsa_queue_store_write_index_release’
   decltype(hsa_queue_store_write_index_screlease)* hsa_queue_store_write_index_screlease_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_queue_store_write_index_release
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:198:12: error: ‘hsa_queue_cas_write_index_scacq_screl’ was not declared in this scope
   decltype(hsa_queue_cas_write_index_scacq_screl)* hsa_queue_cas_write_index_scacq_screl_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:198:12: note: suggested alternative: ‘hsa_queue_cas_write_index_acq_rel’
   decltype(hsa_queue_cas_write_index_scacq_screl)* hsa_queue_cas_write_index_scacq_screl_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_queue_cas_write_index_acq_rel
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:198:12: error: ‘hsa_queue_cas_write_index_scacq_screl’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:198:12: note: suggested alternative: ‘hsa_queue_cas_write_index_acq_rel’
   decltype(hsa_queue_cas_write_index_scacq_screl)* hsa_queue_cas_write_index_scacq_screl_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_queue_cas_write_index_acq_rel
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:199:12: error: ‘hsa_queue_cas_write_index_scacquire’ was not declared in this scope
   decltype(hsa_queue_cas_write_index_scacquire)* hsa_queue_cas_write_index_scacquire_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:199:12: note: suggested alternative: ‘hsa_queue_cas_write_index_acquire’
   decltype(hsa_queue_cas_write_index_scacquire)* hsa_queue_cas_write_index_scacquire_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_queue_cas_write_index_acquire
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:199:12: error: ‘hsa_queue_cas_write_index_scacquire’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:199:12: note: suggested alternative: ‘hsa_queue_cas_write_index_acquire’
   decltype(hsa_queue_cas_write_index_scacquire)* hsa_queue_cas_write_index_scacquire_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_queue_cas_write_index_acquire
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:201:12: error: ‘hsa_queue_cas_write_index_screlease’ was not declared in this scope
   decltype(hsa_queue_cas_write_index_screlease)* hsa_queue_cas_write_index_screlease_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:201:12: note: suggested alternative: ‘hsa_queue_cas_write_index_release’
   decltype(hsa_queue_cas_write_index_screlease)* hsa_queue_cas_write_index_screlease_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_queue_cas_write_index_release
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:201:12: error: ‘hsa_queue_cas_write_index_screlease’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:201:12: note: suggested alternative: ‘hsa_queue_cas_write_index_release’
   decltype(hsa_queue_cas_write_index_screlease)* hsa_queue_cas_write_index_screlease_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_queue_cas_write_index_release
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:202:12: error: ‘hsa_queue_add_write_index_scacq_screl’ was not declared in this scope
   decltype(hsa_queue_add_write_index_scacq_screl)* hsa_queue_add_write_index_scacq_screl_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:202:12: note: suggested alternative: ‘hsa_queue_add_write_index_acq_rel’
   decltype(hsa_queue_add_write_index_scacq_screl)* hsa_queue_add_write_index_scacq_screl_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_queue_add_write_index_acq_rel
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:202:12: error: ‘hsa_queue_add_write_index_scacq_screl’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:202:12: note: suggested alternative: ‘hsa_queue_add_write_index_acq_rel’
   decltype(hsa_queue_add_write_index_scacq_screl)* hsa_queue_add_write_index_scacq_screl_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_queue_add_write_index_acq_rel
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:203:12: error: ‘hsa_queue_add_write_index_scacquire’ was not declared in this scope
   decltype(hsa_queue_add_write_index_scacquire)* hsa_queue_add_write_index_scacquire_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:203:12: note: suggested alternative: ‘hsa_queue_add_write_index_acquire’
   decltype(hsa_queue_add_write_index_scacquire)* hsa_queue_add_write_index_scacquire_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_queue_add_write_index_acquire
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:203:12: error: ‘hsa_queue_add_write_index_scacquire’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:203:12: note: suggested alternative: ‘hsa_queue_add_write_index_acquire’
   decltype(hsa_queue_add_write_index_scacquire)* hsa_queue_add_write_index_scacquire_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_queue_add_write_index_acquire
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:205:12: error: ‘hsa_queue_add_write_index_screlease’ was not declared in this scope
   decltype(hsa_queue_add_write_index_screlease)* hsa_queue_add_write_index_screlease_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:205:12: note: suggested alternative: ‘hsa_queue_add_write_index_release’
   decltype(hsa_queue_add_write_index_screlease)* hsa_queue_add_write_index_screlease_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_queue_add_write_index_release
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:205:12: error: ‘hsa_queue_add_write_index_screlease’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:205:12: note: suggested alternative: ‘hsa_queue_add_write_index_release’
   decltype(hsa_queue_add_write_index_screlease)* hsa_queue_add_write_index_screlease_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_queue_add_write_index_release
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:207:12: error: ‘hsa_queue_store_read_index_screlease’ was not declared in this scope
   decltype(hsa_queue_store_read_index_screlease)* hsa_queue_store_read_index_screlease_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:207:12: note: suggested alternative: ‘hsa_queue_store_read_index_release’
   decltype(hsa_queue_store_read_index_screlease)* hsa_queue_store_read_index_screlease_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_queue_store_read_index_release
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:207:12: error: ‘hsa_queue_store_read_index_screlease’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:207:12: note: suggested alternative: ‘hsa_queue_store_read_index_release’
   decltype(hsa_queue_store_read_index_screlease)* hsa_queue_store_read_index_screlease_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_queue_store_read_index_release
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:221:12: error: ‘hsa_signal_load_scacquire’ was not declared in this scope
   decltype(hsa_signal_load_scacquire)* hsa_signal_load_scacquire_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:221:12: note: suggested alternative: ‘hsa_signal_load_acquire’
   decltype(hsa_signal_load_scacquire)* hsa_signal_load_scacquire_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_load_acquire
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:221:12: error: ‘hsa_signal_load_scacquire’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:221:12: note: suggested alternative: ‘hsa_signal_load_acquire’
   decltype(hsa_signal_load_scacquire)* hsa_signal_load_scacquire_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_load_acquire
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:223:12: error: ‘hsa_signal_store_screlease’ was not declared in this scope
   decltype(hsa_signal_store_screlease)* hsa_signal_store_screlease_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:223:12: note: suggested alternative: ‘hsa_signal_store_release’
   decltype(hsa_signal_store_screlease)* hsa_signal_store_screlease_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_store_release
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:223:12: error: ‘hsa_signal_store_screlease’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:223:12: note: suggested alternative: ‘hsa_signal_store_release’
   decltype(hsa_signal_store_screlease)* hsa_signal_store_screlease_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_store_release
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:225:12: error: ‘hsa_signal_wait_scacquire’ was not declared in this scope
   decltype(hsa_signal_wait_scacquire)* hsa_signal_wait_scacquire_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:225:12: note: suggested alternative: ‘hsa_signal_wait_acquire’
   decltype(hsa_signal_wait_scacquire)* hsa_signal_wait_scacquire_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_wait_acquire
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:225:12: error: ‘hsa_signal_wait_scacquire’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:225:12: note: suggested alternative: ‘hsa_signal_wait_acquire’
   decltype(hsa_signal_wait_scacquire)* hsa_signal_wait_scacquire_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_wait_acquire
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:227:12: error: ‘hsa_signal_and_scacquire’ was not declared in this scope
   decltype(hsa_signal_and_scacquire)* hsa_signal_and_scacquire_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:227:12: note: suggested alternative: ‘hsa_signal_and_acquire’
   decltype(hsa_signal_and_scacquire)* hsa_signal_and_scacquire_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_and_acquire
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:227:12: error: ‘hsa_signal_and_scacquire’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:227:12: note: suggested alternative: ‘hsa_signal_and_acquire’
   decltype(hsa_signal_and_scacquire)* hsa_signal_and_scacquire_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_and_acquire
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:228:12: error: ‘hsa_signal_and_screlease’ was not declared in this scope
   decltype(hsa_signal_and_screlease)* hsa_signal_and_screlease_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:228:12: note: suggested alternative: ‘hsa_signal_and_release’
   decltype(hsa_signal_and_screlease)* hsa_signal_and_screlease_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_and_release
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:228:12: error: ‘hsa_signal_and_screlease’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:228:12: note: suggested alternative: ‘hsa_signal_and_release’
   decltype(hsa_signal_and_screlease)* hsa_signal_and_screlease_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_and_release
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:229:12: error: ‘hsa_signal_and_scacq_screl’ was not declared in this scope
   decltype(hsa_signal_and_scacq_screl)* hsa_signal_and_scacq_screl_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:229:12: note: suggested alternative: ‘hsa_signal_and_acq_rel’
   decltype(hsa_signal_and_scacq_screl)* hsa_signal_and_scacq_screl_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_and_acq_rel
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:229:12: error: ‘hsa_signal_and_scacq_screl’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:229:12: note: suggested alternative: ‘hsa_signal_and_acq_rel’
   decltype(hsa_signal_and_scacq_screl)* hsa_signal_and_scacq_screl_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_and_acq_rel
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:231:12: error: ‘hsa_signal_or_scacquire’ was not declared in this scope
   decltype(hsa_signal_or_scacquire)* hsa_signal_or_scacquire_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:231:12: note: suggested alternative: ‘hsa_signal_or_acquire’
   decltype(hsa_signal_or_scacquire)* hsa_signal_or_scacquire_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_or_acquire
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:231:12: error: ‘hsa_signal_or_scacquire’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:231:12: note: suggested alternative: ‘hsa_signal_or_acquire’
   decltype(hsa_signal_or_scacquire)* hsa_signal_or_scacquire_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_or_acquire
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:232:12: error: ‘hsa_signal_or_screlease’ was not declared in this scope
   decltype(hsa_signal_or_screlease)* hsa_signal_or_screlease_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:232:12: note: suggested alternative: ‘hsa_signal_or_release’
   decltype(hsa_signal_or_screlease)* hsa_signal_or_screlease_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_or_release
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:232:12: error: ‘hsa_signal_or_screlease’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:232:12: note: suggested alternative: ‘hsa_signal_or_release’
   decltype(hsa_signal_or_screlease)* hsa_signal_or_screlease_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_or_release
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:233:12: error: ‘hsa_signal_or_scacq_screl’ was not declared in this scope
   decltype(hsa_signal_or_scacq_screl)* hsa_signal_or_scacq_screl_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:233:12: note: suggested alternative: ‘hsa_signal_or_acq_rel’
   decltype(hsa_signal_or_scacq_screl)* hsa_signal_or_scacq_screl_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_or_acq_rel
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:233:12: error: ‘hsa_signal_or_scacq_screl’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:233:12: note: suggested alternative: ‘hsa_signal_or_acq_rel’
   decltype(hsa_signal_or_scacq_screl)* hsa_signal_or_scacq_screl_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_or_acq_rel
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:235:12: error: ‘hsa_signal_xor_scacquire’ was not declared in this scope
   decltype(hsa_signal_xor_scacquire)* hsa_signal_xor_scacquire_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:235:12: note: suggested alternative: ‘hsa_signal_xor_acquire’
   decltype(hsa_signal_xor_scacquire)* hsa_signal_xor_scacquire_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_xor_acquire
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:235:12: error: ‘hsa_signal_xor_scacquire’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:235:12: note: suggested alternative: ‘hsa_signal_xor_acquire’
   decltype(hsa_signal_xor_scacquire)* hsa_signal_xor_scacquire_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_xor_acquire
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:236:12: error: ‘hsa_signal_xor_screlease’ was not declared in this scope
   decltype(hsa_signal_xor_screlease)* hsa_signal_xor_screlease_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:236:12: note: suggested alternative: ‘hsa_signal_xor_release’
   decltype(hsa_signal_xor_screlease)* hsa_signal_xor_screlease_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_xor_release
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:236:12: error: ‘hsa_signal_xor_screlease’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:236:12: note: suggested alternative: ‘hsa_signal_xor_release’
   decltype(hsa_signal_xor_screlease)* hsa_signal_xor_screlease_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_xor_release
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:237:12: error: ‘hsa_signal_xor_scacq_screl’ was not declared in this scope
   decltype(hsa_signal_xor_scacq_screl)* hsa_signal_xor_scacq_screl_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:237:12: note: suggested alternative: ‘hsa_signal_or_scacq_screl_fn’
   decltype(hsa_signal_xor_scacq_screl)* hsa_signal_xor_scacq_screl_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_or_scacq_screl_fn
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:237:12: error: ‘hsa_signal_xor_scacq_screl’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:237:12: note: suggested alternative: ‘hsa_signal_or_scacq_screl_fn’
   decltype(hsa_signal_xor_scacq_screl)* hsa_signal_xor_scacq_screl_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_or_scacq_screl_fn
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:239:12: error: ‘hsa_signal_exchange_scacquire’ was not declared in this scope
   decltype(hsa_signal_exchange_scacquire)* hsa_signal_exchange_scacquire_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:239:12: note: suggested alternative: ‘hsa_signal_exchange_acquire’
   decltype(hsa_signal_exchange_scacquire)* hsa_signal_exchange_scacquire_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_exchange_acquire
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:239:12: error: ‘hsa_signal_exchange_scacquire’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:239:12: note: suggested alternative: ‘hsa_signal_exchange_acquire’
   decltype(hsa_signal_exchange_scacquire)* hsa_signal_exchange_scacquire_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_exchange_acquire
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:240:12: error: ‘hsa_signal_exchange_screlease’ was not declared in this scope
   decltype(hsa_signal_exchange_screlease)* hsa_signal_exchange_screlease_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:240:12: note: suggested alternative: ‘hsa_signal_exchange_release’
   decltype(hsa_signal_exchange_screlease)* hsa_signal_exchange_screlease_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_exchange_release
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:240:12: error: ‘hsa_signal_exchange_screlease’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:240:12: note: suggested alternative: ‘hsa_signal_exchange_release’
   decltype(hsa_signal_exchange_screlease)* hsa_signal_exchange_screlease_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_exchange_release
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:241:12: error: ‘hsa_signal_exchange_scacq_screl’ was not declared in this scope
   decltype(hsa_signal_exchange_scacq_screl)* hsa_signal_exchange_scacq_screl_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:241:12: note: suggested alternative: ‘hsa_signal_exchange_acq_rel’
   decltype(hsa_signal_exchange_scacq_screl)* hsa_signal_exchange_scacq_screl_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_exchange_acq_rel
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:241:12: error: ‘hsa_signal_exchange_scacq_screl’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:241:12: note: suggested alternative: ‘hsa_signal_exchange_acq_rel’
   decltype(hsa_signal_exchange_scacq_screl)* hsa_signal_exchange_scacq_screl_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_exchange_acq_rel
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:243:12: error: ‘hsa_signal_add_scacquire’ was not declared in this scope
   decltype(hsa_signal_add_scacquire)* hsa_signal_add_scacquire_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:243:12: note: suggested alternative: ‘hsa_signal_add_acquire’
   decltype(hsa_signal_add_scacquire)* hsa_signal_add_scacquire_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_add_acquire
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:243:12: error: ‘hsa_signal_add_scacquire’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:243:12: note: suggested alternative: ‘hsa_signal_add_acquire’
   decltype(hsa_signal_add_scacquire)* hsa_signal_add_scacquire_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_add_acquire
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:244:12: error: ‘hsa_signal_add_screlease’ was not declared in this scope
   decltype(hsa_signal_add_screlease)* hsa_signal_add_screlease_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:244:12: note: suggested alternative: ‘hsa_signal_add_release’
   decltype(hsa_signal_add_screlease)* hsa_signal_add_screlease_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_add_release
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:244:12: error: ‘hsa_signal_add_screlease’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:244:12: note: suggested alternative: ‘hsa_signal_add_release’
   decltype(hsa_signal_add_screlease)* hsa_signal_add_screlease_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_add_release
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:245:12: error: ‘hsa_signal_add_scacq_screl’ was not declared in this scope
   decltype(hsa_signal_add_scacq_screl)* hsa_signal_add_scacq_screl_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:245:12: note: suggested alternative: ‘hsa_signal_and_scacq_screl_fn’
   decltype(hsa_signal_add_scacq_screl)* hsa_signal_add_scacq_screl_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_and_scacq_screl_fn
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:245:12: error: ‘hsa_signal_add_scacq_screl’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:245:12: note: suggested alternative: ‘hsa_signal_and_scacq_screl_fn’
   decltype(hsa_signal_add_scacq_screl)* hsa_signal_add_scacq_screl_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_and_scacq_screl_fn
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:247:12: error: ‘hsa_signal_subtract_scacquire’ was not declared in this scope
   decltype(hsa_signal_subtract_scacquire)* hsa_signal_subtract_scacquire_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:247:12: note: suggested alternative: ‘hsa_signal_subtract_acquire’
   decltype(hsa_signal_subtract_scacquire)* hsa_signal_subtract_scacquire_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_subtract_acquire
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:247:12: error: ‘hsa_signal_subtract_scacquire’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:247:12: note: suggested alternative: ‘hsa_signal_subtract_acquire’
   decltype(hsa_signal_subtract_scacquire)* hsa_signal_subtract_scacquire_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_subtract_acquire
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:248:12: error: ‘hsa_signal_subtract_screlease’ was not declared in this scope
   decltype(hsa_signal_subtract_screlease)* hsa_signal_subtract_screlease_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:248:12: note: suggested alternative: ‘hsa_signal_subtract_release’
   decltype(hsa_signal_subtract_screlease)* hsa_signal_subtract_screlease_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_subtract_release
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:248:12: error: ‘hsa_signal_subtract_screlease’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:248:12: note: suggested alternative: ‘hsa_signal_subtract_release’
   decltype(hsa_signal_subtract_screlease)* hsa_signal_subtract_screlease_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_subtract_release
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:249:12: error: ‘hsa_signal_subtract_scacq_screl’ was not declared in this scope
   decltype(hsa_signal_subtract_scacq_screl)* hsa_signal_subtract_scacq_screl_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:249:12: note: suggested alternative: ‘hsa_signal_subtract_acq_rel’
   decltype(hsa_signal_subtract_scacq_screl)* hsa_signal_subtract_scacq_screl_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_subtract_acq_rel
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:249:12: error: ‘hsa_signal_subtract_scacq_screl’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:249:12: note: suggested alternative: ‘hsa_signal_subtract_acq_rel’
   decltype(hsa_signal_subtract_scacq_screl)* hsa_signal_subtract_scacq_screl_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_subtract_acq_rel
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:251:12: error: ‘hsa_signal_cas_scacquire’ was not declared in this scope
   decltype(hsa_signal_cas_scacquire)* hsa_signal_cas_scacquire_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:251:12: note: suggested alternative: ‘hsa_signal_cas_acquire’
   decltype(hsa_signal_cas_scacquire)* hsa_signal_cas_scacquire_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_cas_acquire
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:251:12: error: ‘hsa_signal_cas_scacquire’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:251:12: note: suggested alternative: ‘hsa_signal_cas_acquire’
   decltype(hsa_signal_cas_scacquire)* hsa_signal_cas_scacquire_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_cas_acquire
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:252:12: error: ‘hsa_signal_cas_screlease’ was not declared in this scope
   decltype(hsa_signal_cas_screlease)* hsa_signal_cas_screlease_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:252:12: note: suggested alternative: ‘hsa_signal_cas_release’
   decltype(hsa_signal_cas_screlease)* hsa_signal_cas_screlease_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_cas_release
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:252:12: error: ‘hsa_signal_cas_screlease’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:252:12: note: suggested alternative: ‘hsa_signal_cas_release’
   decltype(hsa_signal_cas_screlease)* hsa_signal_cas_screlease_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_cas_release
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:253:12: error: ‘hsa_signal_cas_scacq_screl’ was not declared in this scope
   decltype(hsa_signal_cas_scacq_screl)* hsa_signal_cas_scacq_screl_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:253:12: note: suggested alternative: ‘hsa_signal_cas_acq_rel’
   decltype(hsa_signal_cas_scacq_screl)* hsa_signal_cas_scacq_screl_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_cas_acq_rel
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:253:12: error: ‘hsa_signal_cas_scacq_screl’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:253:12: note: suggested alternative: ‘hsa_signal_cas_acq_rel’
   decltype(hsa_signal_cas_scacq_screl)* hsa_signal_cas_scacq_screl_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_cas_acq_rel
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:307:12: error: ‘hsa_extension_get_name’ was not declared in this scope
   decltype(hsa_extension_get_name)* hsa_extension_get_name_fn;
            ^~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:307:12: note: suggested alternative: ‘hsa_extension_t’
   decltype(hsa_extension_get_name)* hsa_extension_get_name_fn;
            ^~~~~~~~~~~~~~~~~~~~~~
            hsa_extension_t
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:307:12: error: ‘hsa_extension_get_name’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:307:12: note: suggested alternative: ‘hsa_extension_t’
   decltype(hsa_extension_get_name)* hsa_extension_get_name_fn;
            ^~~~~~~~~~~~~~~~~~~~~~
            hsa_extension_t
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:308:12: error: ‘hsa_system_major_extension_supported’ was not declared in this scope
   decltype(hsa_system_major_extension_supported)* hsa_system_major_extension_supported_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:308:12: note: suggested alternative: ‘hsa_system_extension_supported’
   decltype(hsa_system_major_extension_supported)* hsa_system_major_extension_supported_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_system_extension_supported
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:308:12: error: ‘hsa_system_major_extension_supported’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:308:12: note: suggested alternative: ‘hsa_system_extension_supported’
   decltype(hsa_system_major_extension_supported)* hsa_system_major_extension_supported_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_system_extension_supported
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:309:12: error: ‘hsa_system_get_major_extension_table’ was not declared in this scope
   decltype(hsa_system_get_major_extension_table)* hsa_system_get_major_extension_table_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:309:12: note: suggested alternative: ‘hsa_system_get_extension_table’
   decltype(hsa_system_get_major_extension_table)* hsa_system_get_major_extension_table_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_system_get_extension_table
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:309:12: error: ‘hsa_system_get_major_extension_table’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:309:12: note: suggested alternative: ‘hsa_system_get_extension_table’
   decltype(hsa_system_get_major_extension_table)* hsa_system_get_major_extension_table_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_system_get_extension_table
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:310:12: error: ‘hsa_agent_major_extension_supported’ was not declared in this scope
   decltype(hsa_agent_major_extension_supported)* hsa_agent_major_extension_supported_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:310:12: note: suggested alternative: ‘hsa_agent_extension_supported’
   decltype(hsa_agent_major_extension_supported)* hsa_agent_major_extension_supported_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_agent_extension_supported
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:310:12: error: ‘hsa_agent_major_extension_supported’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:310:12: note: suggested alternative: ‘hsa_agent_extension_supported’
   decltype(hsa_agent_major_extension_supported)* hsa_agent_major_extension_supported_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_agent_extension_supported
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:311:12: error: ‘hsa_cache_get_info’ was not declared in this scope
   decltype(hsa_cache_get_info)* hsa_cache_get_info_fn;
            ^~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:311:12: note: suggested alternative: ‘hsa_isa_get_info’
   decltype(hsa_cache_get_info)* hsa_cache_get_info_fn;
            ^~~~~~~~~~~~~~~~~~
            hsa_isa_get_info
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:311:12: error: ‘hsa_cache_get_info’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:311:12: note: suggested alternative: ‘hsa_isa_get_info’
   decltype(hsa_cache_get_info)* hsa_cache_get_info_fn;
            ^~~~~~~~~~~~~~~~~~
            hsa_isa_get_info
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:312:12: error: ‘hsa_agent_iterate_caches’ was not declared in this scope
   decltype(hsa_agent_iterate_caches)* hsa_agent_iterate_caches_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:312:12: note: suggested alternative: ‘hsa_agent_iterate_regions’
   decltype(hsa_agent_iterate_caches)* hsa_agent_iterate_caches_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~
            hsa_agent_iterate_regions
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:312:12: error: ‘hsa_agent_iterate_caches’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:312:12: note: suggested alternative: ‘hsa_agent_iterate_regions’
   decltype(hsa_agent_iterate_caches)* hsa_agent_iterate_caches_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~
            hsa_agent_iterate_regions
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:313:12: error: ‘hsa_signal_silent_store_relaxed’ was not declared in this scope
   decltype(hsa_signal_silent_store_relaxed)* hsa_signal_silent_store_relaxed_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:313:12: note: suggested alternative: ‘hsa_signal_store_relaxed’
   decltype(hsa_signal_silent_store_relaxed)* hsa_signal_silent_store_relaxed_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_store_relaxed
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:313:12: error: ‘hsa_signal_silent_store_relaxed’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:313:12: note: suggested alternative: ‘hsa_signal_store_relaxed’
   decltype(hsa_signal_silent_store_relaxed)* hsa_signal_silent_store_relaxed_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_store_relaxed
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:314:12: error: ‘hsa_signal_silent_store_screlease’ was not declared in this scope
   decltype(hsa_signal_silent_store_screlease)* hsa_signal_silent_store_screlease_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:314:12: note: suggested alternative: ‘hsa_signal_silent_store_relaxed_fn’
   decltype(hsa_signal_silent_store_screlease)* hsa_signal_silent_store_screlease_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_silent_store_relaxed_fn
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:314:12: error: ‘hsa_signal_silent_store_screlease’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:314:12: note: suggested alternative: ‘hsa_signal_silent_store_relaxed_fn’
   decltype(hsa_signal_silent_store_screlease)* hsa_signal_silent_store_screlease_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_silent_store_relaxed_fn
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:315:12: error: ‘hsa_signal_group_create’ was not declared in this scope
   decltype(hsa_signal_group_create)* hsa_signal_group_create_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:315:12: note: suggested alternative: ‘hsa_signal_create’
   decltype(hsa_signal_group_create)* hsa_signal_group_create_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_create
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:315:12: error: ‘hsa_signal_group_create’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:315:12: note: suggested alternative: ‘hsa_signal_create’
   decltype(hsa_signal_group_create)* hsa_signal_group_create_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_create
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:316:12: error: ‘hsa_signal_group_destroy’ was not declared in this scope
   decltype(hsa_signal_group_destroy)* hsa_signal_group_destroy_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:316:12: note: suggested alternative: ‘hsa_signal_destroy’
   decltype(hsa_signal_group_destroy)* hsa_signal_group_destroy_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_destroy
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:316:12: error: ‘hsa_signal_group_destroy’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:316:12: note: suggested alternative: ‘hsa_signal_destroy’
   decltype(hsa_signal_group_destroy)* hsa_signal_group_destroy_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_destroy
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:317:12: error: ‘hsa_signal_group_wait_any_scacquire’ was not declared in this scope
   decltype(hsa_signal_group_wait_any_scacquire)* hsa_signal_group_wait_any_scacquire_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:317:12: note: suggested alternative: ‘hsa_signal_wait_acquire’
   decltype(hsa_signal_group_wait_any_scacquire)* hsa_signal_group_wait_any_scacquire_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_wait_acquire
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:317:12: error: ‘hsa_signal_group_wait_any_scacquire’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:317:12: note: suggested alternative: ‘hsa_signal_wait_acquire’
   decltype(hsa_signal_group_wait_any_scacquire)* hsa_signal_group_wait_any_scacquire_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_wait_acquire
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:318:12: error: ‘hsa_signal_group_wait_any_relaxed’ was not declared in this scope
   decltype(hsa_signal_group_wait_any_relaxed)* hsa_signal_group_wait_any_relaxed_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:318:12: note: suggested alternative: ‘hsa_signal_wait_relaxed’
   decltype(hsa_signal_group_wait_any_relaxed)* hsa_signal_group_wait_any_relaxed_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_wait_relaxed
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:318:12: error: ‘hsa_signal_group_wait_any_relaxed’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:318:12: note: suggested alternative: ‘hsa_signal_wait_relaxed’
   decltype(hsa_signal_group_wait_any_relaxed)* hsa_signal_group_wait_any_relaxed_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_signal_wait_relaxed
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:322:12: error: ‘hsa_agent_iterate_isas’ was not declared in this scope
   decltype(hsa_agent_iterate_isas)* hsa_agent_iterate_isas_fn;
            ^~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:322:12: note: suggested alternative: ‘hsa_agent_iterate_regions’
   decltype(hsa_agent_iterate_isas)* hsa_agent_iterate_isas_fn;
            ^~~~~~~~~~~~~~~~~~~~~~
            hsa_agent_iterate_regions
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:322:12: error: ‘hsa_agent_iterate_isas’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:322:12: note: suggested alternative: ‘hsa_agent_iterate_regions’
   decltype(hsa_agent_iterate_isas)* hsa_agent_iterate_isas_fn;
            ^~~~~~~~~~~~~~~~~~~~~~
            hsa_agent_iterate_regions
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:323:12: error: ‘hsa_isa_get_info_alt’ was not declared in this scope
   decltype(hsa_isa_get_info_alt)* hsa_isa_get_info_alt_fn;
            ^~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:323:12: note: suggested alternative: ‘hsa_isa_get_info_fn’
   decltype(hsa_isa_get_info_alt)* hsa_isa_get_info_alt_fn;
            ^~~~~~~~~~~~~~~~~~~~
            hsa_isa_get_info_fn
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:323:12: error: ‘hsa_isa_get_info_alt’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:323:12: note: suggested alternative: ‘hsa_isa_get_info_fn’
   decltype(hsa_isa_get_info_alt)* hsa_isa_get_info_alt_fn;
            ^~~~~~~~~~~~~~~~~~~~
            hsa_isa_get_info_fn
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:324:12: error: ‘hsa_isa_get_exception_policies’ was not declared in this scope
   decltype(hsa_isa_get_exception_policies)* hsa_isa_get_exception_policies_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:324:12: note: suggested alternative: ‘hsa_agent_get_exception_policies’
   decltype(hsa_isa_get_exception_policies)* hsa_isa_get_exception_policies_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_agent_get_exception_policies
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:324:12: error: ‘hsa_isa_get_exception_policies’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:324:12: note: suggested alternative: ‘hsa_agent_get_exception_policies’
   decltype(hsa_isa_get_exception_policies)* hsa_isa_get_exception_policies_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_agent_get_exception_policies
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:325:12: error: ‘hsa_isa_get_round_method’ was not declared in this scope
   decltype(hsa_isa_get_round_method)* hsa_isa_get_round_method_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:325:12: note: suggested alternative: ‘hsa_isa_get_info_alt_fn’
   decltype(hsa_isa_get_round_method)* hsa_isa_get_round_method_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~
            hsa_isa_get_info_alt_fn
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:325:12: error: ‘hsa_isa_get_round_method’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:325:12: note: suggested alternative: ‘hsa_isa_get_info_alt_fn’
   decltype(hsa_isa_get_round_method)* hsa_isa_get_round_method_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~
            hsa_isa_get_info_alt_fn
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:326:12: error: ‘hsa_wavefront_get_info’ was not declared in this scope
   decltype(hsa_wavefront_get_info)* hsa_wavefront_get_info_fn;
            ^~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:326:12: note: suggested alternative: ‘hsa_agent_get_info’
   decltype(hsa_wavefront_get_info)* hsa_wavefront_get_info_fn;
            ^~~~~~~~~~~~~~~~~~~~~~
            hsa_agent_get_info
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:326:12: error: ‘hsa_wavefront_get_info’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:326:12: note: suggested alternative: ‘hsa_agent_get_info’
   decltype(hsa_wavefront_get_info)* hsa_wavefront_get_info_fn;
            ^~~~~~~~~~~~~~~~~~~~~~
            hsa_agent_get_info
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:327:12: error: ‘hsa_isa_iterate_wavefronts’ was not declared in this scope
   decltype(hsa_isa_iterate_wavefronts)* hsa_isa_iterate_wavefronts_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:327:12: note: suggested alternative: ‘hsa_iterate_agents’
   decltype(hsa_isa_iterate_wavefronts)* hsa_isa_iterate_wavefronts_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_iterate_agents
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:327:12: error: ‘hsa_isa_iterate_wavefronts’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:327:12: note: suggested alternative: ‘hsa_iterate_agents’
   decltype(hsa_isa_iterate_wavefronts)* hsa_isa_iterate_wavefronts_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_iterate_agents
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:332:12: error: ‘hsa_code_object_get_symbol_from_name’ was not declared in this scope
   decltype(hsa_code_object_get_symbol_from_name)*
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:332:12: note: suggested alternative: ‘hsa_code_object_get_symbol_fn’
   decltype(hsa_code_object_get_symbol_from_name)*
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_code_object_get_symbol_fn
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:332:12: error: ‘hsa_code_object_get_symbol_from_name’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:332:12: note: suggested alternative: ‘hsa_code_object_get_symbol_fn’
   decltype(hsa_code_object_get_symbol_from_name)*
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_code_object_get_symbol_fn
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:337:12: error: ‘hsa_code_object_reader_create_from_file’ was not declared in this scope
   decltype(hsa_code_object_reader_create_from_file)*
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:337:12: note: suggested alternative: ‘hsa_code_object_iterate_symbols’
   decltype(hsa_code_object_reader_create_from_file)*
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_code_object_iterate_symbols
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:337:12: error: ‘hsa_code_object_reader_create_from_file’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:337:12: note: suggested alternative: ‘hsa_code_object_iterate_symbols’
   decltype(hsa_code_object_reader_create_from_file)*
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_code_object_iterate_symbols
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:339:12: error: ‘hsa_code_object_reader_create_from_memory’ was not declared in this scope
   decltype(hsa_code_object_reader_create_from_memory)*
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:339:12: note: suggested alternative: ‘hsa_code_object_reader_create_from_file_fn’
   decltype(hsa_code_object_reader_create_from_memory)*
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_code_object_reader_create_from_file_fn
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:339:12: error: ‘hsa_code_object_reader_create_from_memory’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:339:12: note: suggested alternative: ‘hsa_code_object_reader_create_from_file_fn’
   decltype(hsa_code_object_reader_create_from_memory)*
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_code_object_reader_create_from_file_fn
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:341:12: error: ‘hsa_code_object_reader_destroy’ was not declared in this scope
   decltype(hsa_code_object_reader_destroy)* hsa_code_object_reader_destroy_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:341:12: note: suggested alternative: ‘hsa_code_object_destroy’
   decltype(hsa_code_object_reader_destroy)* hsa_code_object_reader_destroy_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_code_object_destroy
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:341:12: error: ‘hsa_code_object_reader_destroy’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:341:12: note: suggested alternative: ‘hsa_code_object_destroy’
   decltype(hsa_code_object_reader_destroy)* hsa_code_object_reader_destroy_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_code_object_destroy
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:342:12: error: ‘hsa_executable_create_alt’ was not declared in this scope
   decltype(hsa_executable_create_alt)* hsa_executable_create_alt_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:342:12: note: suggested alternative: ‘hsa_executable_create_fn’
   decltype(hsa_executable_create_alt)* hsa_executable_create_alt_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_executable_create_fn
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:342:12: error: ‘hsa_executable_create_alt’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:342:12: note: suggested alternative: ‘hsa_executable_create_fn’
   decltype(hsa_executable_create_alt)* hsa_executable_create_alt_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_executable_create_fn
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:343:12: error: ‘hsa_executable_load_program_code_object’ was not declared in this scope
   decltype(hsa_executable_load_program_code_object)*
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:343:12: note: suggested alternative: ‘hsa_executable_load_code_object’
   decltype(hsa_executable_load_program_code_object)*
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_executable_load_code_object
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:343:12: error: ‘hsa_executable_load_program_code_object’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:343:12: note: suggested alternative: ‘hsa_executable_load_code_object’
   decltype(hsa_executable_load_program_code_object)*
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_executable_load_code_object
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:345:12: error: ‘hsa_executable_load_agent_code_object’ was not declared in this scope
   decltype(hsa_executable_load_agent_code_object)*
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:345:12: note: suggested alternative: ‘hsa_executable_load_code_object’
   decltype(hsa_executable_load_agent_code_object)*
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_executable_load_code_object
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:345:12: error: ‘hsa_executable_load_agent_code_object’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:345:12: note: suggested alternative: ‘hsa_executable_load_code_object’
   decltype(hsa_executable_load_agent_code_object)*
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_executable_load_code_object
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:347:12: error: ‘hsa_executable_validate_alt’ was not declared in this scope
   decltype(hsa_executable_validate_alt)* hsa_executable_validate_alt_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:347:12: note: suggested alternative: ‘hsa_executable_validate_fn’
   decltype(hsa_executable_validate_alt)* hsa_executable_validate_alt_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_executable_validate_fn
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:347:12: error: ‘hsa_executable_validate_alt’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:347:12: note: suggested alternative: ‘hsa_executable_validate_fn’
   decltype(hsa_executable_validate_alt)* hsa_executable_validate_alt_fn;
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_executable_validate_fn
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:348:12: error: ‘hsa_executable_get_symbol_by_name’ was not declared in this scope
   decltype(hsa_executable_get_symbol_by_name)*
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:348:12: note: suggested alternative: ‘hsa_executable_get_symbol_fn’
   decltype(hsa_executable_get_symbol_by_name)*
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_executable_get_symbol_fn
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:348:12: error: ‘hsa_executable_get_symbol_by_name’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:348:12: note: suggested alternative: ‘hsa_executable_get_symbol_fn’
   decltype(hsa_executable_get_symbol_by_name)*
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_executable_get_symbol_fn
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:350:12: error: ‘hsa_executable_iterate_agent_symbols’ was not declared in this scope
   decltype(hsa_executable_iterate_agent_symbols)*
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:350:12: note: suggested alternative: ‘hsa_executable_iterate_symbols’
   decltype(hsa_executable_iterate_agent_symbols)*
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_executable_iterate_symbols
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:350:12: error: ‘hsa_executable_iterate_agent_symbols’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:350:12: note: suggested alternative: ‘hsa_executable_iterate_symbols’
   decltype(hsa_executable_iterate_agent_symbols)*
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_executable_iterate_symbols
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:352:12: error: ‘hsa_executable_iterate_program_symbols’ was not declared in this scope
   decltype(hsa_executable_iterate_program_symbols)*
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:352:12: note: suggested alternative: ‘hsa_executable_iterate_symbols’
   decltype(hsa_executable_iterate_program_symbols)*
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_executable_iterate_symbols
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:352:12: error: ‘hsa_executable_iterate_program_symbols’ was not declared in this scope
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_api_trace.h:352:12: note: suggested alternative: ‘hsa_executable_iterate_symbols’
   decltype(hsa_executable_iterate_program_symbols)*
            ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
            hsa_executable_iterate_symbols
In file included from /home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_api_trace_int.h:47:0,
                 from /home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_ext_interface.h:49,
                 from /home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/runtime.h:51,
                 from /home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/amd_gpu_agent.h:53,
                 from /home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/runtime/amd_blit_kernel.cpp:49:
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:77:43: error: ‘hsa_cache_t’ was not declared in this scope
   hsa_status_t HSA_API hsa_cache_get_info(hsa_cache_t cache, hsa_cache_info_t attribute,
                                           ^~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:77:43: note: suggested alternative: ‘hsa_queue_t’
   hsa_status_t HSA_API hsa_cache_get_info(hsa_cache_t cache, hsa_cache_info_t attribute,
                                           ^~~~~~~~~~~
                                           hsa_queue_t
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:77:62: error: ‘hsa_cache_info_t’ was not declared in this scope
   hsa_status_t HSA_API hsa_cache_get_info(hsa_cache_t cache, hsa_cache_info_t attribute,
                                                              ^~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:77:62: note: suggested alternative: ‘hsa_isa_info_t’
   hsa_status_t HSA_API hsa_cache_get_info(hsa_cache_t cache, hsa_cache_info_t attribute,
                                                              ^~~~~~~~~~~~~~~~
                                                              hsa_isa_info_t
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:78:43: error: expected primary-expression before ‘void’
                                           void* value);
                                           ^~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:78:54: error: expression list treated as compound expression in initializer [-fpermissive]
                                           void* value);
                                                      ^
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:80:50: error: expected ‘,’ or ‘...’ before ‘(’ token
       hsa_agent_t agent, hsa_status_t (*callback)(hsa_cache_t cache, void* data), void* value);
                                                  ^
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:89:56: error: ‘hsa_queue_type32_t’ has not been declared
     hsa_queue_create(hsa_agent_t agent, uint32_t size, hsa_queue_type32_t type,
                                                        ^~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:96:5: error: ‘hsa_queue_type32_t’ has not been declared
     hsa_queue_type32_t type, uint32_t features,
     ^~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:162:48: error: ‘hsa_signal_group_t’ has not been declared
                                                hsa_signal_group_t* signal_group);
                                                ^~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:163:49: error: ‘hsa_signal_group_t’ was not declared in this scope
   hsa_status_t HSA_API hsa_signal_group_destroy(hsa_signal_group_t signal_group);
                                                 ^~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:163:49: note: suggested alternative: ‘hsa_signal_value_t’
   hsa_status_t HSA_API hsa_signal_group_destroy(hsa_signal_group_t signal_group);
                                                 ^~~~~~~~~~~~~~~~~~
                                                 hsa_signal_value_t
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:164:60: error: ‘hsa_signal_group_t’ was not declared in this scope
   hsa_status_t HSA_API hsa_signal_group_wait_any_scacquire(hsa_signal_group_t signal_group,
                                                            ^~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:164:60: note: suggested alternative: ‘hsa_signal_value_t’
   hsa_status_t HSA_API hsa_signal_group_wait_any_scacquire(hsa_signal_group_t signal_group,
                                                            ^~~~~~~~~~~~~~~~~~
                                                            hsa_signal_value_t
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:165:60: error: expected primary-expression before ‘const’
                                                            const hsa_signal_condition_t* conditions,
                                                            ^~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:166:60: error: expected primary-expression before ‘const’
                                                            const hsa_signal_value_t* compare_values,
                                                            ^~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:167:77: error: expected primary-expression before ‘wait_state_hint’
                                                            hsa_wait_state_t wait_state_hint,
                                                                             ^~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:168:72: error: expected primary-expression before ‘*’ token
                                                            hsa_signal_t* signal,
                                                                        ^
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:168:74: error: ‘signal’ was not declared in this scope
                                                            hsa_signal_t* signal,
                                                                          ^~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:168:74: note: suggested alternative: ‘signed’
                                                            hsa_signal_t* signal,
                                                                          ^~~~~~
                                                                          signed
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:169:78: error: expected primary-expression before ‘*’ token
                                                            hsa_signal_value_t* value);
                                                                              ^
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:169:80: error: ‘value’ was not declared in this scope
                                                            hsa_signal_value_t* value);
                                                                                ^~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:169:80: note: suggested alternative: ‘valloc’
                                                            hsa_signal_value_t* value);
                                                                                ^~~~~
                                                                                valloc
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:169:85: error: expression list treated as compound expression in initializer [-fpermissive]
                                                            hsa_signal_value_t* value);
                                                                                     ^
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:170:58: error: ‘hsa_signal_group_t’ was not declared in this scope
   hsa_status_t HSA_API hsa_signal_group_wait_any_relaxed(hsa_signal_group_t signal_group,
                                                          ^~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:170:58: note: suggested alternative: ‘hsa_signal_value_t’
   hsa_status_t HSA_API hsa_signal_group_wait_any_relaxed(hsa_signal_group_t signal_group,
                                                          ^~~~~~~~~~~~~~~~~~
                                                          hsa_signal_value_t
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:171:58: error: expected primary-expression before ‘const’
                                                          const hsa_signal_condition_t* conditions,
                                                          ^~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:172:58: error: expected primary-expression before ‘const’
                                                          const hsa_signal_value_t* compare_values,
                                                          ^~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:173:75: error: expected primary-expression before ‘wait_state_hint’
                                                          hsa_wait_state_t wait_state_hint,
                                                                           ^~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:174:70: error: expected primary-expression before ‘*’ token
                                                          hsa_signal_t* signal,
                                                                      ^
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:174:72: error: ‘signal’ was not declared in this scope
                                                          hsa_signal_t* signal,
                                                                        ^~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:174:72: note: suggested alternative: ‘signed’
                                                          hsa_signal_t* signal,
                                                                        ^~~~~~
                                                                        signed
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:175:76: error: expected primary-expression before ‘*’ token
                                                          hsa_signal_value_t* value);
                                                                            ^
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:175:78: error: ‘value’ was not declared in this scope
                                                          hsa_signal_value_t* value);
                                                                              ^~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:175:78: note: suggested alternative: ‘valloc’
                                                          hsa_signal_value_t* value);
                                                                              ^~~~~
                                                                              valloc
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:175:83: error: expression list treated as compound expression in initializer [-fpermissive]
                                                          hsa_signal_value_t* value);
                                                                                   ^
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:247:7: error: ‘hsa_fp_type_t’ has not been declared
       hsa_fp_type_t fp_type,
       ^~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:248:7: error: ‘hsa_flush_mode_t’ has not been declared
       hsa_flush_mode_t flush_mode,
       ^~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:249:7: error: ‘hsa_round_method_t’ has not been declared
       hsa_round_method_t *round_method);
       ^~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:251:7: error: ‘hsa_wavefront_t’ was not declared in this scope
       hsa_wavefront_t wavefront,
       ^~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:251:7: note: suggested alternative: ‘hsa_agent_t’
       hsa_wavefront_t wavefront,
       ^~~~~~~~~~~~~~~
       hsa_agent_t
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:252:7: error: ‘hsa_wavefront_info_t’ was not declared in this scope
       hsa_wavefront_info_t attribute,
       ^~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:252:7: note: suggested alternative: ‘hsa_agent_info_t’
       hsa_wavefront_info_t attribute,
       ^~~~~~~~~~~~~~~~~~~~
       hsa_agent_info_t
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:253:7: error: expected primary-expression before ‘void’
       void *value);
       ^~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:253:18: error: expression list treated as compound expression in initializer [-fpermissive]
       void *value);
                  ^
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:256:31: error: expected ‘,’ or ‘...’ before ‘(’ token
       hsa_status_t (*callback)(hsa_wavefront_t wavefront,
                               ^
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:309:7: error: ‘hsa_file_t’ was not declared in this scope
       hsa_file_t file,
       ^~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:309:7: note: suggested alternative: ‘hsa_isa_t’
       hsa_file_t file,
       ^~~~~~~~~~
       hsa_isa_t
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:310:7: error: ‘hsa_code_object_reader_t’ was not declared in this scope
       hsa_code_object_reader_t *code_object_reader);
       ^~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:310:7: note: suggested alternative: ‘hsa_code_object_type_t’
       hsa_code_object_reader_t *code_object_reader);
       ^~~~~~~~~~~~~~~~~~~~~~~~
       hsa_code_object_type_t
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:310:33: error: ‘code_object_reader’ was not declared in this scope
       hsa_code_object_reader_t *code_object_reader);
                                 ^~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:310:33: note: suggested alternative: ‘hsa_code_object_destroy’
       hsa_code_object_reader_t *code_object_reader);
                                 ^~~~~~~~~~~~~~~~~~
                                 hsa_code_object_destroy
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:310:51: error: expression list treated as compound expression in initializer [-fpermissive]
       hsa_code_object_reader_t *code_object_reader);
                                                   ^
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:314:7: error: ‘hsa_code_object_reader_t’ has not been declared
       hsa_code_object_reader_t *code_object_reader);
       ^~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:316:7: error: ‘hsa_code_object_reader_t’ was not declared in this scope
       hsa_code_object_reader_t code_object_reader);
       ^~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:316:7: note: suggested alternative: ‘hsa_code_object_type_t’
       hsa_code_object_reader_t code_object_reader);
       ^~~~~~~~~~~~~~~~~~~~~~~~
       hsa_code_object_type_t
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:336:7: error: ‘hsa_code_object_reader_t’ has not been declared
       hsa_code_object_reader_t code_object_reader,
       ^~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:338:7: error: ‘hsa_loaded_code_object_t’ has not been declared
       hsa_loaded_code_object_t *loaded_code_object);
       ^~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:342:7: error: ‘hsa_code_object_reader_t’ has not been declared
       hsa_code_object_reader_t code_object_reader,
       ^~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/hsa_internal.h:344:7: error: ‘hsa_loaded_code_object_t’ has not been declared
       hsa_loaded_code_object_t *loaded_code_object);
       ^~~~~~~~~~~~~~~~~~~~~~~~
In file included from /home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/runtime.h:58:0,
                 from /home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/amd_gpu_agent.h:53,
                 from /home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/runtime/amd_blit_kernel.cpp:49:
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/signal.h:526:24: error: ‘hsa_signal_group_t’ does not name a type; did you mean ‘hsa_signal_value_t’?
   static __forceinline hsa_signal_group_t Convert(SignalGroup* group) {
                        ^~~~~~~~~~~~~~~~~~
                        hsa_signal_value_t
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/signal.h:530:37: error: inline variables are only available with -std=c++1z or -std=gnu++1z [-Werror]
   static __forceinline SignalGroup* Convert(hsa_signal_group_t group) {
                                     ^~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/signal.h:530:45: error: ‘always_inline’ attribute ignored [-Werror=attributes]
   static __forceinline SignalGroup* Convert(hsa_signal_group_t group) {
                                             ^~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/signal.h:530:37: error: expected ‘;’ at end of member declaration
   static __forceinline SignalGroup* Convert(hsa_signal_group_t group) {
                                     ^~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/signal.h:530:64: error: expected ‘)’ before ‘group’
   static __forceinline SignalGroup* Convert(hsa_signal_group_t group) {
                                                                ^~~~~
In file included from /home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/amd_hsa_loader.hpp:50:0,
                 from /home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/amd_loader_context.hpp:46,
                 from /home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/runtime.h:64,
                 from /home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/amd_gpu_agent.h:53,
                 from /home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/runtime/amd_blit_kernel.cpp:49:
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_ven_amd_loader.h:276:5: error: ‘hsa_loaded_code_object_t’ has not been declared
     hsa_loaded_code_object_t loaded_code_object,
     ^~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_ven_amd_loader.h:395:3: error: ‘hsa_loaded_code_object_t’ was not declared in this scope
   hsa_loaded_code_object_t loaded_code_object,
   ^~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_ven_amd_loader.h:395:3: note: suggested alternative: ‘hsa_code_object_t’
   hsa_loaded_code_object_t loaded_code_object,
   ^~~~~~~~~~~~~~~~~~~~~~~~
   hsa_code_object_t
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_ven_amd_loader.h:396:48: error: expected primary-expression before ‘attribute’
   hsa_ven_amd_loader_loaded_code_object_info_t attribute,
                                                ^~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_ven_amd_loader.h:397:3: error: expected primary-expression before ‘void’
   void *value);
   ^~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_ven_amd_loader.h:397:14: error: expression list treated as compound expression in initializer [-fpermissive]
   void *value);
              ^
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_ven_amd_loader.h:443:7: error: ‘hsa_loaded_code_object_t’ has not been declared
       hsa_loaded_code_object_t loaded_code_object,
       ^~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/inc/hsa_ven_amd_loader.h:448:5: error: ‘hsa_loaded_code_object_t’ has not been declared
     hsa_loaded_code_object_t loaded_code_object,
     ^~~~~~~~~~~~~~~~~~~~~~~~
In file included from /home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/amd_loader_context.hpp:46:0,
                 from /home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/runtime.h:64,
                 from /home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/amd_gpu_agent.h:53,
                 from /home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/runtime/amd_blit_kernel.cpp:49:
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/amd_hsa_loader.hpp:189:10: error: ‘hsa_loaded_code_object_t’ does not name a type; did you mean ‘hsa_code_object_t’?
   static hsa_loaded_code_object_t Handle(LoadedCodeObject *object) {
          ^~~~~~~~~~~~~~~~~~~~~~~~
          hsa_code_object_t
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/amd_hsa_loader.hpp:195:28: error: expected ‘;’ at end of member declaration
   static LoadedCodeObject* Object(hsa_loaded_code_object_t handle) {
                            ^~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/amd_hsa_loader.hpp:195:60: error: expected ‘)’ before ‘handle’
   static LoadedCodeObject* Object(hsa_loaded_code_object_t handle) {
                                                            ^~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/amd_hsa_loader.hpp:294:5: error: ‘hsa_loaded_code_object_t’ has not been declared
     hsa_loaded_code_object_t *loaded_code_object = nullptr) = 0;
     ^~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/amd_hsa_loader.hpp:301:5: error: ‘hsa_loaded_code_object_t’ has not been declared
     hsa_loaded_code_object_t *loaded_code_object = nullptr) = 0;
     ^~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/amd_hsa_loader.hpp:342:7: error: ‘hsa_loaded_code_object_t’ has not been declared
       hsa_loaded_code_object_t loaded_code_object,
       ^~~~~~~~~~~~~~~~~~~~~~~~
In file included from /home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/amd_gpu_agent.h:57:0,
                 from /home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/runtime/amd_blit_kernel.cpp:49:
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/cache.h:56:24: error: ‘hsa_cache_t’ does not name a type; did you mean ‘hsa_queue_t’?
   static __forceinline hsa_cache_t Convert(const Cache* cache) {
                        ^~~~~~~~~~~
                        hsa_queue_t
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/cache.h:60:45: error: ‘hsa_cache_t’ does not name a type; did you mean ‘hsa_queue_t’?
   static __forceinline Cache* Convert(const hsa_cache_t cache) {
                                             ^~~~~~~~~~~
                                             hsa_queue_t
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/cache.h:70:24: error: ‘hsa_cache_info_t’ has not been declared
   hsa_status_t GetInfo(hsa_cache_info_t attribute, void* value);
                        ^~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/cache.h: In static member function ‘static core::Cache* core::Cache::Convert(int)’:
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/cache.h:61:66: error: request for member ‘handle’ in ‘cache’, which is of non-class type ‘const int’
     return reinterpret_cast<Cache*>(static_cast<uintptr_t>(cache.handle));
                                                                  ^~~~~~
In file included from /home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/runtime/amd_blit_kernel.cpp:49:0:
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/amd_gpu_agent.h: At global scope:
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/amd_gpu_agent.h:231:53: error: expected ‘,’ or ‘...’ before ‘(’ token
   hsa_status_t IterateCache(hsa_status_t (*callback)(hsa_cache_t cache, void* data),
                                                     ^
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/inc/amd_gpu_agent.h:259:41: error: ‘hsa_queue_type32_t’ has not been declared
   hsa_status_t QueueCreate(size_t size, hsa_queue_type32_t queue_type,
                                         ^~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/runtime/amd_blit_kernel.cpp: In member function ‘virtual hsa_status_t amd::BlitKernel::SubmitLinearCopyCommand(void*, const void*, size_t, std::vector<core::Signal*>&, core::Signal&)’:
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/runtime/amd_blit_kernel.cpp:630:32: error: ‘HSA_PACKET_HEADER_SCACQUIRE_FENCE_SCOPE’ was not declared in this scope
       (HSA_FENCE_SCOPE_NONE << HSA_PACKET_HEADER_SCACQUIRE_FENCE_SCOPE) |
                                ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/runtime/amd_blit_kernel.cpp:630:32: note: suggested alternative: ‘HSA_PACKET_HEADER_ACQUIRE_FENCE_SCOPE’
       (HSA_FENCE_SCOPE_NONE << HSA_PACKET_HEADER_SCACQUIRE_FENCE_SCOPE) |
                                ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
                                HSA_PACKET_HEADER_ACQUIRE_FENCE_SCOPE
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/runtime/amd_blit_kernel.cpp:631:33: error: ‘HSA_PACKET_HEADER_SCRELEASE_FENCE_SCOPE’ was not declared in this scope
       (HSA_FENCE_SCOPE_AGENT << HSA_PACKET_HEADER_SCRELEASE_FENCE_SCOPE);
                                 ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/runtime/amd_blit_kernel.cpp:631:33: note: suggested alternative: ‘HSA_PACKET_HEADER_RELEASE_FENCE_SCOPE’
       (HSA_FENCE_SCOPE_AGENT << HSA_PACKET_HEADER_SCRELEASE_FENCE_SCOPE);
                                 ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
                                 HSA_PACKET_HEADER_RELEASE_FENCE_SCOPE
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/runtime/amd_blit_kernel.cpp: In member function ‘void amd::BlitKernel::PopulateQueue(uint64_t, uint64_t, void*, uint32_t, hsa_signal_t)’:
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/runtime/amd_blit_kernel.cpp:812:34: error: ‘HSA_PACKET_HEADER_SCACQUIRE_FENCE_SCOPE’ was not declared in this scope
       (HSA_FENCE_SCOPE_SYSTEM << HSA_PACKET_HEADER_SCACQUIRE_FENCE_SCOPE) |
                                  ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/runtime/amd_blit_kernel.cpp:812:34: note: suggested alternative: ‘HSA_PACKET_HEADER_ACQUIRE_FENCE_SCOPE’
       (HSA_FENCE_SCOPE_SYSTEM << HSA_PACKET_HEADER_SCACQUIRE_FENCE_SCOPE) |
                                  ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
                                  HSA_PACKET_HEADER_ACQUIRE_FENCE_SCOPE
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/runtime/amd_blit_kernel.cpp:813:34: error: ‘HSA_PACKET_HEADER_SCRELEASE_FENCE_SCOPE’ was not declared in this scope
       (HSA_FENCE_SCOPE_SYSTEM << HSA_PACKET_HEADER_SCRELEASE_FENCE_SCOPE);
                                  ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
/home/yuq/workspace/projects/amd/rocm/ROCR-Runtime/src/core/runtime/amd_blit_kernel.cpp:813:34: note: suggested alternative: ‘HSA_PACKET_HEADER_RELEASE_FENCE_SCOPE’
       (HSA_FENCE_SCOPE_SYSTEM << HSA_PACKET_HEADER_SCRELEASE_FENCE_SCOPE);
                                  ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
                                  HSA_PACKET_HEADER_RELEASE_FENCE_SCOPE
cc1plus: all warnings being treated as errors
CMakeFiles/hsa-runtime64.dir/build.make:134: recipe for target 'CMakeFiles/hsa-runtime64.dir/core/runtime/amd_blit_kernel.cpp.o' failed
make[2]: *** [CMakeFiles/hsa-runtime64.dir/core/runtime/amd_blit_kernel.cpp.o] Error 1
CMakeFiles/Makefile2:67: recipe for target 'CMakeFiles/hsa-runtime64.dir/all' failed
make[1]: *** [CMakeFiles/hsa-runtime64.dir/all] Error 2
Makefile:151: recipe for target 'all' failed
make: *** [all] Error 2

ROCR Runtime Build Fails with GCC 8.x

Build log:

-- The C compiler identification is GNU 8.2.1
-- The CXX compiler identification is GNU 8.2.1
-- Check for working C compiler: /usr/bin/cc
-- Check for working C compiler: /usr/bin/cc -- works
-- Detecting C compiler ABI info
-- Detecting C compiler ABI info - done
-- Detecting C compile features
-- Detecting C compile features - done
-- Check for working CXX compiler: /usr/bin/c++
-- Check for working CXX compiler: /usr/bin/c++ -- works
-- Detecting CXX compiler ABI info
-- Detecting CXX compiler ABI info - done
-- Detecting CXX compile features
-- Detecting CXX compile features - done
-- Found LibElf: /usr/lib/libelf.so
-- Performing Test ELF_GETSHDRSTRNDX
-- Performing Test ELF_GETSHDRSTRNDX - Success
-- Looking for __NR_memfd_create
-- Looking for __NR_memfd_create - found
-- Configuring done
-- Generating done
-- Build files have been written to: /tmp/makepkg/hsa-rocr/src/ROCR-Runtime/build
Scanning dependencies of target hsa-link
Scanning dependencies of target hsa-runtime64.so-link
Scanning dependencies of target hsa-runtime64
[  0%] Built target hsa-link
[  0%] Built target hsa-runtime64.so-link
[  2%] Building CXX object CMakeFiles/hsa-runtime64.dir/core/runtime/amd_blit_sdma.cpp.o
[  5%] Building CXX object CMakeFiles/hsa-runtime64.dir/core/util/small_heap.cpp.o
[ 11%] Building CXX object CMakeFiles/hsa-runtime64.dir/core/util/timer.cpp.o
[ 11%] Building CXX object CMakeFiles/hsa-runtime64.dir/core/util/lnx/os_linux.cpp.o
[ 16%] Building CXX object CMakeFiles/hsa-runtime64.dir/core/runtime/amd_blit_kernel.cpp.o
[ 16%] Building CXX object CMakeFiles/hsa-runtime64.dir/core/runtime/default_signal.cpp.o
[ 19%] Building CXX object CMakeFiles/hsa-runtime64.dir/core/runtime/amd_gpu_agent.cpp.o
[ 22%] Building CXX object CMakeFiles/hsa-runtime64.dir/core/runtime/amd_cpu_agent.cpp.o
[ 25%] Building CXX object CMakeFiles/hsa-runtime64.dir/core/runtime/hsa_ven_amd_loader.cpp.o
[ 27%] Building CXX object CMakeFiles/hsa-runtime64.dir/core/runtime/amd_aql_queue.cpp.o
[ 30%] Building CXX object CMakeFiles/hsa-runtime64.dir/core/runtime/amd_loader_context.cpp.o
[ 33%] Building CXX object CMakeFiles/hsa-runtime64.dir/core/runtime/amd_memory_region.cpp.o
[ 36%] Building CXX object CMakeFiles/hsa-runtime64.dir/core/runtime/amd_topology.cpp.o
[ 44%] Building CXX object CMakeFiles/hsa-runtime64.dir/core/runtime/hsa_api_trace.cpp.o
[ 44%] Building CXX object CMakeFiles/hsa-runtime64.dir/core/runtime/host_queue.cpp.o
[ 50%] Building CXX object CMakeFiles/hsa-runtime64.dir/core/runtime/interrupt_signal.cpp.o
[ 50%] Building CXX object CMakeFiles/hsa-runtime64.dir/core/runtime/intercept_queue.cpp.o
[ 44%] Building CXX object CMakeFiles/hsa-runtime64.dir/core/runtime/hsa.cpp.o
[ 55%] Building CXX object CMakeFiles/hsa-runtime64.dir/core/runtime/hsa_ext_amd.cpp.o
[ 55%] Building CXX object CMakeFiles/hsa-runtime64.dir/core/runtime/hsa_ext_interface.cpp.o
[ 58%] Building CXX object CMakeFiles/hsa-runtime64.dir/core/runtime/ipc_signal.cpp.o
[ 61%] Building CXX object CMakeFiles/hsa-runtime64.dir/core/runtime/isa.cpp.o
[ 63%] Building CXX object CMakeFiles/hsa-runtime64.dir/core/runtime/runtime.cpp.o
[ 66%] Building CXX object CMakeFiles/hsa-runtime64.dir/core/runtime/cache.cpp.o
[ 69%] Building CXX object CMakeFiles/hsa-runtime64.dir/core/runtime/signal.cpp.o
[ 72%] Building CXX object CMakeFiles/hsa-runtime64.dir/core/runtime/queue.cpp.o
[ 77%] Building CXX object CMakeFiles/hsa-runtime64.dir/loader/executable.cpp.o
[ 77%] Building CXX object CMakeFiles/hsa-runtime64.dir/core/common/shared.cpp.o
[ 80%] Building CXX object CMakeFiles/hsa-runtime64.dir/core/common/hsa_table_interface.cpp.o
[ 83%] Building CXX object CMakeFiles/hsa-runtime64.dir/loader/loaders.cpp.o
[ 86%] Building CXX object CMakeFiles/hsa-runtime64.dir/libamdhsacode/amd_elf_image.cpp.o
[ 88%] Building CXX object CMakeFiles/hsa-runtime64.dir/libamdhsacode/amd_hsa_code_util.cpp.o
[ 91%] Building CXX object CMakeFiles/hsa-runtime64.dir/libamdhsacode/amd_hsa_locks.cpp.o
[ 94%] Building CXX object CMakeFiles/hsa-runtime64.dir/libamdhsacode/amd_options.cpp.o
[ 97%] Building CXX object CMakeFiles/hsa-runtime64.dir/libamdhsacode/amd_hsa_code.cpp.o
/tmp/makepkg/hsa-rocr/src/ROCR-Runtime/src/core/runtime/runtime.cpp: In member function ‘void core::Runtime::SetLinkCount(size_t)’:
/tmp/makepkg/hsa-rocr/src/ROCR-Runtime/src/core/runtime/runtime.cpp:260:71: error: ‘void* memset(void*, int, size_t)’ clearing an object of non-trivial type ‘__gnu_cxx::__alloc_traits<std::allocator<core::Runtime::LinkInfo>, core::Runtime::LinkInfo>::value_type’ {aka ‘struct core::Runtime::LinkInfo’}; use assignment or value-initialization instead [-Werror=class-memaccess]
          link_matrix_.size() * sizeof(hsa_amd_memory_pool_link_info_t));
                                                                       ^
In file included from /tmp/makepkg/hsa-rocr/src/ROCR-Runtime/src/core/runtime/runtime.cpp:43:
/tmp/makepkg/hsa-rocr/src/ROCR-Runtime/src/core/inc/runtime.h:97:10: note: ‘__gnu_cxx::__alloc_traits<std::allocator<core::Runtime::LinkInfo>, core::Runtime::LinkInfo>::value_type’ {aka ‘struct core::Runtime::LinkInfo’} declared here
   struct LinkInfo {
          ^~~~~~~~
cc1plus: all warnings being treated as errors
make[2]: *** [CMakeFiles/hsa-runtime64.dir/build.make:349: CMakeFiles/hsa-runtime64.dir/core/runtime/runtime.cpp.o] Error 1
make[1]: *** [CMakeFiles/Makefile2:73: CMakeFiles/hsa-runtime64.dir/all] Error 2
make: *** [Makefile:152: all] Error 2
==> ERROR: A failure occurred in build().
    Aborting...

This diff fixes GCC 8.x compatibility:

diff --git a/src/core/runtime/runtime.cpp b/src/core/runtime/runtime.cpp
index 0625c6f..4875c2b 100644
--- a/src/core/runtime/runtime.cpp
+++ b/src/core/runtime/runtime.cpp
@@ -256,8 +256,7 @@ void Runtime::SetLinkCount(size_t num_link) {
   const size_t last_index = GetIndexLinkInfo(0, num_link);
   link_matrix_.resize(last_index);
 
-  memset(&link_matrix_[0], 0,
-         link_matrix_.size() * sizeof(hsa_amd_memory_pool_link_info_t));
+  std::fill(link_matrix_.begin(), link_matrix_.end(), LinkInfo());
 }
 
 void Runtime::RegisterLinkInfo(uint32_t node_id_from, uint32_t node_id_to,
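
The change above can be illustrated in isolation. This is a minimal sketch, not the runtime's actual code: the LinkInfo fields and the ResetLinks helper are hypothetical stand-ins, chosen only to show why GCC 8 rejects memset on a type with a user-provided constructor and how std::fill avoids the warning. Note also that the removed memset computed its byte count with sizeof(hsa_amd_memory_pool_link_info_t) while the vector holds LinkInfo elements, so the element-wise fill sidesteps that size mismatch as well.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <type_traits>
#include <vector>

// Hypothetical stand-in for core::Runtime::LinkInfo; the real fields may differ.
struct LinkInfo {
  LinkInfo() : num_hop(0), min_latency(0) {}  // user-provided constructor
  uint32_t num_hop;
  uint32_t min_latency;
};

// A user-provided default constructor makes the type non-trivial, which is
// what -Wclass-memaccess complains about when memset is applied to it.
static_assert(!std::is_trivial<LinkInfo>::value, "LinkInfo is non-trivial");

// Hypothetical helper: resize the table and reset every entry by assignment
// from a default-constructed value instead of overwriting raw bytes.
void ResetLinks(std::vector<LinkInfo>& links, std::size_t count) {
  links.resize(count);
  assert(links.size() == count);
  std::fill(links.begin(), links.end(), LinkInfo());
}
```

Calling ResetLinks(links, n) on a vector that already holds non-zero entries leaves n entries, each equal to a default-constructed LinkInfo.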

Setup to make it work on virtual machine

Hi, I opened this new issue from an earlier one to give it a more specific topic.
I'm not sure which repo is the more suitable place for it.

I want to run the AMD ROC framework in a virtual machine, but I have not been able to get it working in my environment.
Has anyone managed to run it in a virtual machine? If so, could you tell me your setup? I can use Xen or KVM, and I am currently trying KVM.

Here is my setup:

Host
CPU: Intel Core i7 6700 (IOMMU enabled)
GPU: AMD Radeon RX 480 8GB
OS: Ubuntu 16.04.2 LTS (kernel 4.11)
I confirmed that the sample works when run directly on the host machine with kernel 4.9.0-kfd (ROCK 1.6).

Guest
Hypervisor: QEMU(2.7.92)-KVM(kernel 4.11)
Guest OS: Ubuntu 16.04.2 LTS (kernel 4.9.0-kfd)
VGA passthrough via vfio-pci
QEMU script:

sudo vfio-bind 0000:01:00.0 0000:01:00.1 &&

sudo qemu-system-x86_64 -enable-kvm -cpu host,kvm=off \
-m 8192 -smp 8 -M q35 \
-vga none -nographic \
-usb \
-device usb-host,hostbus=1,hostaddr=2 \
-device vfio-pci,host=01:00.0,multifunction=on,x-vga=on \
-device vfio-pci,host=01:00.1 \
-drive if=pflash,format=raw,readonly,file=OVMF_CODE.fd \
-drive if=pflash,format=raw,file=OVMF_VARS.fd \
-hda ubuntu.qcow2 \
-net nic \
-net user,hostfwd=tcp::54321-:22

Both the guest and the host freeze when I use the ioh3420 PCIe controller, so I attach the GPU directly to the root PCI bus.

My error message is

kfd kfd: skipped device (1002:67df), PCI rejects atomics

and as a result no GPU agent is created.
I am sure my hardware supports PCIe atomics; it is only the virtualized controller that doesn't.

How can I make it work?
Any suggestion?

Thanks in advance!

GPU noise and power draw after ROCm programs terminate

So, I've been using ROCm on ArchLinux with an RX Vega 56 and have had the following issue:

After any ROCm program terminates, and if no other ROCm program is running, the GPU stays at around 50% utilization (judging by the lights on the card) and emits a high-pitched noise. The problem goes away if I log out of my window manager (sway on Wayland, if that helps) and run rocm-smi --gpu-reset -d 0 to reset the card.

This happens not only with HIP programs but also with HC programs and with the ROCm OpenCL implementation.

The OpenCL platform from AMDGPU does not have this issue.

Any idea what gives?
