Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

some questions on compiling #1

Open
victoryang00 opened this issue Feb 13, 2020 · 0 comments
Open

some questions on compiling #1

victoryang00 opened this issue Feb 13, 2020 · 0 comments

Comments

@victoryang00
Copy link

follow the instruction using your .s file I get

 System minor 0
 System major 3
 agent prop name Device 66a1
hip Device prop succeeded 
*** Error in `/public/home/caspra120/sgemm_strided_batch_test/lib/./sgemm_strided_batched_test': free(): invalid size: 0x00007ffe096d8320 ***
======= Backtrace: =========
/lib64/libc.so.6(+0x81489)[0x2b70d6ccd489]
/public/home/caspra120/sgemm_strided_batch_test/lib/./sgemm_strided_batched_test[0x405254]
/public/home/caspra120/sgemm_strided_batch_test/lib/./sgemm_strided_batched_test[0x405ec1]
/public/home/caspra120/sgemm_strided_batch_test/lib/./sgemm_strided_batched_test[0x40669e]
/lib64/libc.so.6(__libc_start_main+0xf5)[0x2b70d6c6e3d5]
/public/home/caspra120/sgemm_strided_batch_test/lib/./sgemm_strided_batched_test[0x402684]
======= Memory map: ========
00400000-0040a000 r-xp 00000000 00:29 5856331270                         /public/home/caspra120/sgemm_strided_batch_test/lib/sgemm_strided_batched_test
00609000-0060a000 r-xp 00009000 00:29 5856331270                         /public/home/caspra120/sgemm_strided_batch_test/lib/sgemm_strided_batched_test
0060a000-0060c000 rwxp 0000a000 00:29 5856331270                         /public/home/caspra120/sgemm_strided_batch_test/lib/sgemm_strided_batched_test
0134c000-02b86000 rwxp 00000000 00:00 0                                  [heap]
2b70bfc3c000-2b70bfc5e000 r-xp 00000000 08:03 1320025                    /usr/lib64/ld-2.17.so
2b70bfc5e000-2b70bfc61000 rwxp 00000000 00:00 0 
2b70bfc61000-2b70bfc62000 rwxs 00000000 00:05 63663                      /dev/kfd
2b70bfc62000-2b70bfc63000 rwxp 00000000 00:00 0 
2b70bfc63000-2b70bfc64000 rwxs 00000000 00:05 63663                      /dev/kfd
2b70bfc65000-2b70bfc66000 rwxs 00000000 00:05 63663                      /dev/kfd
2b70bfc67000-2b70bfc68000 rwxs 00000000 00:05 63663                      /dev/kfd
2b70bfc69000-2b70bfc6a000 rwxp 00000000 00:00 0 
2b70bfc6c000-2b70bfc6e000 rwxp 00000000 00:00 0 
2b70bfc70000-2b70bfc78000 rwxs 100004000 00:05 53492                     /dev/dri/renderD128
2b70bfc79000-2b70bfc7a000 rwxp 00000000 00:00 0 
2b70bfc7c000-2b70bfc7e000 rwxp 00000000 00:00 0 
2b70bfc7f000-2b70bfc80000 rwxp 00000000 00:00 0 
2b70bfc82000-2b70bfc84000 rwxp 00000000 00:00 0 
2b70bfc85000-2b70bfc86000 rwxp 00000000 00:00 0 
2b70bfc88000-2b70bfc8a000 rwxp 00000000 00:00 0 
2b70bfc8b000-2b70bfc8c000 rwxp 00000000 00:00 0 
2b70bfc8d000-2b70bfc8e000 rwxp 00000000 00:00 0 
2b70bfc90000-2b70bfc9d000 rwxp 00000000 00:00 0 
2b70bfc9d000-2b70bfcd2000 r-xs 00000000 08:03 12977936                   /var/db/nscd/passwd
2b70bfcd4000-2b70bfcd6000 rwxp 00000000 00:00 0 
2b70bfcd7000-2b70bfcd8000 rwxp 00000000 00:00 0 
2b70bfcd9000-2b70bfcda000 ---p 14da5d000 00:05 53492                     /dev/dri/renderD128
2b70bfcdc000-2b70bfcde000 rwxs 00000000 00:05 63663                      /dev/kfd
2b70bfcdf000-2b70bfce0000 rwxp 00000000 00:00 0 
2b70bfce1000-2b70bfce2000 rwxp 00000000 00:00 0 
2b70bfce3000-2b70bfce4000 rwxp 00000000 00:00 0 
2b70bfce5000-2b70bfce6000 rwxp 00000000 00:00 0 
2b70bfd00000-2b70bfd80000 rwxp 00000000 00:00 0 
2b70bfdc0000-2b70bfe00000 rwxp 00000000 00:00 0 
2b70bfe5d000-2b70bfe5e000 r-xp 00021000 08:03 1320025                    /usr/lib64/ld-2.17.so
2b70bfe5e000-2b70bfe5f000 rwxp 00022000 08:03 1320025                    /usr/lib64/ld-2.17.so
2b70bfe5f000-2b70bfe60000 rwxp 00000000 00:00 0 
2b70bfe60000-2b70bff49000 r-xp 00000000 08:03 1320335                    /usr/lib64/libstdc++.so.6.0.19
2b70bff49000-2b70c0148000 ---p 000e9000 08:03 1320335                    /usr/lib64/libstdc++.so.6.0.19
2b70c0148000-2b70c0150000 r-xp 000e8000 08:03 1320335                    /usr/lib64/libstdc++.so.6.0.19
2b70c0150000-2b70c0152000 rwxp 000f0000 08:03 1320335                    /usr/lib64/libstdc++.so.6.0.19
2b70c0152000-2b70c0167000 rwxp 00000000 00:00 0 
2b70c0167000-2b70c0169000 r-xp 00000000 08:03 1320038                    /usr/lib64/libdl-2.17.so
2b70c0169000-2b70c0369000 ---p 00002000 08:03 1320038                    /usr/lib64/libdl-2.17.so
2b70c0369000-2b70c036a000 r-xp 00002000 08:03 1320038                    /usr/lib64/libdl-2.17.so
2b70c036a000-2b70c036b000 rwxp 00003000 08:03 1320038                    /usr/lib64/libdl-2.17.so
2b70c036b000-2b70c046c000 r-xp 00000000 08:03 1320040                    /usr/lib64/libm-2.17.so
2b70c046c000-2b70c066b000 ---p 00101000 08:03 1320040                    /usr/lib64/libm-2.17.so
2b70c066b000-2b70c066c000 r-xp 00100000 08:03 1320040                    /usr/lib64/libm-2.17.so
2b70c066c000-2b70c066d000 rwxp 00101000 08:03 1320040                    /usr/lib64/libm-2.17.so
2b70c066d000-2b70c0684000 r-xp 00000000 08:03 1320058                    /usr/lib64/libpthread-2.17.so
2b70c0684000-2b70c0883000 ---p 00017000 08:03 1320058                    /usr/lib64/libpthread-2.17.so
2b70c0883000-2b70c0884000 r-xp 00016000 08:03 1320058                    /usr/lib64/libpthread-2.17.so
2b70c0884000-2b70c0885000 rwxp 00017000 08:03 1320058                    /usr/lib64/libpthread-2.17.so
2b70c0885000-2b70c0889000 rwxp 00000000 00:00 0 
2b70c0889000-2b70c0894000 r-xp 00000000 08:03 6032106                    /opt/rocm/hcc/lib/libhc_am.so.2.9
2b70c0894000-2b70c0a93000 ---p 0000b000 08:03 6032106                    /opt/rocm/hcc/lib/libhc_am.so.2.9
2b70c0a93000-2b70c0a94000 r-xp 0000a000 08:03 6032106                    /opt/rocm/hcc/lib/libhc_am.so.2.9
2b70c0a94000-2b70c0afa000 rwxp 0000b000 08:03 6032106                    /opt/rocm/hcc/lib/libhc_am.so.2.9
2b70c0afa000-2b70c0b0f000 r-xp 00000000 08:03 6032109                    /opt/rocm/hcc/lib/libmcwamp.so.2.9
2b70c0b0f000-2b70c0d0e000 ---p 00015000 08:03 6032109                    /opt/rocm/hcc/lib/libmcwamp.so.2.9
2b70c0d0e000-2b70c0d0f000 r-xp 00014000 08:03 6032109                    /opt/rocm/hcc/lib/libmcwamp.so.2.9
2b70c0d0f000-2b70c0d54000 rwxp 00015000 08:03 6032109                    /opt/rocm/hcc/lib/libmcwamp.so.2.9
2b70c0d54000-2b70c0e9d000 r-xp 00000000 08:03 6033482                    /opt/rocm/hip/lib/libhip_hcc.so
2b70c0e9d000-2b70c109c000 ---p 00149000 08:03 6033482                    /opt/rocm/hip/lib/libhip_hcc.so
2b70c109c000-2b70c109e000 r-xp 00148000 08:03 6033482                    /opt/rocm/hip/lib/libhip_hcc.so
2b70c109e000-2b70c111d000 rwxp 0014a000 08:03 6033482                    /opt/rocm/hip/lib/libhip_hcc.so
2b70c111d000-2b70c155e000 rwxp 00000000 00:00 0 
2b70c155e000-2b70c1627000 r-xp 00000000 08:03 6031638                    /opt/rocm/hsa/lib/libhsa-runtime64.so.1.1.9
2b70c1627000-2b70c1826000 ---p 000c9000 08:03 6031638                    /opt/rocm/hsa/lib/libhsa-runtime64.so.1.1.9
2b70c1826000-2b70c182b000 r-xp 000c8000 08:03 6031638                    /opt/rocm/hsa/lib/libhsa-runtime64.so.1.1.9
2b70c182b000-2b70c182c000 rwxp 000cd000 08:03 6031638                    /opt/rocm/hsa/lib/libhsa-runtime64.so.1.1.9
2b70c182c000-2b70c182d000 rwxp 00000000 00:00 0 
2b70c182d000-2b70c187a000 r-xp 00000000 08:03 6033963                    /opt/rocm/profiler/CXLActivityLogger/bin/x86_64/libCXLActivityLogger.so
2b70c187a000-2b70c1a79000 ---p 0004d000 08:03 6033963                    /opt/rocm/profiler/CXLActivityLogger/bin/x86_64/libCXLActivityLogger.so
2b70c1a79000-2b70c1a7b000 r-xp 0004c000 08:03 6033963                    /opt/rocm/profiler/CXLActivityLogger/bin/x86_64/libCXLActivityLogger.so
2b70c1a7b000-2b70c1a7c000 rwxp 0004e000 08:03 6033963                    /opt/rocm/profiler/CXLActivityLogger/bin/x86_64/libCXLActivityLogger.so
2b70c1a7c000-2b70c1a7d000 r-xp 00000000 00:29 5586203398                 /public/home/caspra120/sgemm_strided_batch_test/lib/libcheckresult.so
2b70c1a7d000-2b70c1c7c000 ---p 00001000 00:29 5586203398                 /public/home/caspra120/sgemm_strided_batch_test/lib/libcheckresult.so
2b70c1c7c000-2b70c1c7d000 r-xp 00000000 00:29 5586203398                 /public/home/caspra120/sgemm_strided_batch_test/lib/libcheckresult.so
2b70c1c7d000-2b70c1c7e000 rwxp 00001000 00:29 5586203398                 /public/home/caspra120/sgemm_strided_batch_test/lib/libcheckresult.so
2b70c1c7e000-2b70c1c7f000 r-xp 00000000 00:29 5586203654                 /public/home/caspra120/sgemm_strided_batch_test/lib/libsgemm_strided_batched.so
2b70c1c7f000-2b70c1e7e000 ---p 00001000 00:29 5586203654                 /public/home/caspra120/sgemm_strided_batch_test/lib/libsgemm_strided_batched.so
2b70c1e7e000-2b70c1e7f000 r-xp 00000000 00:29 5586203654                 /public/home/caspra120/sgemm_strided_batch_test/lib/libsgemm_strided_batched.so
2b70c1e7f000-2b70c1e80000 rwxp 00001000 00:29 5586203654                 /public/home/caspra120/sgemm_strided_batch_test/lib/libsgemm_strided_batched.so
2b70c1e80000-2b70c20b4000 r-xp 00000000 08:03 1320697                    /usr/lib64/libcrypto.so.1.0.2k
2b70c20b4000-2b70c22b4000 ---p 00234000 08:03 1320697                    /usr/lib64/libcrypto.so.1.0.2k
2b70c22b4000-2b70c22d0000 r-xp 00234000 08:03 1320697                    /usr/lib64/libcrypto.so.1.0.2k
2b70c22d0000-2b70c22dd000 rwxp 00250000 08:03 1320697                    /usr/lib64/libcrypto.so.1.0.2k
2b70c22dd000-2b70c22e1000 rwxp 00000000 00:00 0 
2b70c22e1000-2b70d3141000 r-xp 00000000 08:03 6033990                    /opt/rocm/rocblas/lib/librocblas.so.0.1
2b70d3141000-2b70d3341000 ---p 10e60000 08:03 6033990                    /opt/rocm/rocblas/lib/librocblas.so.0.1
2b70d3341000-2b70d3368000 r-xp 10e60000 08:03 6033990                    /opt/rocm/rocblas/lib/librocblas.so.0.1
2b70d3368000-2b70d669c000 rwxp 10e87000 08:03 6033990                    /opt/rocm/rocblas/lib/librocblas.so.0.1
2b70d669c000-2b70d6a36000 rwxp 00000000 00:00 0 
2b70d6a36000-2b70d6a4b000 r-xp 00000000 08:03 1310739                    /usr/lib64/libgcc_s-4.8.5-20150702.so.1
2b70d6a4b000-2b70d6c4a000 ---p 00015000 08:03 1310739                    /usr/lib64/libgcc_s-4.8.5-20150702.so.1
2b70d6c4a000-2b70d6c4b000 r-xp 00014000 08:03 1310739                    /usr/lib64/libgcc_s-4.8.5-20150702.so.1
2b70d6c4b000-2b70d6c4c000 rwxp 00015000 08:03 1310739                    /usr/lib64/libgcc_s-4.8.5-20150702.so.1
2b70d6c4c000-2b70d6e0e000 r-xp 00000000 08:03 1320032                    /usr/lib64/libc-2.17.so
2b70d6e0e000-2b70d700e000 ---p 001c2000 08:03 1320032                    /usr/lib64/libc-2.17.so
2b70d700e000-2b70d7012000 r-xp 001c2000 08:03 1320032                    /usr/lib64/libc-2.17.so
2b70d7012000-2b70d7014000 rwxp 001c6000 08:03 1320032                    /usr/lib64/libc-2.17.so
2b70d7014000-2b70d7019000 rwxp 00000000 00:00 0 
2b70d7019000-2b70dc3ae000 r-xp 00000000 08:03 5905890                    /opt/rocm/lib/libamd_comgr.so.1.3
2b70dc3ae000-2b70dc5ad000 ---p 05395000 08:03 5905890                    /opt/rocm/lib/libamd_comgr.so.1.3
2b70dc5ad000-2b70dc989000 r-xp 05394000 08:03 5905890                    /opt/rocm/lib/libamd_comgr.so.1.3
2b70dc989000-2b70dc996000 rwxp 05770000 08:03 5905890                    /opt/rocm/lib/libamd_comgr.so.1.3
2b70dc996000-2b70dc9e7000 rwxp 00000000 00:00 0 
2b70dc9e7000-2b70dca06000 r-xp 00000000 08:03 5905898                    /opt/rocm/lib64/libhsakmt.so.1.0.6
2b70dca06000-2b70dcc06000 ---p 0001f000 08:03 5905898                    /opt/rocm/lib64/libhsakmt.so.1.0.6
2b70dcc06000-2b70dcc07000 r-xp 0001f000 08:03 5905898                    /opt/rocm/lib64/libhsakmt.so.1.0.6
2b70dcc07000-2b70dcc11000 rwxp 00020000 08:03 5905898                    /opt/rocm/lib64/libhsakmt.so.1.0.6
2b70dcc11000-2b70dcc26000 r-xp 00000000 08:03 1320344                    /usr/lib64/libz.so.1.2.7
2b70dcc26000-2b70dce25000 ---p 00015000 08:03 1320344                    /usr/lib64/libz.so.1.2.7
2b70dce25000-2b70dce26000 r-xp 00014000 08:03 1320344                    /usr/lib64/libz.so.1.2.7
2b70dce26000-2b70dce27000 rwxp 00015000 08:03 1320344                    /usr/lib64/libz.so.1.2.7
2b70dce27000-2b70dce2e000 r-xp 00000000 08:03 1320062                    /usr/lib64/librt-2.17.so
2b70dce2e000-2b70dd02d000 ---p 00007000 08:03 1320062                    /usr/lib64/librt-2.17.so
2b70dd02d000-2b70dd02e000 r-xp 00006000 08:03 1320062                    /usr/lib64/librt-2.17.so
2b70dd02e000-2b70dd02f000 rwxp 00007000 08:03 1320062                    /usr/lib64/librt-2.17.so
2b70dd02f000-2b70dd054000 r-xp 00000000 08:03 1320373                    /usr/lib64/libtinfo.so.5.9
2b70dd054000-2b70dd254000 ---p 00025000 08:03 1320373                    /usr/lib64/libtinfo.so.5.9
2b70dd254000-2b70dd258000 r-xp 00025000 08:03 1320373                    /usr/lib64/libtinfo.so.5.9
2b70dd258000-2b70dd259000 rwxp 00029000 08:03 1320373                    /usr/lib64/libtinfo.so.5.9
2b70dd259000-2b70dd263000 r-xp 00000000 08:03 1320901                    /usr/lib64/libnuma.so.1
2b70dd263000-2b70dd463000 ---p 0000a000 08:03 1320901                    /usr/lib64/libnuma.so.1
2b70dd463000-2b70dd464000 r-xp 0000a000 08:03 1320901                    /usr/lib64/libnuma.so.1
2b70dd464000-2b70dd465000 rwxp 0000b000 08:03 1320901                    /usr/lib64/libnuma.so.1
2b70dd465000-2b70dd471000 r-xp 00000000 08:03 1322493                    /usr/lib64/libpci.so.3.5.1
2b70dd471000-2b70dd670000 ---p 0000c000 08:03 1322493                    /usr/lib64/libpci.so.3.5.1
2b70dd670000-2b70dd671000 r-xp 0000b000 08:03 1322493                    /usr/lib64/libpci.so.3.5.1
2b70dd671000-2b70dd672000 rwxp 0000c000 08:03 1322493                    /usr/lib64/libpci.so.3.5.1
2b70dd672000-2b70dd688000 r-xp 00000000 08:03 1320060                    /usr/lib64/libresolv-2.17.so
2b70dd688000-2b70dd887000 ---p 00016000 08:03 1320060                    /usr/lib64/libresolv-2.17.so
2b70dd887000-2b70dd888000 r-xp 00015000 08:03 1320060                    /usr/lib64/libresolv-2.17.so
2b70dd888000-2b70dd889000 rwxp 00016000 08:03 1320060                    /usr/lib64/libresolv-2.17.so
2b70dd889000-2b70dd88b000 rwxp 00000000 00:00 0 
2b70dd88b000-2b70dd92e000 r-xp 00000000 08:03 6032115                    /opt/rocm/hcc/lib/libmcwamp_hsa.so.2.9
2b70dd92e000-2b70ddb2d000 ---p 000a3000 08:03 6032115                    /opt/rocm/hcc/lib/libmcwamp_hsa.so.2.9
2b70ddb2d000-2b70ddb30000 r-xp 000a2000 08:03 6032115                    /opt/rocm/hcc/lib/libmcwamp_hsa.so.2.9
2b70ddb30000-2b70ddf6f000 rwxp 000a5000 08:03 6032115                    /opt/rocm/hcc/lib/libmcwamp_hsa.so.2.9
2b70ddf6f000-2b70ddf70000 ---p 00000000 00:00 0 
2b70ddf70000-2b70de170000 rwxp 00000000 00:00 0 
2b70de170000-2b70de1c3000 r-xp 00000000 08:03 6031645                    /opt/rocm/hsa/lib/libhsa-ext-image64.so.1.1.9
2b70de1c3000-2b70de3c3000 ---p 00053000 08:03 6031645                    /opt/rocm/hsa/lib/libhsa-ext-image64.so.1.1.9
2b70de3c3000-2b70de3c4000 r-xp 00053000 08:03 6031645                    /opt/rocm/hsa/lib/libhsa-ext-image64.so.1.1.9
2b70de3c4000-2b70de434000 rwxp 00054000 08:03 6031645                    /opt/rocm/hsa/lib/libhsa-ext-image64.so.1.1.9
2b70de480000-2b70de500000 rwxp 00000000 00:00 0 
2b70de540000-2b70de5a0000 rwxp 00000000 00:00 0 
2b70de600000-2b70dea00000 rwxp 00000000 00:00 0 
2b70dea80000-2b70deb00000 rwxp 00000000 00:00 0 
2b70dec00000-2b70df000000 rwxp 00000000 00:00 0 
2b70df080000-2b70df100000 rwxp 00000000 00:00 0 
2b70df200000-2b70df600000 rwxp 00000000 00:00 0 
2b70df680000-2b70df700000 rwxp 00000000 00:00 0 
2b70df800000-2b70dfc00000 rwxp 00000000 00:00 0 
2b70dfc80000-2b70dfd00000 rwxp 00000000 00:00 0 
2b70dfd80000-2b70dfe00000 rwxp 00000000 00:00 0 
2b70dfe80000-2b70dff00000 rwxp 00000000 00:00 0 
2b70e0000000-2b70e0021000 rwxp 00000000 00:00 0 
2b70e0021000-2b70e4000000 ---p 00000000 00:00 0 
2b70e4200000-2b70e4600000 rwxp 00000000 00:00 0 
2b70e4800000-2b70e4c00000 rwxp 00000000 00:00 0 
2b70e4e00000-2b70e5200000 rwxp 00000000 00:00 0 
2b70e5400000-2b70e5800000 rwxp 00000000 00:00 0 
2b70e5a00000-2b70e5e00000 rwxp 00000000 00:00 0 
2b70e6000000-2b70e6400000 rwxp 00000000 00:00 0 
2b70e6600000-2b70e6a00000 rwxp 00000000 00:00 0 
2b70e6c00000-2b70e7000000 rwxp 00000000 00:00 0 
2b70e7200000-2b70e7600000 rwxp 00000000 00:00 0 
2b70e7800000-2b70e7c00000 rwxp 00000000 00:00 0 
2b70e7e00000-2b70e8200000 rwxp 00000000 00:00 0 
2b70e8400000-2b70e8800000 rwxp 00000000 00:00 0 
2b70e8a00000-2b70e8e00000 rwxp 00000000 00:00 0 
2b70e9000000-2b70e9400000 rwxp 00000000 00:00 0 
2b70e9600000-2b70e9a00000 rwxp 00000000 00:00 0 
2b70e9c00000-2b70ea000000 rwxp 00000000 00:00 0 
2b70ea200000-2b70ea600000 rwxp 00000000 00:00 0 
2b70ea800000-2b70eac00000 rwxp 00000000 00:00 0 
2b70eae00000-2b70eb200000 rwxp 00000000 00:00 0 
2b70eb400000-2b70eb800000 rwxp 00000000 00:00 0 
2b70eba00000-2b70ebe00000 rwxp 00000000 00:00 0 
2b70ec000000-2b70ec400000 rwxp 00000000 00:00 0 
2b70ec600000-2b70eca00000 rwxp 00000000 00:00 0 
2b70ecc00000-2b70ed000000 rwxp 00000000 00:00 0 
2b70ed200000-2b70ed600000 rwxp 00000000 00:00 0 
2b70ed800000-2b70edc00000 rwxp 00000000 00:00 0 
2b70ede00000-2b70ee200000 rwxp 00000000 00:00 0 
2b70ee400000-2b70ee800000 rwxp 00000000 00:00 0 
2b70ee800000-2b70f3a02000 rwxp 00000000 00:00 0 
2b70f3c00000-2b70f6600000 rwxs 10841b000 00:05 53492                     /dev/dri/renderD128
2b70f6800000-2b70f9200000 rwxs 10ae1b000 00:05 53492                     /dev/dri/renderD128
2b70f9400000-2b70fa945000 rwxp 00000000 00:00 0 
2b70fc000000-2b70fc021000 rwxp 00000000 00:00 0 
2b70fc021000-2b7100000000 ---p 00000000 00:00 0 
2b7105d6c000-2b710909f000 rwxp 00000000 00:00 0 
2b7109200000-2b7209200000 ---p 00000000 00:00 0 
2b7209400000-2b7309400000 ---p 00000000 00:00 0 
2b7309600000-2b7409600000 ---p 00000000 00:00 0 
2b7409800000-2b7509800000 ---p 00000000 00:00 0 
2b7509800000-2b7549901000 rwxp 00000000 00:00 0 
2b7549a00000-2b7589c00000 rwxs 10d81b000 00:05 53492                     /dev/dri/renderD128
7ffe096ba000-7ffe096dd000 rwxp 00000000 00:00 0                          [stack]
7ffe097b8000-7ffe097ba000 r-xp 00000000 00:00 0                          [vdso]
ffffffffff600000-ffffffffff601000 r-xp 00000000 00:00 0                  [vsyscall]
host allocated 
device allocated 
Host to Device Copied
sgemm_sqc.o begin
sgemm_sqc.o loaded
srun: error: k03r1n14: task 0: Aborted
srun: Terminating job step 1504559.0

my rocminfo should be the same as yours.

  Name:                    gfx906                             
  Marketing Name:          Device 66a1                        
  Vendor Name:             AMD                                
  Feature:                 KERNEL_DISPATCH                    
  Profile:                 BASE_PROFILE                       
  Float Round Mode:        NEAR                               
  Max Queue Number:        128(0x80)                          
  Queue Min Size:          4096(0x1000)                       
  Queue Max Size:          131072(0x20000)                    
  Queue Type:              MULTI                              
  Node:                    6                                  
  Device Type:             GPU                                
  Cache Info:              
    L1:                      16(0x10) KB                        
  Chip ID:                 26273(0x66a1)                      
  Cacheline Size:          64(0x40)                           
  Max Clock Freq. (MHz):   1700                               
  BDFID:                   17152                              
  Internal Node ID:        6                                  
  Compute Unit:            64                                 
  SIMDs per CU:            4                                  
  Shader Engines:          4                                  
  Shader Arrs. per Eng.:   1                                  
  WatchPts on Addr. Ranges:4                                  
  Features:                KERNEL_DISPATCH 
  Fast F16 Operation:      FALSE                              
  Wavefront Size:          64(0x40)                           
  Workgroup Max Size:      1024(0x400)                        
  Workgroup Max Size per Dimension:
    x                        1024(0x400)                        
    y                        1024(0x400)                        
    z                        1024(0x400)                        
  Max Waves Per CU:        40(0x28)                           
  Max Work-item Per CU:    2560(0xa00)                        
  Grid Max Size:           4294967295(0xffffffff)             
  Grid Max Size per Dimension:
    x                        4294967295(0xffffffff)             
    y                        4294967295(0xffffffff)             
    z                        4294967295(0xffffffff)             
  Max fbarriers/Workgrp:   32                                 
  Pool Info:               
    Pool 1                   
      Segment:                 GLOBAL; FLAGS: COARSE GRAINED      
      Size:                    16760832(0xffc000) KB              
      Allocatable:             TRUE                               
      Alloc Granule:           4KB                                
      Alloc Alignment:         4KB                                
      Acessible by all:        FALSE                              
    Pool 2                   
      Segment:                 GROUP                              
      Size:                    64(0x40) KB                        
      Allocatable:             FALSE                              
      Alloc Granule:           0KB                                
      Alloc Alignment:         0KB                                
      Acessible by all:        FALSE                              
  ISA Info:                
    ISA 1                    
      Name:                    amdgcn-amd-amdhsa--gfx906          
      Machine Models:          HSA_MACHINE_MODEL_LARGE            
      Profiles:                HSA_PROFILE_BASE                   
      Default Rounding Mode:   NEAR                               
      Default Rounding Mode:   NEAR                               
      Fast f16:                TRUE                               
      Workgroup Max Size:      1024(0x400)                        
      Workgroup Max Size per Dimension:
        x                        1024(0x400)                        
        y                        1024(0x400)                        
        z                        1024(0x400)                        
      Grid Max Size:           4294967295(0xffffffff)             
      Grid Max Size per Dimension:
        x                        4294967295(0xffffffff)             
        y                        4294967295(0xffffffff)             
        z                        4294967295(0xffffffff)             
      FBarrier Max Size:       32

is that because of the memory over allocate on different device? or there's multiple memory type or VGPR registors on the same distinctive device?
Thanks for your tutorial by the way.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant