Summary: | sci-libs/rocBLAS-5.7.1-r1: TypeError: buildSourceCodeObjectFile() missing 2 required positional arguments: 'outputPath' and 'kernelFile' | ||
---|---|---|---|
Product: | Gentoo Linux | Reporter: | Stefano <stelinux01> |
Component: | Current packages | Assignee: | Gentoo Science Related Packages <sci> |
Status: | UNCONFIRMED --- | ||
Severity: | normal | CC: | gentoo, lockalsash, m.seifert, stelinux01, xgreenlandforwyy |
Priority: | Normal | ||
Version: | unspecified | ||
Hardware: | AMD64 | ||
OS: | Linux | ||
Whiteboard: | |||
Package list: | Runtime testing required: | --- | |
Attachments: | build.log |
Description
Stefano
2023-12-22 21:30:11 UTC
Can you attach the full build.log? I'd like to see the Tensile_CREATE_COMMAND cmake is executing. Created attachment 880254 [details]
build.log
build.log for debug
I can reproduced this bug with MAKEOPTS="-j1". Setting MAKEOPTS="-j1" temporarily solves the issue. However, I encountered another bug when only gfx1010 is being built: Generating kernels: Launching 2 threads... Generating kernels: Done. ^M0it [00:00, ?it/s]^M0it [00:00, ?it/s] Compiling source kernels: Launching 2 threads... hipcc: /opt/gentoo/usr/bin/hipcc --genco -D__HIP_HCC_COMPAT_MODE__=1 -I /var/tmp/notmpdir/portage/sci-libs/rocBLAS-5.7.1-r1/work/rocBLAS-rocm-5.7.1_build/Tensile -Xoffload-linker --build-id --offload-arch=gfx1010 /var/tmp/notmpdir/portage/sci-libs/rocBLAS-5.7.1-r1/work/rocBLAS-rocm-5.7.1_build/Tensile/Kernels.cpp -c -o /var/tmp/notmpdir/portage/sci-libs/rocBLAS-5.7.1-r1/work/rocBLAS-rocm-5.7.1_build/library/src/build_tmp/TENSILE/code_object_tmp/Kernels.o /opt/gentoo/usr/lib/llvm/17/bin/clang-offload-bundler -type=o -targets=hipv4-amdgcn-amd-amdhsa--gfx1010 -input=/var/tmp/notmpdir/portage/sci-libs/rocBLAS-5.7.1-r1/work/rocBLAS-rocm-5.7.1_build/library/src/build_tmp/TENSILE/code_object_tmp/Kernels.o -output=/var/tmp/notmpdir/portage/sci-libs/rocBLAS-5.7.1-r1/work/rocBLAS-rocm-5.7.1_build/library/src/build_tmp/TENSILE/code_object_tmp/Kernels.so-000-gfx1010.hsaco -unbundle Compiling source kernels: Done. # Kernel Building elapsed time = 0.7 secs codeObjectFiles: ['/var/tmp/notmpdir/portage/sci-libs/rocBLAS-5.7.1-r1/work/rocBLAS-rocm-5.7.1_build/Tensile/library/Kernels.so-000-gfx1010.hsaco'] sourceLibPaths + asmLibPaths: ['/var/tmp/notmpdir/portage/sci-libs/rocBLAS-5.7.1-r1/work/rocBLAS-rocm-5.7.1_build/Tensile/library/Kernels.so-000-gfx1010.hsaco'] Traceback (most recent call last): File "/opt/gentoo/usr/lib/python-exec/python3.11/TensileCreateLibrary", line 8, in <module> sys.exit(TensileCreateLibrary()) ^^^^^^^^^^^^^^^^^^^^^^ File "/opt/gentoo/usr/lib/python3.11/site-packages/Tensile/TensileCreateLibrary.py", line 1407, in TensileCreateLibrary theMasterLibrary = list(masterLibraries.values())[0] ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~^^^ IndexError: list index out of range [16/456] cd /var/tmp/notmpdir/portage/sci-libs/rocBLAS-5.7.1-r1/work/rocBLAS-rocm-5.7.1/clients && /opt/gentoo/usr/bin/cmake -E copy include/rocblas_template.yaml /var/tmp/notmpdir/portage/sci-libs/rocBLAS-5.7.1-r1/work/rocBLAS-rocm-5.7.1_build/clients/staging/rocblas_template.yaml FAILED: Tensile/library/Kernels.so-000-gfx1010.hsaco /var/tmp/notmpdir/portage/sci-libs/rocBLAS-5.7.1-r1/work/rocBLAS-rocm-5.7.1_build/Tensile/library/Kernels.so-000-gfx1010.hsaco cd /var/tmp/notmpdir/portage/sci-libs/rocBLAS-5.7.1-r1/work/rocBLAS-rocm-5.7.1_build/library/src && TensileCreateLibrary --merge-files --separate-architectures --lazy-library-loading --no-short-file-names --no-library-print-debug --code-object-version=default --cxx-compiler=hipcc --jobs=2 --library-format=msgpack --architecture=gfx1010 /var/tmp/notmpdir/portage/sci-libs/rocBLAS-5.7.1-r1/work/rocBLAS-rocm-5.7.1/library/src/blas3/Tensile/Logic/asm_full /var/tmp/notmpdir/portage/sci-libs/rocBLAS-5.7.1-r1/work/rocBLAS-rocm-5.7.1_build/Tensile HIP [18/456] cd /var/tmp/notmpdir/portage/sci-libs/rocBLAS-5.7.1-r1/work/rocBLAS-rocm-5.7.1/clients/gtest && /opt/gentoo/usr/bin/python3 ../common/rocblas_gentest.py -I ../include rocblas_gtest.yaml -o /var/tmp/notmpdir/portage/sci-libs/rocBLAS-5.7.1-r1/work/rocBLAS-rocm-5.7.1_build/clients/staging/rocblas_gtest.data ninja: build stopped: subcommand failed. I have the same issue as Yiyang Wu with AMDGPU_TARGETS_gfx1031. The solution was to also enable a "master library" for BLAS, in my case gfx1030. Maybe a REQUIRED_USE like "gfx1031? ( gfx1030 )" (of course, with all the corresponding "master libraries" discovered through trial and error) could fix this. I can confirm the issue persists with sci-libs/rocBLAS-5.7.1-r2. I have the same USE flags, especially amdgpu_targets_gfx1010. Setting MAKEOPTS="-j1" does not work around the build error in my case. I have found on site that gfx1010 is not compatible anymore with this version of rocBLAS but need the preceeding version 5.4.2 In my computer also rocminfo-5.7.1 failed with unrecognized module while rocminfo-5.4.2 is OK. rocBLAS-5.4.2-r3 compile sucessfully @Stefano, could you try this patch? https://github.com/T0mstone/void-packages/blob/dbfd3ec74ce0ef4e647e93bf27566275be384237/srcpkgs/rocBLAS/patches/Tensile-fix-fallback-arch-build.patch I know that it allows to *compile* for gfx1010, I just don't know if resulting library works. Upstream bug: https://github.com/ROCm/Tensile/issues/1757 What I see in 5.4.2, that there is no finetuned kernels for gfx1010 (https://github.com/ROCm/rocBLAS/tree/rocm-5.4.1/library/src/blas3/Tensile/Logic/asm_full - no navi10), so back then library used fallback generic kernels. These kernels are still there, they just need to be enabled. Thanks @Sv. Lockal but the main problem (In my computer also rocminfo-5.7.1 failed with unrecognized module while rocminfo-5.4.2 is OK.) remain |