Gentoo Websites Logo
Go to: Gentoo Home Documentation Forums Lists Bugs Planet Store Wiki Get Gentoo!
Bug 764605 - app-text/tesseract crashes amdgpu with opencl benchmark on sys-kernel/gentoo-sources-5.10.5
Summary: app-text/tesseract crashes amdgpu with opencl benchmark on sys-kernel/gentoo-...
Status: UNCONFIRMED
Alias: None
Product: Gentoo Linux
Classification: Unclassified
Component: Current packages (show other bugs)
Hardware: AMD64 Linux
: Normal normal
Assignee: Bernard Cafarelli
URL: https://bugzilla.kernel.org/show_bug....
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2021-01-09 13:06 UTC by Stefan de Konink
Modified: 2021-08-26 20:21 UTC (History)
2 users (show)

See Also:
Package list:
Runtime testing required: ---


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description Stefan de Konink 2021-01-09 13:06:52 UTC
After checking a trivial thing with tesseract, an opencl benchmark was started with corrupted the graphics stack.

Reproducible: Always

Steps to Reproduce:
1. AMD Raven Ridge APU
2. tesseract --list-langs
Actual Results:  
Blank screens. Computer reachable over the network.

[ 5906.578015] [drm:amdgpu_dm_atomic_commit_tail [amdgpu]] *ERROR* Waiting for fences timed out!
[ 5906.578126] [drm:amdgpu_dm_atomic_commit_tail [amdgpu]] *ERROR* Waiting for fences timed out!
[ 5911.628033] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, signaled seq=854610, emitted seq=854612
[ 5911.628104] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process X pid 833 thread X:cs0 pid 834
[ 5911.628109] amdgpu 0000:05:00.0: amdgpu: GPU reset begin!
[ 5911.848066] [drm] free PSP TMR buffer
[ 5911.909148] amdgpu 0000:05:00.0: amdgpu: GPU reset succeeded, trying to resume
[ 5911.909595] [drm] PCIE GART of 1024M enabled (table at 0x000000F400900000).
[ 5911.910074] [drm] PSP is resuming...
[ 5911.930123] [drm] reserve 0x400000 from 0xf40fc00000 for PSP TMR
[ 5912.437956] amdgpu 0000:05:00.0: amdgpu: RAS: optional ras ta ucode is not available
[ 5912.497949] amdgpu 0000:05:00.0: amdgpu: RAP: optional rap ta ucode is not available
[ 5912.818636] [drm] kiq ring mec 2 pipe 1 q 0
[ 5913.668777] ------------[ cut here ]------------
[ 5913.668845] WARNING: CPU: 5 PID: 14459 at drivers/gpu/drm/amd/amdgpu/../display/dc/core/dc.c:922 dc_commit_state+0x933/0xaf0 [amdgpu]
[ 5913.668847] Modules linked in: ctr ccm cmac bnep joydev ath10k_pci ath10k_core ath zfs(PO) zunicode(PO) zzstd(O) zlua(O) zavl(PO) icp(PO) uvcvideo videobuf2_vmalloc videobuf2_memops videobuf2_v4l2 zcommon(PO) znvpair(PO) videodev amdgpu spl(O) mac80211 videobuf2_common btusb btrtl zlib_deflate btbcm btintel mfd_core gpu_sched zlib_inflate bluetooth snd_hda_codec_conexant ttm kvm_amd i2c_algo_bit snd_hda_codec_generic snd_hda_codec_hdmi ecdh_generic drm_kms_helper ecc syscopyarea sdhci_pci kvm sysfillrect snd_hda_intel sysimgblt iosf_mbi fb_sys_fops cqhci irqbypass cec snd_intel_dspcfg cfg80211 sdhci snd_hda_codec wmi_bmof libarc4 r8169 mmc_core snd_hda_core aesni_intel realtek snd_pcm mdio_devres crypto_simd snd_timer wireguard libphy cryptd thinkpad_acpi ccp psmouse glue_helper nvram evdev ip6_udp_tunnel ledtrig_audio udp_tunnel i2c_piix4 sha1_generic snd soundcore rfkill wmi battery ac i2c_scmi video button sch_fq_codel drm backlight i2c_core fuse configfs efivarfs
[ 5913.668928] CPU: 5 PID: 14459 Comm: kworker/5:2 Tainted: P           O      5.10.5-gentoo #1
[ 5913.668930] Hardware name: LENOVO 20KU000NMH/20KU000NMH, BIOS R0UET77W (1.57 ) 04/07/2020
[ 5913.668936] Workqueue: events drm_sched_job_timedout [gpu_sched]
[ 5913.668995] RIP: 0010:dc_commit_state+0x933/0xaf0 [amdgpu]
[ 5913.668999] Code: 04 24 48 c7 00 00 00 00 00 48 c7 40 08 00 00 00 00 e9 01 f8 ff ff 31 d2 e9 54 f8 ff ff 80 b8 e0 02 00 00 00 0f 84 c0 fd ff ff <0f> 0b e9 b9 fd ff ff 48 89 ef e8 6e b1 00 00 48 89 ef e8 b6 9b 1e
[ 5913.669001] RSP: 0018:ffffc900094a7c50 EFLAGS: 00010202
[ 5913.669004] RAX: ffff8881f3caf800 RBX: ffff8882376e0690 RCX: 0000000000000005
[ 5913.669005] RDX: 0000000000000e24 RSI: 00000000000007cd RDI: 00000aca1f576270
[ 5913.669007] RBP: 0000000000000000 R08: ffffc900094a7bd4 R09: ffffc900094a7b20
[ 5913.669008] R10: 0000000000000002 R11: 000000000000000c R12: ffff8882376e0000
[ 5913.669010] R13: ffff8882376e1ec8 R14: ffff888110550000 R15: ffff8882376e1ec8
[ 5913.669012] FS:  0000000000000000(0000) GS:ffff88844ef40000(0000) knlGS:0000000000000000
[ 5913.669014] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 5913.669015] CR2: 00003a9a093e1000 CR3: 000000012fec8000 CR4: 00000000003506e0
[ 5913.669017] Call Trace:
[ 5913.669093]  dm_resume+0x3b0/0x510 [amdgpu]
[ 5913.669164]  ? psm_adjust_power_state_dynamic+0xec/0x1c0 [amdgpu]
[ 5913.669207]  amdgpu_device_ip_resume_phase2+0x52/0xb0 [amdgpu]
[ 5913.669268]  amdgpu_do_asic_reset+0x26c/0x39c [amdgpu]
[ 5913.669329]  amdgpu_device_gpu_recover.cold+0x6b9/0x98d [amdgpu]
[ 5913.669393]  amdgpu_job_timedout+0x11c/0x140 [amdgpu]
[ 5913.669398]  drm_sched_job_timedout+0x60/0xd0 [gpu_sched]
[ 5913.669403]  process_one_work+0x1dc/0x370
[ 5913.669406]  worker_thread+0x4d/0x3d0
[ 5913.669409]  ? rescuer_thread+0x3f0/0x3f0
[ 5913.669412]  kthread+0x125/0x140
[ 5913.669415]  ? kthread_create_worker_on_cpu+0x70/0x70
[ 5913.669418]  ret_from_fork+0x1f/0x30
[ 5913.669421] ---[ end trace c1d81a78b4c82ff4 ]---
[ 5913.683049] [drm] VCN decode and encode initialized successfully(under SPG Mode).
[ 5913.683060] amdgpu 0000:05:00.0: amdgpu: ring gfx uses VM inv eng 0 on hub 0
[ 5913.683063] amdgpu 0000:05:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 1 on hub 0
[ 5913.683065] amdgpu 0000:05:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 4 on hub 0
[ 5913.683067] amdgpu 0000:05:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 5 on hub 0
[ 5913.683070] amdgpu 0000:05:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 6 on hub 0
[ 5913.683072] amdgpu 0000:05:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 7 on hub 0
[ 5913.683074] amdgpu 0000:05:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 8 on hub 0
[ 5913.683076] amdgpu 0000:05:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 9 on hub 0
[ 5913.683078] amdgpu 0000:05:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 10 on hub 0
[ 5913.683080] amdgpu 0000:05:00.0: amdgpu: ring kiq_2.1.0 uses VM inv eng 11 on hub 0
[ 5913.683082] amdgpu 0000:05:00.0: amdgpu: ring sdma0 uses VM inv eng 0 on hub 1
[ 5913.683084] amdgpu 0000:05:00.0: amdgpu: ring vcn_dec uses VM inv eng 1 on hub 1
[ 5913.683086] amdgpu 0000:05:00.0: amdgpu: ring vcn_enc0 uses VM inv eng 4 on hub 1
[ 5913.683088] amdgpu 0000:05:00.0: amdgpu: ring vcn_enc1 uses VM inv eng 5 on hub 1
[ 5913.683090] amdgpu 0000:05:00.0: amdgpu: ring jpeg_dec uses VM inv eng 6 on hub 1
[ 5914.738098] amdgpu 0000:05:00.0: [drm:amdgpu_ib_ring_tests [amdgpu]] *ERROR* IB test failed on gfx (-110).
[ 5914.738106] amdgpu 0000:05:00.0: amdgpu: ib ring test failed (-110).
[ 5914.927936] [drm] free PSP TMR buffer


Expected Results:  
If the device was not supporting ROCm, I would expect it would gracefully report so.

Portage 3.0.12 (python 3.7.9-final-0, default/linux/amd64/17.1/no-multilib, gcc-10.2.0, glibc-2.32-r7, 5.10.5-gentoo x86_64)
=================================================================
System uname: Linux-5.10.5-gentoo-x86_64-AMD_Ryzen_5_2500U_with_Radeon_Vega_Mobile_Gfx-with-gentoo-2.7
KiB Mem:    16091136 total,   8073056 free
KiB Swap:   33554428 total,  33554428 free
Timestamp of repository gentoo: Thu, 07 Jan 2021 23:00:01 +0000
Head commit of repository gentoo: a6686c82c22a0fc316cdfa69938f1f51fc30bd40
sh bash 5.1_p4
ld GNU ld (Gentoo 2.34 p6) 2.34.0
ccache version 4.1 [disabled]
app-shells/bash:          5.1_p4::gentoo
dev-java/java-config:     2.3.1::gentoo
dev-lang/perl:            5.30.3-r1::gentoo
dev-lang/python:          2.7.18-r100::lto-overlay, 3.7.9-r1::lto-overlay
dev-util/ccache:          4.1::gentoo
dev-util/cmake:           3.19.2::gentoo
sys-apps/baselayout:      2.7-r1::gentoo
sys-apps/openrc:          0.41.2::gentoo
sys-apps/sandbox:         2.20::gentoo
sys-devel/autoconf:       2.13-r1::gentoo, 2.69-r5::gentoo
sys-devel/automake:       1.16.3-r1::gentoo
sys-devel/binutils:       2.32-r1::gentoo, 2.33.1-r1::gentoo, 2.34-r2::gentoo, 2.35.1-r1::gentoo
sys-devel/gcc:            10.2.0-r5::gentoo
sys-devel/gcc-config:     2.3.2-r1::gentoo
sys-devel/libtool:        2.4.6-r6::gentoo
sys-devel/make:           4.3::gentoo
sys-kernel/linux-headers: 5.10::gentoo (virtual/os-headers)
sys-libs/glibc:           2.32-r7::gentoo
Repositories:

gentoo
    location: /var/db/repos/gentoo
    sync-type: rsync
    sync-uri: rsync://rsync.gentoo.org/gentoo-portage
    priority: -1000
    sync-rsync-verify-metamanifest: yes
    sync-rsync-verify-jobs: 1
    sync-rsync-extra-opts: 
    sync-rsync-verify-max-age: 24

4nykey
    location: /var/lib/layman/4nykey
    masters: gentoo
    priority: 50

booboo
    location: /var/lib/layman/booboo
    masters: gentoo
    priority: 50

dotnet
    location: /var/lib/layman/dotnet
    masters: gentoo
    priority: 50

flavour
    location: /var/lib/layman/flavour
    masters: gentoo
    priority: 50

fordfrog
    location: /var/lib/layman/fordfrog
    masters: gentoo
    priority: 50

gentoo-zh
    location: /var/lib/layman/gentoo-zh
    masters: gentoo
    priority: 50

imaging
    location: /var/lib/layman/imaging
    masters: gentoo
    priority: 50

lto-overlay
    location: /var/lib/layman/lto-overlay
    masters: gentoo mv
    priority: 50

mv
    location: /var/lib/layman/mv
    masters: gentoo
    priority: 50

raiagent
    location: /var/lib/layman/raiagent
    masters: gentoo
    priority: 50

salfter
    location: /var/lib/layman/salfter
    masters: gentoo
    priority: 50

FireBurn
    location: /var/lib/layman/FireBurn
    masters: gentoo
    priority: 60

ACCEPT_KEYWORDS="amd64 ~amd64"
ACCEPT_LICENSE="*"
CBUILD="x86_64-pc-linux-gnu"
CFLAGS="-O2 -march=znver1 -O2 -pipe -pipe"
CHOST="x86_64-pc-linux-gnu"
CONFIG_PROTECT="/etc /usr/share/config /usr/share/gnupg/qualified.txt /usr/share/maven-bin-3.6/conf"
CONFIG_PROTECT_MASK="/etc/ca-certificates.conf /etc/dconf /etc/env.d /etc/fonts/fonts.conf /etc/gconf /etc/gentoo-release /etc/sandbox.d /etc/terminfo /etc/texmf/language.dat.d /etc/texmf/language.def.d /etc/texmf/updmap.d /etc/texmf/web2c"
CXXFLAGS="-O2 -march=znver1 -O2 -pipe -pipe"
DISTDIR="/var/cache/distfiles"
ENV_UNSET="CARGO_HOME DBUS_SESSION_BUS_ADDRESS DISPLAY GOBIN GOPATH PERL5LIB PERL5OPT PERLPREFIX PERL_CORE PERL_MB_OPT PERL_MM_OPT XAUTHORITY XDG_CACHE_HOME XDG_CONFIG_HOME XDG_DATA_HOME XDG_RUNTIME_DIR"
FCFLAGS="-O2 -march=znver1 -O2 -pipe -pipe"
FEATURES="assume-digests binpkg-docompress binpkg-dostrip binpkg-logs config-protect-if-modified distlocks ebuild-locks fixlafiles ipc-sandbox merge-sync multilib-strict network-sandbox news parallel-fetch pid-sandbox preserve-libs protect-owned qa-unresolved-soname-deps sandbox sfperms strict unknown-features-warn unmerge-logs unmerge-orphans userfetch userpriv usersandbox usersync xattr"
FFLAGS="-O2 -march=znver1 -O2 -pipe -pipe"
GENTOO_MIRRORS="http://mirror.leaseweb.com/gentoo/ http://ftp.snt.utwente.nl/pub/os/linux/gentoo"
LANG="en_US.utf8"
LDFLAGS="-Wl,-O1 -Wl,--as-needed"
MAKEOPTS="-j7"
PKGDIR="/var/cache/binpkgs"
PORTAGE_CONFIGROOT="/"
PORTAGE_RSYNC_OPTS="--recursive --links --safe-links --perms --times --omit-dir-times --compress --force --whole-file --delete --stats --human-readable --timeout=180 --exclude=/distfiles --exclude=/local --exclude=/packages --exclude=/.git"
PORTAGE_TMPDIR="/var/tmp"
USE="X acl adwaita-icon-theme alsa amd64 autohint berkdb bluetooth brotli bzip2 cairo caps clang cli codec2 corefonts cpudetection crypt curl dav1d dbus designer dri dri3 egl exif ffmpeg fftw fontconfig fortran gdal gdbm geos gimp gles2 glslang gold graphite gtk gtk3 hackrf harfbuzz heif http2 hunspell iconv icu infinality ipv6 jack jpeg jpeg2k jumbo-build lapack libglvnd libnotify libsamplerate libtirpc lm-sensors lto lz4 lzma minizip ncurses nptl ntfs ogg opencl opencv openexr opengl openmp optimization opus orc pam pcre pdf pgo png policykit postgres qml qt5 readline rtlsdr rubberband seccomp sndfile speex split-usr sqlite srt ssl startup-notification strong-optimization svg synaptics system-jsoncpp system-libvpx systemd sysv-utils tcpd theora tiff tracepath truetype udev unicode upower vaapi vdpau vim-syntax vorbis vpx vulkan webp webstart woff2 x264 x265 xattr xinerama xrandr zeromq zlib zstd" ABI_X86="64" ADA_TARGET="gnat_2018" APACHE2_MODULES="authn_core authz_core socache_shmcb unixd actions alias auth_basic authn_alias authn_anon authn_dbm authn_default authn_file authz_dbm authz_default authz_groupfile authz_host authz_owner authz_user autoindex cache cgi cgid dav dav_fs dav_lock deflate dir disk_cache env expires ext_filter file_cache filter headers include info log_config logio mem_cache mime mime_magic negotiation rewrite setenvif speling status unique_id userdir usertrack vhost_alias" CALLIGRA_FEATURES="karbon sheets words" COLLECTD_PLUGINS="df interface irq load memory rrdtool swap syslog" CPU_FLAGS_X86="aes avx avx2 f16c fma3 mmx mmxext pclmul popcnt sha sse sse2 sse3 sse4_1 sse4_2 sse4a ssse3" ELIBC="glibc" GPSD_PROTOCOLS="ashtech aivdm earthmate evermore fv18 garmin garmintxt gpsclock greis isync itrax mtk3301 nmea ntrip navcom oceanserver oldstyle oncore rtcm104v2 rtcm104v3 sirf skytraq superstar2 timing tsip tripmate tnt ublox ubx" INPUT_DEVICES="libinput" KERNEL="linux" L10N="en nl" LCD_DEVICES="bayrad cfontz cfontz633 glk hd44780 lb216 lcdm001 mtxorb ncurses text" LIBREOFFICE_EXTENSIONS="presenter-console presenter-minimizer" LLVM_TARGETS="AMDGPU BPF" LUA_SINGLE_TARGET="lua5-1" LUA_TARGETS="lua5-1" OFFICE_IMPLEMENTATION="libreoffice" PHP_TARGETS="php7-3 php7-4" POSTGRES_TARGETS="postgres13" PYTHON_SINGLE_TARGET="python3_7" PYTHON_TARGETS="python2_7 python3_7" RUBY_TARGETS="ruby27 ruby30" USERLAND="GNU" UWSGI_PLUGINS="cache carbon cheaper_busyness corerouter fastrouter http logfile logsocket mongodblog nagios ping rawrouter redislog router_basicauth router_cache router_expires router_hash router_http router_memcached router_metrics router_redirect router_redis router_rewrite router_static router_uwsgi rpc signal spooler symcall syslog transformation_chunked transformation_gzip transformation_offload transformation_tofile ugreen zergpool" VIDEO_CARDS="amdgpu radeonsi radeon" XTABLES_ADDONS="quota2 psd pknock lscan length2 ipv4options ipset ipp2p iface geoip fuzzy condition tee tarpit sysrq steal rawnat logmark ipmark dhcpmac delude chaos account"
Unset:  CC, CPPFLAGS, CTARGET, CXX, EMERGE_DEFAULT_OPTS, INSTALL_MASK, LC_ALL, LINGUAS, PORTAGE_BINHOST, PORTAGE_BUNZIP2_COMMAND, PORTAGE_COMPRESS, PORTAGE_COMPRESS_FLAGS, PORTAGE_RSYNC_EXTRA_OPTS
Comment 1 Bernard Cafarelli gentoo-dev 2021-08-26 20:21:24 UTC
I do not have the hardware to test this sadly. Is it still happening with newer kernel? I just pushed a beta ebuild for upcoming 5.0.0, they mention quite a few changes in opencl too