[Bug 61445] Radeon HD 4250 random hard lockups and soft resets

bugzilla-daemon at freedesktop.org bugzilla-daemon at freedesktop.org
Tue May 14 10:58:45 PDT 2013


https://bugs.freedesktop.org/show_bug.cgi?id=61445

--- Comment #9 from StephenM <stephen.wright.ii at gmail.com> ---
I wasn't sure if my previous post went through or got rejected. Please
disregard this post. This was simply a test to see if i could post

This thread can be removed. Sorry for the inconvenience. I will post my
question again

I'm having a hard time getting the ati radeon driver to work properly with
slackware current 3.8.8 kernel. The driver will randomly crash and produce a
corrupted image on the screen. Eventually Xwindows will stop working entirely
and the box will freeze. SSH still works but it appears that X is taking up
nearly 100% cpu. I have experienced the problem where the screen will appear
functional for about 10 seconds then go completely black. Here are some stats
from my server. Any help would be appreciated.

---------------------------------------------
X.Org X Server 1.13.4
Release Date: 2013-04-17
X Protocol Version 11, Revision 0
Build Operating System: Slackware 14.1 Slackware Linux Project
Current Operating System: Linux wright-mac 3.8.8 #1 SMP Thu Apr 18 21:48:01 CDT
2013 x86_64
Kernel command line: BOOT_IMAGE=Slackware ro root=801 vt.default_utf8=1
Build Date: 18 April 2013 01:47:25AM

Current version of pixman: 0.28.2
Before reporting problems, check http://wiki.x.org
to make sure that you have the latest version.

Linux wright-mac 3.8.8 #1 SMP Thu Apr 18 21:48:01 CDT 2013 x86_64 Intel(R)
Core(TM) i5 CPU 760 @ 2.80GHz GenuineIntel GNU/Linux
------------------------------------------
syslog
May 2 16:16:10 mac kernel: [196739.807725] radeon 0000:01:00.0: GPU lockup CP
stall for more than 10000msec
May 2 16:16:10 mac kernel: [196739.807732] radeon 0000:01:00.0: GPU lockup
(waiting for 0x0000000000a619ab last fence id 0x0000000000a619aa)
May 2 16:16:22 mac kernel: [196751.494415] radeon 0000:01:00.0: GPU lockup CP
stall for more than 10000msec
May 2 16:16:22 mac kernel: [196751.494422] radeon 0000:01:00.0: GPU lockup
(waiting for 0x0000000000a619b1 last fence id 0x0000000000a619ad)
May 2 16:16:33 mac kernel: [196762.084773] radeon 0000:01:00.0: GPU lockup CP
stall for more than 10000msec
May 2 16:16:33 mac kernel: [196762.084780] radeon 0000:01:00.0: GPU lockup
(waiting for 0x0000000000a619b2 last fence id 0x0000000000a619ae)
May 2 16:16:33 mac kernel: [196762.084785] [drm:r600_ib_test] *ERROR* radeon:
fence wait failed (-35).
May 2 16:16:33 mac kernel: [196762.084790] [drm:radeon_ib_ring_tests] *ERROR*
radeon: failed testing IB on GFX ring (-35).
May 2 16:16:33 mac kernel: [196762.084793] radeon 0000:01:00.0: ib ring test
failed (-35).
May 2 16:16:44 mac kernel: [196773.828224] radeon 0000:01:00.0: GPU lockup CP
stall for more than 10000msec
May 2 16:16:44 mac kernel: [196773.828230] radeon 0000:01:00.0: GPU lockup
(waiting for 0x0000000000a619b6 last fence id 0x0000000000a619b5)
May 2 16:16:56 mac kernel: [196785.591675] radeon 0000:01:00.0: GPU lockup CP
stall for more than 10000msec
May 2 16:16:56 mac kernel: [196785.591682] radeon 0000:01:00.0: GPU lockup
(waiting for 0x0000000000a619c3 last fence id 0x0000000000a619ba)
May 2 16:17:07 mac kernel: [196796.209111] radeon 0000:01:00.0: GPU lockup CP
stall for more than 10000msec
May 2 16:17:07 mac kernel: [196796.209118] radeon 0000:01:00.0: GPU lockup
(waiting for 0x0000000000a619c4 last fence id 0x0000000000a619bc)
May 2 16:17:07 mac kernel: [196796.209123] [drm:r600_ib_test] *ERROR* radeon:
fence wait failed (-35).
May 2 16:17:07 mac kernel: [196796.209127] [drm:radeon_ib_ring_tests] *ERROR*
radeon: failed testing IB on GFX ring (-35).
May 2 16:17:07 mac kernel: [196796.209131] radeon 0000:01:00.0: ib ring test
failed (-35).
May 2 16:17:19 mac kernel: [196807.978533] radeon 0000:01:00.0: GPU lockup CP
stall for more than 10000msec
May 2 16:17:19 mac kernel: [196807.978540] radeon 0000:01:00.0: GPU lockup
(waiting for 0x0000000000a619ca last fence id 0x0000000000a619c6)
May 2 16:17:29 mac kernel: [196818.574008] radeon 0000:01:00.0: GPU lockup CP
stall for more than 10000msec
May 2 16:17:29 mac kernel: [196818.574015] radeon 0000:01:00.0: GPU lockup
(waiting for 0x0000000000a619cb last fence id 0x0000000000a619c7)
May 2 16:17:29 mac kernel: [196818.574019] [drm:r600_ib_test] *ERROR* radeon:
fence wait failed (-35).
May 2 16:17:29 mac kernel: [196818.574024] [drm:radeon_ib_ring_tests] *ERROR*
radeon: failed testing IB on GFX ring (-35).
May 2 16:17:29 mac kernel: [196818.574027] radeon 0000:01:00.0: ib ring test
failed (-35).
May 2 16:17:40 mac kernel: [196829.740512] radeon 0000:01:00.0: GPU lockup CP
stall for more than 10000msec
May 2 16:17:40 mac kernel: [196829.740519] radeon 0000:01:00.0: GPU lockup
(waiting for 0x0000000000a619cf last fence id 0x0000000000a619cc)
May 2 16:17:50 mac kernel: [196839.820819] radeon 0000:01:00.0: GPU lockup CP
stall for more than 10000msec
May 2 16:17:50 mac kernel: [196839.820826] radeon 0000:01:00.0: GPU lockup
(waiting for 0x0000000000a619d0 last fence id 0x0000000000a619cd)
May 2 16:17:50 mac kernel: [196839.820830] [drm:r600_ib_test] *ERROR* radeon:
fence wait failed (-35).
May 2 16:17:50 mac kernel: [196839.820835] [drm:radeon_ib_ring_tests] *ERROR*
radeon: failed testing IB on GFX ring (-35).
May 2 16:17:50 mac kernel: [196839.820838] radeon 0000:01:00.0: ib ring test
failed (-35).
May 2 16:18:02 mac kernel: [196850.982306] radeon 0000:01:00.0: GPU lockup CP
stall for more than 10000msec
May 2 16:18:02 mac kernel: [196850.982313] radeon 0000:01:00.0: GPU lockup
(waiting for 0x0000000000a619d2 last fence id 0x0000000000a619d1)
May 2 16:18:13 mac kernel: [196862.675902] radeon 0000:01:00.0: GPU lockup CP
stall for more than 10000msec
May 2 16:18:13 mac kernel: [196862.675909] radeon 0000:01:00.0: GPU lockup
(waiting for 0x0000000000a619d5 last fence id 0x0000000000a619d4)
May 2 16:18:25 mac kernel: [196874.368496] radeon 0000:01:00.0: GPU lockup CP
stall for more than 10000msec
May 2 16:18:25 mac kernel: [196874.368503] radeon 0000:01:00.0: GPU lockup
(waiting for 0x0000000000a619d9 last fence id 0x0000000000a619d7)
May 2 16:18:36 mac kernel: [196884.959946] radeon 0000:01:00.0: GPU lockup CP
stall for more than 10000msec
May 2 16:18:36 mac kernel: [196884.959953] radeon 0000:01:00.0: GPU lockup
(waiting for 0x0000000000a619da last fence id 0x0000000000a619d8)
May 2 16:18:36 mac kernel: [196884.959957] [drm:r600_ib_test] *ERROR* radeon:
fence wait failed (-35).
May 2 16:18:36 mac kernel: [196884.959962] [drm:radeon_ib_ring_tests] *ERROR*
radeon: failed testing IB on GFX ring (-35).
May 2 16:18:36 mac kernel: [196884.959965] radeon 0000:01:00.0: ib ring test
failed (-35).
May 2 16:18:46 mac kernel: [196894.995395] radeon 0000:01:00.0: GPU lockup CP
stall for more than 10000msec
May 2 16:18:46 mac kernel: [196894.995402] radeon 0000:01:00.0: GPU lockup
(waiting for 0x0000000000a619dd last fence id 0x0000000000a619dc)
May 2 16:18:57 mac kernel: [196906.688951] radeon 0000:01:00.0: GPU lockup CP
stall for more than 10000msec
May 2 16:18:57 mac kernel: [196906.688958] radeon 0000:01:00.0: GPU lockup
(waiting for 0x0000000000a619e1 last fence id 0x0000000000a619df)
May 2 16:19:08 mac kernel: [196917.280434] radeon 0000:01:00.0: GPU lockup CP
stall for more than 10000msec
May 2 16:19:08 mac kernel: [196917.280441] radeon 0000:01:00.0: GPU lockup
(waiting for 0x0000000000a619e2 last fence id 0x0000000000a619e0)
May 2 16:19:08 mac kernel: [196917.280445] [drm:r600_ib_test] *ERROR* radeon:
fence wait failed (-35).
May 2 16:19:08 mac kernel: [196917.280450] [drm:radeon_ib_ring_tests] *ERROR*
radeon: failed testing IB on GFX ring (-35).
May 2 16:19:08 mac kernel: [196917.280453] radeon 0000:01:00.0: ib ring test
failed (-35).
May 2 16:19:20 mac kernel: [196928.933293] radeon 0000:01:00.0: GPU lockup CP
stall for more than 10000msec
May 2 16:19:20 mac kernel: [196928.933300] radeon 0000:01:00.0: GPU lockup
(waiting for 0x0000000000a619eb last fence id 0x0000000000a619e4)
May 2 16:19:30 mac kernel: [196939.525567] radeon 0000:01:00.0: GPU lockup CP
stall for more than 10000msec
May 2 16:19:30 mac kernel: [196939.525574] radeon 0000:01:00.0: GPU lockup
(waiting for 0x0000000000a619ec last fence id 0x0000000000a619e5)
May 2 16:19:30 mac kernel: [196939.525579] [drm:r600_ib_test] *ERROR* radeon:
fence wait failed (-35).
May
--------------------------------------------------------------------------
/var/log/message
May 2 16:16:10 mac kernel: [196739.808776] radeon 0000:01:00.0: Saved 23 dwords
of commands on ring 0.
May 2 16:16:10 mac kernel: [196739.808779] radeon 0000:01:00.0: GPU softreset:
0x00000003
May 2 16:16:10 mac kernel: [196739.811869] radeon 0000:01:00.0: GRBM_STATUS =
0xF5703828
May 2 16:16:10 mac kernel: [196739.811871] radeon 0000:01:00.0: GRBM_STATUS_SE0
= 0xFC000007
May 2 16:16:10 mac kernel: [196739.811872] radeon 0000:01:00.0: GRBM_STATUS_SE1
= 0x00000007
May 2 16:16:10 mac kernel: [196739.811874] radeon 0000:01:00.0: SRBM_STATUS =
0x200000C0
May 2 16:16:10 mac kernel: [196739.811875] radeon 0000:01:00.0:
R_008674_CP_STALLED_STAT1 = 0x00000000
May 2 16:16:10 mac kernel: [196739.811877] radeon 0000:01:00.0:
R_008678_CP_STALLED_STAT2 = 0x40000000
May 2 16:16:10 mac kernel: [196739.811878] radeon 0000:01:00.0:
R_00867C_CP_BUSY_STAT = 0x00008000
May 2 16:16:10 mac kernel: [196739.811879] radeon 0000:01:00.0:
R_008680_CP_STAT = 0x80228643
May 2 16:16:10 mac kernel: [196739.811880] radeon 0000:01:00.0:
GRBM_SOFT_RESET=0x00007F6B
May 2 16:16:10 mac kernel: [196739.811933] radeon 0000:01:00.0: GRBM_STATUS =
0x00003828
May 2 16:16:10 mac kernel: [196739.811934] radeon 0000:01:00.0: GRBM_STATUS_SE0
= 0x00000007
May 2 16:16:10 mac kernel: [196739.811936] radeon 0000:01:00.0: GRBM_STATUS_SE1
= 0x00000007
May 2 16:16:10 mac kernel: [196739.811937] radeon 0000:01:00.0: SRBM_STATUS =
0x200000C0
May 2 16:16:10 mac kernel: [196739.811939] radeon 0000:01:00.0:
R_008674_CP_STALLED_STAT1 = 0x00000000
May 2 16:16:10 mac kernel: [196739.811940] radeon 0000:01:00.0:
R_008678_CP_STALLED_STAT2 = 0x00000000
May 2 16:16:10 mac kernel: [196739.811941] radeon 0000:01:00.0:
R_00867C_CP_BUSY_STAT = 0x00000000
May 2 16:16:10 mac kernel: [196739.811943] radeon 0000:01:00.0:
R_008680_CP_STAT = 0x00000000
May 2 16:16:10 mac kernel: [196739.829415] radeon 0000:01:00.0: GPU reset
succeeded, trying to resume
May 2 16:16:10 mac kernel: [196739.893330] [drm] probing gen 2 caps for device
8086:d138 = 2/0
May 2 16:16:10 mac kernel: [196739.893332] [drm] PCIE gen 2 link speeds already
enabled
May 2 16:16:10 mac kernel: [196739.895696] [drm] PCIE GART of 512M enabled
(table at 0x0000000000040000).
May 2 16:16:10 mac kernel: [196739.895817] radeon 0000:01:00.0: WB enabled
May 2 16:16:10 mac kernel: [196739.895820] radeon 0000:01:00.0: fence driver on
ring 0 use gpu addr 0x0000000040000c00 and cpu addr 0xffff88032fe0ec00
May 2 16:16:10 mac kernel: [196739.895821] radeon 0000:01:00.0: fence driver on
ring 3 use gpu addr 0x0000000040000c0c and cpu addr 0xffff88032fe0ec0c
May 2 16:16:10 mac kernel: [196739.912353] [drm] ring test on 0 succeeded in 1
usecs
May 2 16:16:10 mac kernel: [196739.912411] [drm] ring test on 3 succeeded in 1
usecs
May 2 16:16:10 mac kernel: [196739.912546] [drm] ib test on ring 0 succeeded in
0 usecs
May 2 16:16:10 mac kernel: [196739.912581] [drm] ib test on ring 3 succeeded in
1 usecs
May 2 16:16:22 mac kernel: [196751.495464] radeon 0000:01:00.0: Saved 119
dwords of commands on ring 0.
May 2 16:16:22 mac kernel: [196751.495467] radeon 0000:01:00.0: GPU softreset:
0x00000003
May 2 16:16:22 mac kernel: [196751.503885] radeon 0000:01:00.0: GRBM_STATUS =
0xF5303828
May 2 16:16:22 mac kernel: [196751.503887] radeon 0000:01:00.0: GRBM_STATUS_SE0
= 0xF4000007
May 2 16:16:22 mac kernel: [196751.503888] radeon 0000:01:00.0: GRBM_STATUS_SE1
= 0x00000007
May 2 16:16:22 mac kernel: [196751.503890] radeon 0000:01:00.0: SRBM_STATUS =
0x200000C0
May 2 16:16:22 mac kernel: [196751.503891] radeon 0000:01:00.0:
R_008674_CP_STALLED_STAT1 = 0x00000000
May 2 16:16:22 mac kernel: [196751.503892] radeon 0000:01:00.0:
R_008678_CP_STALLED_STAT2 = 0x40000000
May 2 16:16:22 mac kernel: [196751.503894] radeon 0000:01:00.0:
R_00867C_CP_BUSY_STAT = 0x00008004
May 2 16:16:22 mac kernel: [196751.503895] radeon 0000:01:00.0:
R_008680_CP_STAT = 0x80228647
May 2 16:16:22 mac kernel: [196751.503896] radeon 0000:01:00.0:
GRBM_SOFT_RESET=0x00007F6B
May 2 16:16:22 mac kernel: [196751.503949] radeon 0000:01:00.0: GRBM_STATUS =
0x00003828
May 2 16:16:22 mac kernel: [196751.503950] radeon 0000:01:00.0: GRBM_STATUS_SE0
= 0x00000007
May 2 16:16:22 mac kernel: [196751.503952] radeon 0000:01:00.0: GRBM_STATUS_SE1
= 0x00000007
May 2 16:16:22 mac kernel: [196751.503953] radeon 0000:01:00.0: SRBM_STATUS =
0x200000C0
May 2 16:16:22 mac kernel: [196751.503954] radeon 0000:01:00.0:
R_008674_CP_STALLED_STAT1 = 0x00000000
May 2 16:16:22 mac kernel: [196751.503956] radeon 0000:01:00.0:
R_008678_CP_STALLED_STAT2 = 0x00000000
May 2 16:16:22 mac kernel: [196751.503957] radeon 0000:01:00.0:
R_00867C_CP_BUSY_STAT = 0x00000000
May 2 16:16:22 mac kernel: [196751.503959] radeon 0000:01:00.0:
R_008680_CP_STAT = 0x00000000
May 2 16:16:22 mac kernel: [196751.521430] radeon 0000:01:00.0: GPU reset
succeeded, trying to resume
---------------------------------------------------------
Xorg.0.log
(EE) Backtrace:
(EE) 0: /usr/bin/X (xorg_backtrace+0x3d) [0x57a5ed]
(EE) 1: /usr/bin/X (mieqEnqueue+0x22b) [0x55c8ab]
(EE) 2: /usr/bin/X (QueuePointerEvents+0x52) [0x44c062]
(EE) 3: /usr/lib64/xorg/modules/input/evdev_drv.so (0x7f9d9dab8000+0x5ccd)
[0x7f9d9dabdccd]
(EE) 4: /usr/bin/X (0x400000+0x71fa8) [0x471fa8]
(EE) 5: /usr/bin/X (0x400000+0x99e3d) [0x499e3d]
(EE) 6: /lib64/libpthread.so.0 (0x7f9da2ef7000+0xf670) [0x7f9da2f06670]
(EE) 7: /lib64/libc.so.6 (ioctl+0x7) [0x7f9da1170a67]
(EE) 8: /usr/lib64/libdrm.so.2 (drmIoctl+0x28) [0x7f9da2cef5a8]
(EE) 9: /usr/lib64/libdrm.so.2 (drmCommandWriteRead+0x1c) [0x7f9da2cf199c]
(EE) 10: /usr/lib64/libdrm_radeon.so.1 (0x7f9da0596000+0x2029) [0x7f9da0598029]
(EE) 11: /usr/lib64/libdrm_radeon.so.1 (0x7f9da0596000+0x2244) [0x7f9da0598244]
(EE) 12: /usr/lib64/xorg/modules/drivers/radeon_drv.so (0x7f9da07a0000+0x1cc3a)
[0x7f9da07bcc3a]
(EE) 13: /usr/lib64/xorg/modules/libexa.so (0x7f9d9ff6d000+0x4ad7)
[0x7f9d9ff71ad7]
(EE) 14: /usr/lib64/xorg/modules/libexa.so (0x7f9d9ff6d000+0x8089)
[0x7f9d9ff75089]
(EE) 15: /usr/lib64/xorg/modules/libexa.so (0x7f9d9ff6d000+0x4cfb)
[0x7f9d9ff71cfb]
(EE) 16: /usr/bin/X (0x400000+0x106840) [0x506840]
(EE) 17: /usr/bin/X (ValidateGC+0x1c) [0x447c1c]
(EE) 18: /usr/bin/X (0x400000+0xfa7aa) [0x4fa7aa]
(EE) 19: /usr/bin/X (miCompositeRects+0x75) [0x4fa945]
(EE) 20: /usr/bin/X (0x400000+0x102961) [0x502961]
(EE) 21: /usr/bin/X (0x400000+0x35d11) [0x435d11]
(EE) 22: /usr/bin/X (0x400000+0x25475) [0x425475]
(EE) 23: /lib64/libc.so.6 (__libc_start_main+0xf5) [0x7f9da10a1d85]
(EE) 24: /usr/bin/X (0x400000+0x257bd) [0x4257bd]
(EE)
(EE) [mi] These backtraces from mieqEnqueue may point to a culprit higher up
the stack.
(EE) [mi] mieq is *NOT* the cause. It is a victim.
(EE) [mi] EQ overflow continuing. 100 events have been dropped.
(EE)
(EE) Backtrace:
(EE) 0: /usr/bin/X (xorg_backtrace+0x3d) [0x57a5ed]
(EE) 1: /usr/bin/X (QueuePointerEvents+0x52) [0x44c062]
(EE) 2: /usr/lib64/xorg/modules/input/evdev_drv.so (0x7f9d9dab8000+0x5ccd)
[0x7f9d9dabdccd]
(EE) 3: /usr/bin/X (0x400000+0x71fa8) [0x471fa8]
(EE) 4: /usr/bin/X (0x400000+0x99e3d) [0x499e3d]
(EE) 5: /lib64/libpthread.so.0 (0x7f9da2ef7000+0xf670) [0x7f9da2f06670]
(EE) 6: /lib64/libc.so.6 (ioctl+0x7) [0x7f9da1170a67]
(EE) 7: /usr/lib64/libdrm.so.2 (drmIoctl+0x28) [0x7f9da2cef5a8]
(EE) 8: /usr/lib64/libdrm.so.2 (drmCommandWriteRead+0x1c) [0x7f9da2cf199c]
(EE) 9: /usr/lib64/libdrm_radeon.so.1 (0x7f9da0596000+0x2029) [0x7f9da0598029]
(EE) 10: /usr/lib64/libdrm_radeon.so.1 (0x7f9da0596000+0x2244) [0x7f9da0598244]
(EE) 11: /usr/lib64/xorg/modules/drivers/radeon_drv.so (0x7f9da07a0000+0x1cc3a)
[0x7f9da07bcc3a]
(EE) 12: /usr/lib64/xorg/modules/libexa.so (0x7f9d9ff6d000+0x4ad7)
[0x7f9d9ff71ad7]
(EE) 13: /usr/lib64/xorg/modules/libexa.so (0x7f9d9ff6d000+0x8089)
[0x7f9d9ff75089]
(EE) 14: /usr/lib64/xorg/modules/libexa.so (0x7f9d9ff6d000+0x4cfb)
[0x7f9d9ff71cfb]
(EE) 15: /usr/bin/X (0x400000+0x106840) [0x506840]
(EE) 16: /usr/bin/X (ValidateGC+0x1c) [0x447c1c]
(EE) 17: /usr/bin/X (0x400000+0xfa7aa) [0x4fa7aa]
(EE) 18: /usr/bin/X (miCompositeRects+0x75) [0x4fa945]
(EE) 19: /usr/bin/X (0x400000+0x102961) [0x502961]
(EE) 20: /usr/bin/X (0x400000+0x35d11) [0x435d11]
(EE) 21: /usr/bin/X (0x400000+0x25475) [0x425475]
(EE) 22: /lib64/libc.so.6 (__libc_start_main+0xf5) [0x7f9da10a1d85]
(EE) 23: /usr/bin/X (0x400000+0x257bd) [0x4257bd]
(EE)
[197084.987] [mi] Increasing EQ size to 512 to prevent dropped events.
[197084.987] [mi] EQ processing has resumed after 929 dropped events.
[197084.987] [mi] This may be caused my a misbehaving driver monopolizing the
server's resources.
(EE) [mi] EQ overflowing. Additional events will be discarded until existing
events are processed.

-- 
You are receiving this mail because:
You are the assignee for the bug.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.x.org/archives/xorg-driver-ati/attachments/20130514/6d4a7ee1/attachment-0001.html>


More information about the xorg-driver-ati mailing list