Re: GPU hanging after upgrade to 14.1

From: Andrea Venturoli <ml_at_netfence.it>
Date: Wed, 05 Mar 2025 15:48:07 UTC
On 10/3/24 15:27, Andrea Venturoli wrote:
> Hello.
> 
> I've got a Starlite laptop which has always worked flawlessly (at least 
> WRT graphics).
> 
>> pciconf -lvb
>> vgapci0@pci0:0:2:0:     class=0x030000 rev=0x06 hdr=0x00 vendor=0x8086 
>> device=0x3184 subvendor=0x0000 subdevice=0x0000
>>     vendor     = 'Intel Corporation'
>>     device     = 'GeminiLake [UHD Graphics 605]'
>>     class      = display
>>     subclass   = VGA
>>     bar   [10] = type Memory, range 64, base 0x90000000, size 
>> 16777216, enabled
>>     bar   [18] = type Prefetchable Memory, range 64, base 0x80000000, 
>> size 268435456, enabled
>>     bar   [20] = type I/O Port, range 32, base 0x1000, size 64, enabled
> 
> 
> However, a few days ago I upgraded it from 13.3 to 14.1 and it started 
> giving troubles (usually 2-3 times a day, while performing normal 
> desktop work).
> When in happens, the screen might get garbled in different ways, it 
> might be unresponsive or not, but in the end I have to press Ctrl-Alt- 
> Backspace and start over.
> 
> What I see in the first console might be (please forgive any little 
> mistake, as I need to rewrite it from a picture):
> 
>  > drmn0: [drm] GPU HANG: ecode 9:1:85dffdfb, in MainThread [100791]
>  > drmn0: [drm] Resetting rcs0 for preemption time out
>  > drmn0: [drm] Xorg[100791] context reset due to GPU hang
> 
> or:
> 
>  > drmn0: [drm] Resetting rcs0 for CS error
>  > drmn0: [drm] MainThread[100791] context reset due to GPU hang
>  > drmn0: [drm] GPU HANG: ecode 9:1:00280001, in MainThread [100791]
>  > drmn0: [drm] GPU HANG: ecode 9:1:85dfffff, in MainThread [100791]
>  > drmn0: [drm] Resetting rcs0 for preemption time out
>  > drmn0: [drm] Xorg[100791] context reset due to GPU hang
> 
> I've seen a couple of bug reports, but those does not look the same to me.

Hello.

Could please someone with more insight that I have try and shed some 
light on this?
Please!

I've also opened a bug report:
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=282304




I understand this might not be a FreeBSD problem, coming instead from 
upstream...
I made several searches, found several suggestions and tried them:
_ setting compat.linuxkpi.i915_disable_power_well="0";
_ hw.i915kms.enable_psr=0
_ hw.i915kms.enable_dc=0 and/or compat.linuxkpi.i915_enable_dc=0.

Nothing seems to help.



To anyone with internal knowledge of X11 on FreeBSD, is this a known 
problem?
Is someone (here or upstream) working on this?
Is this fixed on newer versions (stable? current?)? Or is a fix expected 
to come (in 14.3? 15.0)?
Any workaround?
Anything I should report?
Anything I should try to help debugging this?



  bye & Thanks
	av.