Re: GPU hanging after upgrade to 14.1

From: Zaphod Beeblebrox <zbeeble_at_gmail.com>
Date: Thu, 06 Mar 2025 22:34:36 UTC
I generally recompile any kernel bits related to X on an upgrade.  Not
every upgrade brakes the in-kernel binary compat, but some do.

On Wed, Mar 5, 2025 at 10:48 AM Andrea Venturoli <ml@netfence.it> wrote:

> On 10/3/24 15:27, Andrea Venturoli wrote:
> > Hello.
> >
> > I've got a Starlite laptop which has always worked flawlessly (at least
> > WRT graphics).
> >
> >> pciconf -lvb
> >> vgapci0@pci0:0:2:0:     class=0x030000 rev=0x06 hdr=0x00 vendor=0x8086
> >> device=0x3184 subvendor=0x0000 subdevice=0x0000
> >>     vendor     = 'Intel Corporation'
> >>     device     = 'GeminiLake [UHD Graphics 605]'
> >>     class      = display
> >>     subclass   = VGA
> >>     bar   [10] = type Memory, range 64, base 0x90000000, size
> >> 16777216, enabled
> >>     bar   [18] = type Prefetchable Memory, range 64, base 0x80000000,
> >> size 268435456, enabled
> >>     bar   [20] = type I/O Port, range 32, base 0x1000, size 64, enabled
> >
> >
> > However, a few days ago I upgraded it from 13.3 to 14.1 and it started
> > giving troubles (usually 2-3 times a day, while performing normal
> > desktop work).
> > When in happens, the screen might get garbled in different ways, it
> > might be unresponsive or not, but in the end I have to press Ctrl-Alt-
> > Backspace and start over.
> >
> > What I see in the first console might be (please forgive any little
> > mistake, as I need to rewrite it from a picture):
> >
> >  > drmn0: [drm] GPU HANG: ecode 9:1:85dffdfb, in MainThread [100791]
> >  > drmn0: [drm] Resetting rcs0 for preemption time out
> >  > drmn0: [drm] Xorg[100791] context reset due to GPU hang
> >
> > or:
> >
> >  > drmn0: [drm] Resetting rcs0 for CS error
> >  > drmn0: [drm] MainThread[100791] context reset due to GPU hang
> >  > drmn0: [drm] GPU HANG: ecode 9:1:00280001, in MainThread [100791]
> >  > drmn0: [drm] GPU HANG: ecode 9:1:85dfffff, in MainThread [100791]
> >  > drmn0: [drm] Resetting rcs0 for preemption time out
> >  > drmn0: [drm] Xorg[100791] context reset due to GPU hang
> >
> > I've seen a couple of bug reports, but those does not look the same to
> me.
>
> Hello.
>
> Could please someone with more insight that I have try and shed some
> light on this?
> Please!
>
> I've also opened a bug report:
> https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=282304
>
>
>
>
> I understand this might not be a FreeBSD problem, coming instead from
> upstream...
> I made several searches, found several suggestions and tried them:
> _ setting compat.linuxkpi.i915_disable_power_well="0";
> _ hw.i915kms.enable_psr=0
> _ hw.i915kms.enable_dc=0 and/or compat.linuxkpi.i915_enable_dc=0.
>
> Nothing seems to help.
>
>
>
> To anyone with internal knowledge of X11 on FreeBSD, is this a known
> problem?
> Is someone (here or upstream) working on this?
> Is this fixed on newer versions (stable? current?)? Or is a fix expected
> to come (in 14.3? 15.0)?
> Any workaround?
> Anything I should report?
> Anything I should try to help debugging this?
>
>
>
>   bye & Thanks
>         av.
>
>