[Bug 285803] graphics/nvidia-drm-66-kmod Suspend failure

From: <bugzilla-noreply_at_freebsd.org>
Date: Mon, 31 Mar 2025 10:40:24 UTC
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=285803

            Bug ID: 285803
           Summary: graphics/nvidia-drm-66-kmod Suspend failure
           Product: Ports & Packages
           Version: Latest
          Hardware: amd64
                OS: Any
            Status: New
          Severity: Affects Only Me
          Priority: ---
         Component: Individual Port(s)
          Assignee: ports-bugs@FreeBSD.org
          Reporter: ben@benhutton.com.au
                CC: ashafer@badland.io
             Flags: maintainer-feedback?(ashafer@badland.io)
                CC: ashafer@badland.io

After upgrading from graphics/nvidia-drm-66-kmod from 550.127.05.1500034_1 to
570.124.04.1500034_1 suspend no longer works. When I close the lid on my laptop
the screen just goes blank and it doesn't go into suspend mode. After a few
minutes I get dumped into the terminal then the below appears in
/var/log/messages. I also capture the following the bellow on the console:

Note: I also upgraded from FreeBSD Current 1500034 to 1500035 with both
versions having the same suspend issue.

Also note I both installed the nvidia-drm from the latest pkg I also compile
from the latest ports.

This is occurring on a Lenovo Thinkpad P1 Gen3 with a Nvidia Quadro T2000 with
the latest BIOS with FreeBSD Current and KDE Plasma 6 and SDDM. If I stay on
the current version of the DRM Drivers 550.127.05.1500034_1 suspend/resume is
working.

More than happy to assist with debugging. While I have rolled back to the
previous boot environment i did save the boot environment with the issue.

Console Errors:

DEVICE_SUSPEND(acpi0) failed: 5
DEVICE_SUSPEND(nexus0) failed: 5
acpi0: device_suspend failed

... kernel: nvidia-modeset: ERROR: GPU:0: Failed detecting connected display
devices


/var/log/messages

Mar 31 08:48:47 tesla kernel: vgapci1: child drmn1 requested pci_set_powerstate 
Mar 31 08:48:47 tesla kernel: NVRM: GPU at PCI:0000:01:00:
GPU-b3a0ee73-8822-4b5d-35ac-5c952972aa91   
Mar 31 08:48:47 tesla kernel: NVRM: Xid (PCI:0000:01:00): 120, GSP task
exception: load access fault (cause:0x5) @ pc:0x50fe804, task:1   
Mar 31 08:48:47 tesla kernel: NVRM:     Reported by libos task:0 v2.0 \[0\] @
ts:1743410864   
Mar 31 08:48:47 tesla kernel: NVRM:     RISC-V CSR State:   
Mar 31 08:48:47 tesla kernel: NVRM:         mstatus:0x000000001e000000 
mscratch:0x0000000000000000     mie:0x0000000000000880  mip:0x0000000000000000  
Mar 31 08:48:47 tesla kernel: NVRM:         mepc:0x00000000050fe804 
mbadaddr:0x0000000000000080  mcause:0x0000000000000005   
Mar 31 08:48:47 tesla kernel: NVRM:     RISC-V GPR State:   
Mar 31 08:48:47 tesla kernel: NVRM:         ra:0x00000000050fe7cc  
sp:0x0000000005b91a30   gp:0x0000000000000000   tp:0x0000000000000000   
Mar 31 08:48:47 tesla kernel: NVRM:         a0:0x0000000000000000  
a1:0x0000000005b3c060   a2:0x0000000000000004   a3:0x0000000000000000   
Mar 31 08:48:47 tesla kernel: NVRM:         a4:0x0000000000000000  
a5:0x0000000000000100   a6:0x0000000000000001   a7:0x0000000000000003   
Mar 31 08:48:47 tesla kernel: NVRM:         s0:0x0000000005b91a90  
s1:0x0000000005b91ad0   s2:0x0000000005b91ab0   s3:0x0000000004775cb0   
Mar 31 08:48:47 tesla kernel: NVRM:         s4:0x800000000019f710  
s5:0x80000000001b9a90   s6:0x800000000022bbb0   s7:0x00000000041db000   
Mar 31 08:48:47 tesla kernel: NVRM:         s8:0x80000000003325d0  
s9:0x00000000041db000  s10:0x0000000000000000  s11:0x0000000000073d00   
Mar 31 08:48:47 tesla kernel: NVRM:         t0:0x0000000000000009  
t1:0x0000000005a86dec   t2:0x800000000003ef70   t3:0x0000000000000020   
Mar 31 08:48:47 tesla kernel: NVRM:         t4:0x0000000000000000  
t5:0x0000000005b91811   t6:0x0000000000000000   
Mar 31 08:48:47 tesla kernel: NVRM:     Stack Trace:   
Mar 31 08:48:47 tesla kernel: NVRM:         0x00000000050fe804   
Mar 31 08:48:47 tesla kernel: NVRM:         0x00000000051133dc   
Mar 31 08:48:47 tesla kernel: NVRM:         0x000000000511d8e4   
Mar 31 08:48:47 tesla kernel: NVRM:         0x00000000050b8098   
Mar 31 08:48:47 tesla kernel: NVRM:         0x000000000521ecd4   
Mar 31 08:48:47 tesla kernel: NVRM:         0x0000000005ad099c   
Mar 31 08:48:47 tesla kernel: NVRM:         0x0000000005a86398   
Mar 31 08:48:47 tesla kernel: NVRM:         0x0000000005a88a80   
Mar 31 08:48:47 tesla kernel: NVRM:         0x0000000005aa2c6c   
Mar 31 08:48:47 tesla kernel: NVRM:         0x0000000005aadd68   
Mar 31 08:48:47 tesla kernel: NVRM:     PC Trace:   
Mar 31 08:48:47 tesla kernel: NVRM:         0x0000000004018064 
0x000000000401a8a8  0x0000000004018d0c  0x000000000400e35c  0x0000000004018c4c  
Mar 31 08:48:47 tesla kernel: NVRM:         0x000000000401832c 
0x000000000400e35c  0x00000000040183a0  0x0000000004018c3c  0x0000000004018110  
Mar 31 08:48:47 tesla kernel: NVRM:         0x0000000004018c1c 
0x000000000401aa84  0x0000000004018a84  0x00000000040181d8  0x0000000004018b90  
Mar 31 08:48:47 tesla kernel: NVRM:         0x000000000401832c 
0x000000000400e35c  0x00000000040183a0  0x0000000004018bc8  0x00000000040181d8  
Mar 31 08:48:47 tesla kernel: NVRM:         0x0000000004018b90 
0x000000000401832c  0x000000000400e35c  0x00000000040183a0  0x0000000004018bc8  
Mar 31 08:48:47 tesla kernel: NVRM:         0x00000000040181d8 
0x0000000004018b90  0x000000000401832c  0x000000000400e35c  0x00000000040183a0  
Mar 31 08:48:47 tesla kernel: NVRM:         0x0000000004018bc8 
0x00000000040181d8  0x0000000004018b90  0x000000000401832c  0x000000000400e35c  
Mar 31 08:48:47 tesla kernel: NVRM:         0x00000000040183a0   
Mar 31 08:48:47 tesla kernel: NVRM:     External I/O Register State:   
Mar 31 08:48:47 tesla kernel: NVRM:         0x00111360:0x00000000  
0x00111364:0xbadf5108   0x00111368:0x0000e828   0x0011136c:0x00000000   
Mar 31 08:48:47 tesla kernel: NVRM:         0x001112b4:0x00040040  
0x001112b8:0x00000040   0x001112bc:0x00000000   0x00111344:0x11100000   
Mar 31 08:48:47 tesla kernel: NVRM:         0x00110008:0x00008050  
0x0011010c:0x00000000   0x00110118:0x00012022   0x00110110:0x003f8950   
Mar 31 08:48:47 tesla kernel: NVRM:         0x00110128:0x00000000  
0x00110114:0x00005060   0x0011011c:0x00000010   
Mar 31 08:48:47 tesla kernel: NVRM:     ------------\[ end crash report
\]------------   
Mar 31 08:48:47 tesla kernel: NVRM: Xid (PCI:0000:01:00): 119, Timeout after 6s
of waiting for RPC response from GPU0 GSP! Expected function 47
(UNLOADING_GUEST_DRIVER) (0x0 0x0).   
Mar 31 08:48:47 tesla kernel: NVRM: GPU0 GSP RPC buffer contains function 4128
(GSP_POST_NOCAT_RECORD) and data 0x0000000000000005 0x00000000050fe7cc.   
Mar 31 08:48:47 tesla kernel: NVRM: GPU0 RPC history (CPU -> GSP):   
Mar 31 08:48:47 tesla kernel: NVRM:     entry function                   data0 
            data1              ts_start           ts_end             duration
actively_polling   
Mar 31 08:48:47 tesla kernel: NVRM:      0    47   UNLOADING_GUEST_DRIVE
0x0000000000000000 0x0000000000000000 0x00063198c4b1748d 0x0000000000000000    
     y   
Mar 31 08:48:47 tesla kernel: NVRM:     -1    76   GSP_RM_CONTROL       
0x0000000020800301 0x0000000000000014 0x00063198c4a61a4c 0x00063198c4a61a4c     
Mar 31 08:48:47 tesla kernel: NVRM:     -2    103  GSP_RM_ALLOC         
0x000000000000007e 0x0000000000000018 0x00063198c4a61664 0x00063198c4a61a4c  
1000us     
Mar 31 08:48:47 tesla kernel: NVRM:     -3    76   GSP_RM_CONTROL       
0x00000000a06f0103 0x0000000000000002 0x00063198c4a6127c 0x00063198c4a61664  
1000us     
Mar 31 08:48:47 tesla kernel: NVRM:     -4    103  GSP_RM_ALLOC         
0x000000000000c5b5 0x0000000000000008 0x00063198c4a60e96 0x00063198c4a6127c   
998us     
Mar 31 08:48:47 tesla kernel: NVRM:     -5    103  GSP_RM_ALLOC         
0x000000000000c46f 0x0000000000000170 0x00063198c4a5c074 0x00063198c4a5d3fc  
5000us     
Mar 31 08:48:47 tesla kernel: NVRM:     -6    76   GSP_RM_CONTROL       
0x0000000020802a08 0x0000000000000004 0x00063198c4a5bc8c 0x00063198c4a5c074  
1000us     
Mar 31 08:48:47 tesla kernel: NVRM:     -7    10   FREE                 
0x000000000000000c 0x0000000000000000 0x00063198c4a5bc8c 0x00063198c4a5bc8c     
Mar 31 08:48:47 tesla kernel: NVRM: GPU0 RPC event history (CPU <- GSP):   
Mar 31 08:48:47 tesla kernel: NVRM:     entry function                   data0 
            data1              ts_start           ts_end             duration
during_incomplete_rpc   
Mar 31 08:48:47 tesla kernel: NVRM:      0    4128 GSP_POST_NOCAT_RECORD
0x0000000000000005 0x00000000050fe7cc 0x00063198c4b27e2d 0x00063198c4b27e2d    
     y   
Mar 31 08:48:47 tesla kernel: NVRM:     -1    4128 GSP_POST_NOCAT_RECORD
0x0000000000000002 0x0000000000000028 0x00063198c4b1ce64 0x00063198c4b1ce64    
     y   
Mar 31 08:48:47 tesla kernel: NVRM:     -2    4111 PERF_BRIDGELESS_INFO\_
0x0000000000000000 0x0000000000000000 0x00063198c481bdb5 0x00063198c481bdb5     
Mar 31 08:48:47 tesla kernel: NVRM:     -3    4111 PERF_BRIDGELESS_INFO\_
0x0000000000000000 0x0000000000000000 0x00063198c478a38f 0x00063198c478a38f     
Mar 31 08:48:47 tesla kernel: NVRM:     -4    4099 POST_EVENT           
0x0000000000000000 0x0000000000000000 0x00063198c476193b 0x00063198c476193b     
Mar 31 08:48:47 tesla kernel: NVRM:     -5    4099 POST_EVENT           
0x0000000000000000 0x0000000000000000 0x00063198c476193b 0x00063198c476193b     
Mar 31 08:48:47 tesla kernel: NVRM:     -6    4099 POST_EVENT           
0x0000000000000000 0x0000000000000000 0x00063198c475d6df 0x00063198c475d6df     
Mar 31 08:48:47 tesla kernel: NVRM:     -7    4099 POST_EVENT           
0x0000000000000000 0x0000000000000000 0x00063198c475d6df 0x00063198c475d6df     
Mar 31 08:48:47 tesla kernel: #0 0xffffffff855a9d28 at os_dump_stack+0x18   
Mar 31 08:48:47 tesla kernel: #1 0xffffffff84ebdc68 at nv013200rm+0x508   
Mar 31 08:48:47 tesla kernel: DEVICESUSPEND(nvidia0) failed: 5   
Mar 31 08:48:47 tesla kernel: DEVICE_SUSPEND(vgapci0) failed: 5   
Mar 31 08:48:47 tesla kernel: DEVICE_SUSPEND(pci1) failed: 5   
Mar 31 08:48:47 tesla kernel: DEVICE_SUSPEND(pcib1) failed: 5   
Mar 31 08:48:47 tesla kernel: vgapci1: child drmn1 requested pci_set_powerstate 
Mar 31 08:48:47 tesla kernel: vgapci1: child drmn1 requested pci_enable_io   
Mar 31 08:48:47 tesla syslogd: last message repeated 1 times   
Mar 31 08:48:47 tesla kernel: pci6: failed to set ACPI power state D3 on
\\\_SB\_.PCI0.RP07.PXSX: AE_BAD_PARAMETER

-- 
You are receiving this mail because:
You are the assignee for the bug.