[Bug 288565] graphics/nvidia-drm-61-kmod and 66-kmod stopped working on 15-current

From: <bugzilla-noreply_at_freebsd.org>
Date: Thu, 31 Jul 2025 12:38:39 UTC
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=288565

--- Comment #9 from Tomoaki AOKI <junchoon@dec.sakura.ne.jp> ---
In your kldload output,

> linker_load_file: /boot/modules/nvidia-drm.ko - unsupported file type

This typically happens when there is a version mismatch between the kernel and
the modules.

I assumed (as you're using 15-Current) that you're building src locally, and
therefore also assumed that your src tree and running kernel match 100%.

But a mismatch alone usually doesn't cause coredumps. The module simply fails
to load.

So make 100% sure your src tree (required to build kernel modules, regardless
of whether you build from src or ports) matches your running kernel. If you
installed from a snapshot image without the src distribution and pulled the
src repo later (usually with git or gitup), there can be mismatches. If you're
pulling src using git, you can check out exactly the same commit as the running
kernel using `git checkout`, as long as your clone is at the same or a newer
state.
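
As a rough illustration of that check, here is a minimal Python sketch. It
assumes the kernel was built from a git checkout, so that `uname -v` contains a
`<branch>-n<count>-<hash>` tag; the function names and the sample string are
made up for this example and are not part of any FreeBSD tool.

```python
import re
from typing import Optional


def kernel_commit(uname_v: str) -> Optional[str]:
    """Extract the abbreviated git hash from `uname -v` output.

    Assumes the "<branch>-n<count>-<hash>" tag that git-based builds embed;
    returns None if no such tag is present.
    """
    m = re.search(r"-n\d+-([0-9a-f]{7,40})", uname_v)
    return m.group(1) if m else None


def src_matches_kernel(uname_v: str, src_head: str) -> bool:
    """True if the src tree HEAD (full hash) starts with the kernel's hash."""
    k = kernel_commit(uname_v)
    return k is not None and src_head.startswith(k)


# Made-up sample values, for illustration only.
sample_uname = "FreeBSD 15.0-CURRENT #0 main-n000000-0123456789ab: GENERIC"
sample_head = "0123456789abcdef0123456789abcdef01234567"
print(src_matches_kernel(sample_uname, sample_head))
```

On a real system the two inputs would come from `uname -v` and
`git -C /usr/src rev-parse HEAD` (adjust the path if your src tree lives
elsewhere).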


As for the crashes: if you want nvidia-drm.ko to work, all three of the
following need to be in sync.

  1. x11/nvidia-driver[-devel]
  2. exactly one of graphics/drm-[61|66]-kmod for 15-Current
  3. exactly one of graphics/nvidia-drm-[61|66]-kmod[-devel]

nvidia-drm.ko depends on nvidia-modeset.ko, and nvidia-modeset.ko depends on
nvidia.ko.
This is because nvidia.ko is the module that actually controls supported NVIDIA
GPUs, nvidia-modeset.ko is a wrapper that provides a basic modesetting
interface on top of nvidia.ko, and nvidia-drm.ko is a wrapper that provides the
DRM interface on top of nvidia-modeset.ko.
So these must all be at exactly the same version (currently, 570.172.08).
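
To make the "exactly the same version" requirement concrete, here is a small
Python sketch. It assumes you have already collected a version string for each
component yourself; the helper names and the sample data are hypothetical. The
only normalization it does is dropping the ports `_N` (PORTREVISION) and `,N`
(PORTEPOCH) suffixes, so any other suffix a package may carry would need to be
handled separately.

```python
import re
from typing import Mapping


def upstream_version(pkg_version: str) -> str:
    """Strip trailing PORTREVISION ("_N") and PORTEPOCH (",N") suffixes."""
    return re.sub(r"(_\d+)?(,\d+)?$", "", pkg_version)


def in_sync(versions: Mapping[str, str]) -> bool:
    """True if every component reports the same upstream driver version."""
    return len({upstream_version(v) for v in versions.values()}) == 1


# Hypothetical sample data, for illustration only.
components = {
    "nvidia.ko": "570.172.08",
    "nvidia-modeset.ko": "570.172.08_1",
    "nvidia-drm.ko": "575.64.05",
}
print(in_sync(components))
```

With the sample data above the check fails, because nvidia-drm.ko is on the
-devel (575.64.05) version while the other two are on 570.172.08.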

nvidia-drm.ko also requires the modules provided by graphics/drm-[61|66]-kmod,
at the same version it was built against. NVIDIA doesn't provide the DRM
sources that are common to Intel and AMD GPUs, and we do not rebuild the kmods
provided by graphics/drm-*-kmod (building them twice would just be a silly
mess).

If any of them are out of sync, crashes can happen, especially if you
(accidentally or intentionally) mixed 61 and 66, or the master ports and their
-devel variants.

x11/nvidia-driver-devel and graphics/nvidia-drm-*-kmod-devel are basically for
the New Feature Branch (NFB) of the driver set, currently at 575.64.05.

  https://www.nvidia.com/en-us/drivers/unix/

Another possible cause of the crash is loading GPU-related kmods via
/boot/loader.conf.
This can cause undefined behavior (a huge kmod loaded this way could be
truncated).
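
A quick way to spot such entries is to scan loader.conf for `*_load="YES"`
lines naming GPU modules. The Python sketch below is only an illustration: the
prefix list and the function name are my own choices, and the sample text is
made up. A commonly suggested alternative is to list these modules in
`kld_list` in /etc/rc.conf instead, so they are loaded after the kernel has
booted.

```python
import re

# Module name prefixes treated as GPU-related (an assumption for this sketch).
GPU_PREFIXES = ("nvidia", "drm", "i915kms", "amdgpu", "radeonkms")

_LOAD_RE = re.compile(r'^\s*([A-Za-z0-9_.-]+)_load\s*=\s*"?(\w+)"?')


def gpu_modules_in_loader_conf(text: str) -> list:
    """Return GPU-related modules that loader.conf text enables with YES."""
    found = []
    for line in text.splitlines():
        line = line.split("#", 1)[0]  # ignore comments
        m = _LOAD_RE.match(line)
        if m and m.group(2).upper() == "YES" and m.group(1).startswith(GPU_PREFIXES):
            found.append(m.group(1))
    return found


# Made-up loader.conf content, for illustration only.
sample = 'zfs_load="YES"\nnvidia-drm_load="YES"\n# nvidia_load="YES"\n'
print(gpu_modules_in_loader_conf(sample))
```

To check a real system, pass in the contents of /boot/loader.conf (and
/boot/loader.conf.local, if you use one).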

If you're using a customised kernel configuration, that is another possible
source of breakage.
