From: "Rodney W. Grimes"
Subject: Re: Panic with "Unrecoverable machine check exception"
To: sgk@troutmask.apl.washington.edu
CC: Alexander Motin, FreeBSD Current
Date: Fri, 27 Jun 2025 18:37:07 -0700 (PDT)
Message-Id: <202506280137.55S1b7Fe063055@gndrsh.dnsmgr.net>
List-Archive: https://lists.freebsd.org/archives/freebsd-current

> Thanks for the quick reply.
> You're probably correct about
> a hardware issue.  This system is getting long in the tooth.
> Panic occurs if I run a few very intensive floating point
> computations.  Console logs do not show anything unusual.
> I need to go investigate how to up BMC.

Check for CPU overheating.  As systems age, the thermal compound
between the CPU and heatsink often dries out and becomes ineffective.

> --
> steve
>
>
> On Fri, Jun 27, 2025 at 04:52:53PM -0400, Alexander Motin wrote:
> > Hi.
> >
> > It is likely a hardware failure, unless a false positive.  Check last
> > console logs or better BMC logs for detail.
> >
> > On Fri, Jun 27, 2025, 16:26 Steve Kargl
> > wrote:
> >
> > > Never seen this type of panic.  I can upload
> > > core.txt.4 and info.4 to freefall if someone
> > > wants more details.
> > >
> > >
> > > __curthread () at /usr/src/sys/amd64/include/pcpu_aux.h:57
> > > 57              __asm("movq %%gs:%c1,%0" : "=r" (td)
> > > (kgdb) #0  __curthread () at /usr/src/sys/amd64/include/pcpu_aux.h:57
> > >         td =
> > > #1  doadump (textdump=textdump@entry=1)
> > >     at /usr/src/sys/kern/kern_shutdown.c:399
> > >         error = 0
> > >         coredump =
> > > #2  0xffffffff80676e30 in kern_reboot (howto=260)
> > >     at /usr/src/sys/kern/kern_shutdown.c:519
> > >         once = 0
> > >         __pc = 0x0
> > > #3  0xffffffff80677367 in vpanic (
> > >     fmt=0xffffffff80abdb94 "Unrecoverable machine check exception",
> > >     ap=ap@entry=0xfffffe001c024ed0) at
> > > /usr/src/sys/kern/kern_shutdown.c:974
> > >         buf = "Unrecoverable machine check exception", '\000'
> > times>
> > >         __pc = 0x0
> > >         __pc = 0x0
> > >         __pc = 0x0
> > >         other_cpus = {__bits = {251, 0 }}
> > >         td = 0xfffff80004601000
> > >         bootopt =
> > >         newpanic =
> > > #4  0xffffffff80677193 in panic (fmt=)
> > >     at /usr/src/sys/kern/kern_shutdown.c:887
> > >         ap = {{gp_offset = 8, fp_offset = 48,
> > >             overflow_arg_area = 0xfffffe001c024f00,
> > >             reg_save_area = 0xfffffe001c024ea0}}
> > > #5  0xffffffff809d391b in mca_intr () at /usr/src/sys/x86/x86/mca.c:1540
> > >         recoverable = false
> > >         mcg_status =
> > >         count =
> > >         lmcs =
> > > #6
> > > No locals.
> > > #7  mi_switch (flags=flags@entry=262) at
> > > /usr/src/sys/kern/kern_synch.c:537
> > >         __pc = 0x0
> > >         __pc = 0x0
> > >         __pc = 0x0
> > >         __pc = 0x0
> > >         __pc = 0x0
> > >         td = 0xfffff80004601000
> > >         new_switchtime =
> > >         runtime =
> > > #8  0xffffffff806aaca4 in sched_idletd (dummy=dummy@entry=0x0)
> > >     at /usr/src/sys/kern/sched_4bsd.c:1688
> > >         __pc = 0x0
> > >         stat =
> > > #9  0xffffffff806324b2 in fork_exit (
> > >     callout=0xffffffff806aac50 , arg=0x0,
> > >     frame=0xfffffe00d5407f40) at /usr/src/sys/kern/kern_fork.c:1153
> > >         __pc = 0x0
> > >         __pc = 0x0
> > >         td = 0xfffff80004601000
> > >         p = 0xfffffe00c3002040
> > >         dtd =
> > > #10
> > >
> > >
> > > --
> > > Steve
> > >
> >
> > --
> > Alexander Motin
>
> --
> Steve
>

--
Rod Grimes                                                 rgrimes@freebsd.org