From nobody Tue Jun 21 00:57:05 2022 X-Original-To: freebsd-current@mlmmj.nyi.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2610:1c1:1:606c::19:1]) by mlmmj.nyi.freebsd.org (Postfix) with ESMTP id EB0D8871686 for ; Tue, 21 Jun 2022 00:57:24 +0000 (UTC) (envelope-from ultima1252@gmail.com) Received: from mail-lj1-x230.google.com (mail-lj1-x230.google.com [IPv6:2a00:1450:4864:20::230]) (using TLSv1.3 with cipher TLS_AES_128_GCM_SHA256 (128/128 bits) key-exchange X25519 server-signature RSA-PSS (4096 bits) server-digest SHA256 client-signature RSA-PSS (2048 bits) client-digest SHA256) (Client CN "smtp.gmail.com", Issuer "GTS CA 1D4" (verified OK)) by mx1.freebsd.org (Postfix) with ESMTPS id 4LRp5J0YK8z3DSw for ; Tue, 21 Jun 2022 00:57:24 +0000 (UTC) (envelope-from ultima1252@gmail.com) Received: by mail-lj1-x230.google.com with SMTP id d19so13672939lji.10 for ; Mon, 20 Jun 2022 17:57:24 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=nRRlkP+oml8dggbf2pUr8ybyDdg+5UseGbx4VYgZauk=; b=n2Vvpnvlw2EOm6k5Mt9gsUc5VpLWn/v5pYwcHgNkwfpot0ebl9XLVkprtcwC8IKdUB CJTlY4gXuRUMrrxVR6m5R8RgKxizjnxkEBdN8SvJ2Hz1DFhchUFxnAvmxQ9Nf8xAe23P IeGbIaXgzs5oWHZNA49rRIDQ2RZwH3vZPvz9H4FDtezv35euS/KTHxtdgx6tVeX6Kddb 1Ue9+moUNH2cXw8UAdAlqP2sYFaROW+moeOckMGYPgz5qcHVWqXRab42JocE/Oni7ELw aF9DmzS5/rFMSXk/3dVYwd+0hwo9CBVG99bnoI36AB5xH3s0oqfLsJTf7ZJx5t0HjLFm vvQQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=nRRlkP+oml8dggbf2pUr8ybyDdg+5UseGbx4VYgZauk=; b=NzetYI/kHeACjLR3dz2zfoIjtBZ5ThBVb4c/V0i+H048K2fxw6bDEBD6PO/OYfyQes x/JliERTWYDwr7+xxBj93jrp2wD4mLecfDFEEkQiEjYUghO75IcEyoutiXDb5d8nKGcF z5MFWD7XOKx0y+rqFzZ3+LNahq7Fys0cjy5JsvTZ7lVBszV7K69yNK82Yi2H87FAf+yn 2OObCHtn+6sErACD7YGVh/mFh3QEKkpDQHCdfbkevXOaUJedxd+kDynLX6tDjlz28lxb xpuZo/9BEV3jq4U9vx4ApXTeYqRkLtIT/4cY80H3TN7BwDhH36P4j6cqCQSC3kCIWwGg aGKw== X-Gm-Message-State: AJIora9m0vuIY7pjLAtaaCFWdrKs65OkHR7ZURTwprJCZ1kLUrqwBIYC 9BRG+nvoYeWfhcB2JihLzmiaIGu/i5Sexs1HeaPffEDh X-Google-Smtp-Source: AGRyM1u+1hy3o6J8uFyrBVgmJpQFNKxHmPEn/bb/IhCotZtXkulthwjHAhVr9yl4paVfFtUG5X/B+y0ekfuaV4vVGfg= X-Received: by 2002:a2e:5743:0:b0:25a:7177:a2ab with SMTP id r3-20020a2e5743000000b0025a7177a2abmr2802863ljd.173.1655773036730; Mon, 20 Jun 2022 17:57:16 -0700 (PDT) List-Id: Discussions about the use of FreeBSD-current List-Archive: https://lists.freebsd.org/archives/freebsd-current List-Help: List-Post: List-Subscribe: List-Unsubscribe: Sender: owner-freebsd-current@freebsd.org MIME-Version: 1.0 References: <49c1dfe9bbe9912282b0f0339a0c077b@lerctr.org> In-Reply-To: <49c1dfe9bbe9912282b0f0339a0c077b@lerctr.org> From: Ultima Date: Mon, 20 Jun 2022 17:57:05 -0700 Message-ID: Subject: Re: MCE: Does this look possibly like a slot issue? To: Larry Rosenman Cc: Freebsd current Content-Type: multipart/alternative; boundary="000000000000aedc2c05e1eab3a7" X-Rspamd-Queue-Id: 4LRp5J0YK8z3DSw X-Spamd-Bar: -- Authentication-Results: mx1.freebsd.org; dkim=pass header.d=gmail.com header.s=20210112 header.b=n2Vvpnvl; dmarc=pass (policy=none) header.from=gmail.com; spf=pass (mx1.freebsd.org: domain of ultima1252@gmail.com designates 2a00:1450:4864:20::230 as permitted sender) smtp.mailfrom=ultima1252@gmail.com X-Spamd-Result: default: False [-2.75 / 15.00]; FREEMAIL_FROM(0.00)[gmail.com]; R_SPF_ALLOW(-0.20)[+ip6:2a00:1450:4000::/36]; MID_RHS_MATCH_FROMTLD(0.00)[]; TO_DN_ALL(0.00)[]; DKIM_TRACE(0.00)[gmail.com:+]; RCPT_COUNT_TWO(0.00)[2]; DMARC_POLICY_ALLOW(-0.50)[gmail.com,none]; NEURAL_HAM_SHORT(-0.75)[-0.748]; FROM_EQ_ENVFROM(0.00)[]; MIME_TRACE(0.00)[0:+,1:+,2:~]; FREEMAIL_ENVFROM(0.00)[gmail.com]; ASN(0.00)[asn:15169, ipnet:2a00:1450::/32, country:US]; SUBJECT_ENDS_QUESTION(1.00)[]; DWL_DNSWL_NONE(0.00)[gmail.com:dkim]; ARC_NA(0.00)[]; NEURAL_HAM_MEDIUM(-1.00)[-1.000]; R_DKIM_ALLOW(-0.20)[gmail.com:s=20210112]; FROM_HAS_DN(0.00)[]; NEURAL_HAM_LONG(-1.00)[-1.000]; MIME_GOOD(-0.10)[multipart/alternative,text/plain]; PREVIOUSLY_DELIVERED(0.00)[freebsd-current@freebsd.org]; TO_MATCH_ENVRCPT_SOME(0.00)[]; RCVD_IN_DNSWL_NONE(0.00)[2a00:1450:4864:20::230:from]; MLMMJ_DEST(0.00)[freebsd-current]; RCVD_COUNT_TWO(0.00)[2]; RCVD_TLS_ALL(0.00)[] X-ThisMailContainsUnwantedMimeParts: N --000000000000aedc2c05e1eab3a7 Content-Type: text/plain; charset="UTF-8" Hey Larry, One red flag I am seeing is that the error is being produced on the same CPU/bank with each error you have provided so far. Can you try and follow my original recommendation and swap currently installed DIMM with the problem DIMM slot and see if anything changes? Can you also provide the motherboard model? Also, do you have multiple CPUs installed in this system? Best regards, Richard Gallamore On Mon, Jun 20, 2022 at 5:41 PM Larry Rosenman wrote: > Yes and Yes. > > > On 06/20/2022 7:37 pm, Ultima wrote: > > Are you sure that the module you replaced it with was good? > Are you sure you replaced the correct module? > > Best regards, > Richard Gallamore > > On Mon, Jun 20, 2022 at 5:23 PM Larry Rosenman wrote: > > I'm seeing them constantly: > > root@freenas[~]# mcelog --dmi > Hardware event. This is not a software error. > MCE 0 > CPU 22 BANK 8 TSC 20aab486464a > MISC ac29890200046444 ADDR ee2f6e800 > TIME 1655770989 Mon Jun 20 19:23:09 2022 > MCG status: > Memory read ECC error > Memory corrected error count (CORE_ERR_CNT): 1 > Memory transaction Tracker ID (RTId): 44 > Memory DIMM ID of error: 0 > Memory channel ID of error: 1 > Memory ECC syndrome: ac298902 > STATUS 8c0000400001009f MCGSTATUS 0 > MCGCAP 1c09 APICID 34 SOCKETID 0 > CPUID Vendor Intel Family 6 Model 44 Step 2 > WARNING: SMBIOS data is often unreliable. Take with a grain of salt! > DDR3 DIMM 800 Mhz Other Width 72 Data Width 64 Size 4 GB > Device Locator: P2-DIMM2C > Bank Locator: BANK14 > Manufacturer: Hyundai > Serial Number: 40F3C20F > Asset Tag: > Part Number: HMT151R7BFR4C-H9 > Hardware event. This is not a software error. > MCE 1 > CPU 22 BANK 8 TSC 296dfcc82582 > MISC ac29890200041381 ADDR ee2f6e800 > TIME 1655770989 Mon Jun 20 19:23:09 2022 > MCG status: > Memory read ECC error > Memory corrected error count (CORE_ERR_CNT): 1 > Memory transaction Tracker ID (RTId): 81 > Memory DIMM ID of error: 0 > Memory channel ID of error: 1 > Memory ECC syndrome: ac298902 > STATUS 8c0000400001009f MCGSTATUS 0 > MCGCAP 1c09 APICID 34 SOCKETID 0 > CPUID Vendor Intel Family 6 Model 44 Step 2 > DDR3 DIMM 800 Mhz Other Width 72 Data Width 64 Size 4 GB > Device Locator: P2-DIMM2C > Bank Locator: BANK14 > Manufacturer: Hyundai > Serial Number: 40F3C20F > Asset Tag: > Part Number: HMT151R7BFR4C-H9 > Hardware event. This is not a software error. > MCE 2 > CPU 22 BANK 8 TSC 2a5604a6a070 > MISC ac29890200044281 > TIME 1655770989 Mon Jun 20 19:23:09 2022 > MCG status: > Memory ECC error occurred during scrub > Memory corrected error count (CORE_ERR_CNT): 1 > Memory transaction Tracker ID (RTId): 81 > Memory DIMM ID of error: 0 > Memory channel ID of error: 1 > Memory ECC syndrome: ac298902 > STATUS 88000040000200cf MCGSTATUS 0 > MCGCAP 1c09 APICID 34 SOCKETID 0 > CPUID Vendor Intel Family 6 Model 44 Step 2 > Hardware event. This is not a software error. > MCE 3 > CPU 22 BANK 8 TSC 31e141418eb8 > MISC ac29890200046a4a ADDR ee2f6e800 > TIME 1655770989 Mon Jun 20 19:23:09 2022 > MCG status: > Memory read ECC error > Memory corrected error count (CORE_ERR_CNT): 1 > Memory transaction Tracker ID (RTId): 4a > Memory DIMM ID of error: 0 > Memory channel ID of error: 1 > Memory ECC syndrome: ac298902 > STATUS 8c0000400001009f MCGSTATUS 0 > MCGCAP 1c09 APICID 34 SOCKETID 0 > CPUID Vendor Intel Family 6 Model 44 Step 2 > DDR3 DIMM 800 Mhz Other Width 72 Data Width 64 Size 4 GB > Device Locator: P2-DIMM2C > Bank Locator: BANK14 > Manufacturer: Hyundai > Serial Number: 40F3C20F > Asset Tag: > Part Number: HMT151R7BFR4C-H9 > Hardware event. This is not a software error. > MCE 4 > CPU 22 BANK 8 TSC 3a014afee106 > MISC ac29890200046646 ADDR ee2f6e800 > TIME 1655770989 Mon Jun 20 19:23:09 2022 > MCG status: > Memory read ECC error > Memory corrected error count (CORE_ERR_CNT): 1 > Memory transaction Tracker ID (RTId): 46 > Memory DIMM ID of error: 0 > Memory channel ID of error: 1 > Memory ECC syndrome: ac298902 > STATUS 8c0000400001009f MCGSTATUS 0 > MCGCAP 1c09 APICID 34 SOCKETID 0 > CPUID Vendor Intel Family 6 Model 44 Step 2 > DDR3 DIMM 800 Mhz Other Width 72 Data Width 64 Size 4 GB > Device Locator: P2-DIMM2C > Bank Locator: BANK14 > Manufacturer: Hyundai > Serial Number: 40F3C20F > Asset Tag: > Part Number: HMT151R7BFR4C-H9 > Hardware event. This is not a software error. > MCE 5 > CPU 22 BANK 8 TSC 41d1dbef1a6a > MISC ac29890200046141 ADDR ee2f6e800 > TIME 1655770989 Mon Jun 20 19:23:09 2022 > MCG status: > Memory read ECC error > Memory corrected error count (CORE_ERR_CNT): 1 > Memory transaction Tracker ID (RTId): 41 > Memory DIMM ID of error: 0 > Memory channel ID of error: 1 > Memory ECC syndrome: ac298902 > STATUS 8c0000400001009f MCGSTATUS 0 > MCGCAP 1c09 APICID 34 SOCKETID 0 > CPUID Vendor Intel Family 6 Model 44 Step 2 > DDR3 DIMM 800 Mhz Other Width 72 Data Width 64 Size 4 GB > Device Locator: P2-DIMM2C > Bank Locator: BANK14 > Manufacturer: Hyundai > Serial Number: 40F3C20F > Asset Tag: > Part Number: HMT151R7BFR4C-H9 > Hardware event. This is not a software error. > MCE 6 > CPU 22 BANK 8 TSC 4a1b1ecef446 > MISC ac29890200046a4a ADDR ee2f6e800 > TIME 1655770989 Mon Jun 20 19:23:09 2022 > MCG status: > Memory read ECC error > Memory corrected error count (CORE_ERR_CNT): 1 > Memory transaction Tracker ID (RTId): 4a > Memory DIMM ID of error: 0 > Memory channel ID of error: 1 > Memory ECC syndrome: ac298902 > STATUS 8c0000400001009f MCGSTATUS 0 > MCGCAP 1c09 APICID 34 SOCKETID 0 > CPUID Vendor Intel Family 6 Model 44 Step 2 > DDR3 DIMM 800 Mhz Other Width 72 Data Width 64 Size 4 GB > Device Locator: P2-DIMM2C > Bank Locator: BANK14 > Manufacturer: Hyundai > Serial Number: 40F3C20F > Asset Tag: > Part Number: HMT151R7BFR4C-H9 > Hardware event. This is not a software error. > MCE 7 > CPU 22 BANK 8 TSC 527bc27db776 > MISC ac29890200040386 ADDR ee2f6e800 > TIME 1655770989 Mon Jun 20 19:23:09 2022 > MCG status: > Memory read ECC error > Memory corrected error count (CORE_ERR_CNT): 1 > Memory transaction Tracker ID (RTId): 86 > Memory DIMM ID of error: 0 > Memory channel ID of error: 1 > Memory ECC syndrome: ac298902 > STATUS 8c0000400001009f MCGSTATUS 0 > MCGCAP 1c09 APICID 34 SOCKETID 0 > CPUID Vendor Intel Family 6 Model 44 Step 2 > DDR3 DIMM 800 Mhz Other Width 72 Data Width 64 Size 4 GB > Device Locator: P2-DIMM2C > Bank Locator: BANK14 > Manufacturer: Hyundai > Serial Number: 40F3C20F > Asset Tag: > Part Number: HMT151R7BFR4C-H9 > Hardware event. This is not a software error. > MCE 8 > CPU 22 BANK 8 TSC 5aa4ecdd795a > MISC ac29890200046646 ADDR ee2f6e800 > TIME 1655770989 Mon Jun 20 19:23:09 2022 > MCG status: > Memory read ECC error > Memory corrected error count (CORE_ERR_CNT): 1 > Memory transaction Tracker ID (RTId): 46 > Memory DIMM ID of error: 0 > Memory channel ID of error: 1 > Memory ECC syndrome: ac298902 > STATUS 8c0000400001009f MCGSTATUS 0 > MCGCAP 1c09 APICID 34 SOCKETID 0 > CPUID Vendor Intel Family 6 Model 44 Step 2 > DDR3 DIMM 800 Mhz Other Width 72 Data Width 64 Size 4 GB > Device Locator: P2-DIMM2C > Bank Locator: BANK14 > Manufacturer: Hyundai > Serial Number: 40F3C20F > Asset Tag: > Part Number: HMT151R7BFR4C-H9 > root@freenas[~]# <#m_3316908189451833722_NOP> > > > and I replaced the DIMM yesterday :( > > > > On 06/20/2022 7:19 pm, Ultima wrote: > > Hey Larry, > > It is possible it's the motherboard itself, but it's rare. The way I > would determine this is to swap the DIMM module with another > populated slot on the motherboard and see if the error migrated > to the new slot or not. Also, this error doesn't necessarily mean > there is a problem that needs to be addressed. If you have been > running the system for many months and you see ECC errors a > handful of times, it can probably be safely ignored. > > Best regards, > Richard Gallamore > > On Mon, Jun 20, 2022 at 3:14 PM Larry Rosenman wrote: > > I've gotten a BUNCH of these on my TrueNAS server. I've replaced this > DIMM a couple of times, and still the MCE's continue. > Is it possible it's Motherboard slot issue? > > Hardware event. This is not a software error. > MCE 8 > CPU 22 BANK 8 TSC 5aa4ecdd795a > MISC ac29890200046646 ADDR ee2f6e800 > TIME 1655762472 Mon Jun 20 17:01:12 2022 > MCG status: > Memory read ECC error > Memory corrected error count (CORE_ERR_CNT): 1 > Memory transaction Tracker ID (RTId): 46 > Memory DIMM ID of error: 0 > Memory channel ID of error: 1 > Memory ECC syndrome: ac298902 > STATUS 8c0000400001009f MCGSTATUS 0 > MCGCAP 1c09 APICID 34 SOCKETID 0 > CPUID Vendor Intel Family 6 Model 44 Step 2 > DDR3 DIMM 800 Mhz Other Width 72 Data Width 64 Size 4 GB > Device Locator: P2-DIMM2C > Bank Locator: BANK14 > Manufacturer: Hyundai > Serial Number: 40F3C20F > Asset Tag: > Part Number: HMT151R7BFR4C-H9 > > > > -- > Larry Rosenman http://www.lerctr.org/~ler > Phone: +1 214-642-9640 E-Mail: ler@lerctr.org > US Mail: 5708 Sabbia Dr, Round Rock, TX 78665-2106 > > > -- > Larry Rosenman http://www.lerctr.org/~ler > Phone: +1 214-642-9640 E-Mail: ler@lerctr.org > US Mail: 5708 Sabbia Dr, Round Rock, TX 78665-2106 > > > -- > Larry Rosenman http://www.lerctr.org/~ler > Phone: +1 214-642-9640 E-Mail: ler@lerctr.org > US Mail: 5708 Sabbia Dr, Round Rock, TX 78665-2106 > --000000000000aedc2c05e1eab3a7 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable
Hey Larry,

One red flag I am= seeing is that the error is being produced on
the same CPU/bank = with each error you have provided so far.
Can you try = and follow my original recommendation and swap
currently installe= d DIMM with the problem DIMM slot and see
if anything changes?

Can you also provide the motherboard model? Also= , do you
have multiple CPUs installed in this system?

Best regards,
Richard Gallamore
<= div>

On Mon, Jun 20, 2022 at 5:41 PM Larry Rosenman <ler@lerctr.org> wrote:

Yes and Yes.


On 06/20/2022 7:37 pm, Ult= ima wrote:

Are you sure that the module you replaced it with was good?
Are you sure you replaced the correct module?
=C2=A0
Best regards,
Richard Gallamore

On Mon, Jun 20, 2022 at 5:23 PM Larry Rosenman <ler@lerctr.= org> wrote:

I'm seeing them constantly:

root@freenas[~]# mcelog --dmi
Hardware event. This is not a software = error.
MCE 0
CPU 22 BANK 8 TSC 20aab486464a
MISC ac29890200046444 = ADDR ee2f6e800
TIME 1655770989 Mon Jun 20 19:23:09 2022
MCG status:Memory read ECC error
Memory corrected error count (CORE_ERR_CNT): 1Memory transaction Tracker ID (RTId): 44
Memory DIMM ID of error: 0Memory channel ID of error: 1
Memory ECC syndrome: ac298902
STATUS 8= c0000400001009f MCGSTATUS 0
MCGCAP 1c09 APICID 34 SOCKETID 0
CPUID Ve= ndor Intel Family 6 Model 44 Step 2
WARNING: SMBIOS data is often unreli= able. Take with a grain of salt!
DDR3 DIMM 800 Mhz Other Width 72 Data W= idth 64 Size 4 GB
Device Locator: P2-DIMM2C
Bank Locator: BANK14
M= anufacturer: Hyundai
Serial Number: 40F3C20F
Asset Tag:
Part Numbe= r: HMT151R7BFR4C-H9
Hardware event. This is not a software error.
MCE= 1
CPU 22 BANK 8 TSC 296dfcc82582
MISC ac29890200041381 ADDR ee2f6e80= 0
TIME 1655770989 Mon Jun 20 19:23:09 2022
MCG status:
Memory read= ECC error
Memory corrected error count (CORE_ERR_CNT): 1
Memory tran= saction Tracker ID (RTId): 81
Memory DIMM ID of error: 0
Memory chann= el ID of error: 1
Memory ECC syndrome: ac298902
STATUS 8c000040000100= 9f MCGSTATUS 0
MCGCAP 1c09 APICID 34 SOCKETID 0
CPUID Vendor Intel Fa= mily 6 Model 44 Step 2
DDR3 DIMM 800 Mhz Other Width 72 Data Width 64 Si= ze 4 GB
Device Locator: P2-DIMM2C
Bank Locator: BANK14
Manufacture= r: Hyundai
Serial Number: 40F3C20F
Asset Tag:
Part Number: HMT151R= 7BFR4C-H9
Hardware event. This is not a software error.
MCE 2
CPU = 22 BANK 8 TSC 2a5604a6a070
MISC ac29890200044281
TIME 1655770989 Mon = Jun 20 19:23:09 2022
MCG status:
Memory ECC error occurred during scr= ub
Memory corrected error count (CORE_ERR_CNT): 1
Memory transaction = Tracker ID (RTId): 81
Memory DIMM ID of error: 0
Memory channel ID of= error: 1
Memory ECC syndrome: ac298902
STATUS 88000040000200cf MCGST= ATUS 0
MCGCAP 1c09 APICID 34 SOCKETID 0
CPUID Vendor Intel Family 6 M= odel 44 Step 2
Hardware event. This is not a software error.
MCE 3CPU 22 BANK 8 TSC 31e141418eb8
MISC ac29890200046a4a ADDR ee2f6e800
= TIME 1655770989 Mon Jun 20 19:23:09 2022
MCG status:
Memory read ECC = error
Memory corrected error count (CORE_ERR_CNT): 1
Memory transacti= on Tracker ID (RTId): 4a
Memory DIMM ID of error: 0
Memory channel ID= of error: 1
Memory ECC syndrome: ac298902
STATUS 8c0000400001009f MC= GSTATUS 0
MCGCAP 1c09 APICID 34 SOCKETID 0
CPUID Vendor Intel Family = 6 Model 44 Step 2
DDR3 DIMM 800 Mhz Other Width 72 Data Width 64 Size 4 = GB
Device Locator: P2-DIMM2C
Bank Locator: BANK14
Manufacturer: Hy= undai
Serial Number: 40F3C20F
Asset Tag:
Part Number: HMT151R7BFR4= C-H9
Hardware event. This is not a software error.
MCE 4
CPU 22 BA= NK 8 TSC 3a014afee106
MISC ac29890200046646 ADDR ee2f6e800
TIME 16557= 70989 Mon Jun 20 19:23:09 2022
MCG status:
Memory read ECC error
M= emory corrected error count (CORE_ERR_CNT): 1
Memory transaction Tracker= ID (RTId): 46
Memory DIMM ID of error: 0
Memory channel ID of error:= 1
Memory ECC syndrome: ac298902
STATUS 8c0000400001009f MCGSTATUS 0<= br>MCGCAP 1c09 APICID 34 SOCKETID 0
CPUID Vendor Intel Family 6 Model 44= Step 2
DDR3 DIMM 800 Mhz Other Width 72 Data Width 64 Size 4 GB
Devi= ce Locator: P2-DIMM2C
Bank Locator: BANK14
Manufacturer: Hyundai
S= erial Number: 40F3C20F
Asset Tag:
Part Number: HMT151R7BFR4C-H9
Ha= rdware event. This is not a software error.
MCE 5
CPU 22 BANK 8 TSC 4= 1d1dbef1a6a
MISC ac29890200046141 ADDR ee2f6e800
TIME 1655770989 Mon = Jun 20 19:23:09 2022
MCG status:
Memory read ECC error
Memory corr= ected error count (CORE_ERR_CNT): 1
Memory transaction Tracker ID (RTId)= : 41
Memory DIMM ID of error: 0
Memory channel ID of error: 1
Memo= ry ECC syndrome: ac298902
STATUS 8c0000400001009f MCGSTATUS 0
MCGCAP = 1c09 APICID 34 SOCKETID 0
CPUID Vendor Intel Family 6 Model 44 Step 2DDR3 DIMM 800 Mhz Other Width 72 Data Width 64 Size 4 GB
Device Locator= : P2-DIMM2C
Bank Locator: BANK14
Manufacturer: Hyundai
Serial Numb= er: 40F3C20F
Asset Tag:
Part Number: HMT151R7BFR4C-H9
Hardware eve= nt. This is not a software error.
MCE 6
CPU 22 BANK 8 TSC 4a1b1ecef44= 6
MISC ac29890200046a4a ADDR ee2f6e800
TIME 1655770989 Mon Jun 20 19:= 23:09 2022
MCG status:
Memory read ECC error
Memory corrected erro= r count (CORE_ERR_CNT): 1
Memory transaction Tracker ID (RTId): 4a
Me= mory DIMM ID of error: 0
Memory channel ID of error: 1
Memory ECC syn= drome: ac298902
STATUS 8c0000400001009f MCGSTATUS 0
MCGCAP 1c09 APICI= D 34 SOCKETID 0
CPUID Vendor Intel Family 6 Model 44 Step 2
DDR3 DIMM= 800 Mhz Other Width 72 Data Width 64 Size 4 GB
Device Locator: P2-DIMM2= C
Bank Locator: BANK14
Manufacturer: Hyundai
Serial Number: 40F3C2= 0F
Asset Tag:
Part Number: HMT151R7BFR4C-H9
Hardware event. This i= s not a software error.
MCE 7
CPU 22 BANK 8 TSC 527bc27db776
MISC = ac29890200040386 ADDR ee2f6e800
TIME 1655770989 Mon Jun 20 19:23:09 2022=
MCG status:
Memory read ECC error
Memory corrected error count (C= ORE_ERR_CNT): 1
Memory transaction Tracker ID (RTId): 86
Memory DIMM = ID of error: 0
Memory channel ID of error: 1
Memory ECC syndrome: ac2= 98902
STATUS 8c0000400001009f MCGSTATUS 0
MCGCAP 1c09 APICID 34 SOCKE= TID 0
CPUID Vendor Intel Family 6 Model 44 Step 2
DDR3 DIMM 800 Mhz O= ther Width 72 Data Width 64 Size 4 GB
Device Locator: P2-DIMM2C
Bank = Locator: BANK14
Manufacturer: Hyundai
Serial Number: 40F3C20F
Asse= t Tag:
Part Number: HMT151R7BFR4C-H9
Hardware event. This is not a so= ftware error.
MCE 8
CPU 22 BANK 8 TSC 5aa4ecdd795a
MISC ac29890200= 046646 ADDR ee2f6e800
TIME 1655770989 Mon Jun 20 19:23:09 2022
MCG st= atus:
Memory read ECC error
Memory corrected error count (CORE_ERR_CN= T): 1
Memory transaction Tracker ID (RTId): 46
Memory DIMM ID of erro= r: 0
Memory channel ID of error: 1
Memory ECC syndrome: ac298902
S= TATUS 8c0000400001009f MCGSTATUS 0
MCGCAP 1c09 APICID 34 SOCKETID 0
C= PUID Vendor Intel Family 6 Model 44 Step 2
DDR3 DIMM 800 Mhz Other Width= 72 Data Width 64 Size 4 GB
Device Locator: P2-DIMM2C
Bank Locator: B= ANK14
Manufacturer: Hyundai
Serial Number: 40F3C20F
Asset Tag:
= Part Number: HMT151R7BFR4C-H9
roo= t@freenas[~]#


and I replaced the DIMM yesterday :(=C2=A0



On 06/20/2022 7:19 pm, Ultima wrote:

Hey Larry,
=C2=A0
=C2=A0It is possible it's the motherboard itself, but it's rar= e. The way I
would determine this is to swap the DIMM module with another
populated slot on the motherboard and see if the error migrated
to the new slot or not. Also, this error doesn't necessarily mean<= /div>
there is a problem that needs to be addressed. If you have been
running the system for many months and you see ECC errors a
handful of times, it can probably be safely ignored.
=C2=A0
Best regards,
Richard Gallamore

On Mon, Jun 20, 2022 at 3:14 PM Larry Rosenman <ler@lerctr.= org> wrote:
I've gotten a BUNCH of these on my TrueNAS = server.=C2=A0 I've replaced this
DIMM a couple of times, and still = the MCE's continue.
Is it possible it's Motherboard slot issue?<= br>
Hardware event. This is not a software error.
MCE 8
CPU 22 BAN= K 8 TSC 5aa4ecdd795a
MISC ac29890200046646 ADDR ee2f6e800
TIME 165576= 2472 Mon Jun 20 17:01:12 2022
MCG status:
Memory read ECC error
Me= mory corrected error count (CORE_ERR_CNT): 1
Memory transaction Tracker = ID (RTId): 46
Memory DIMM ID of error: 0
Memory channel ID of error: = 1
Memory ECC syndrome: ac298902
STATUS 8c0000400001009f MCGSTATUS 0MCGCAP 1c09 APICID 34 SOCKETID 0
CPUID Vendor Intel Family 6 Model 44 = Step 2
DDR3 DIMM 800 Mhz Other Width 72 Data Width 64 Size 4 GB
Devic= e Locator: P2-DIMM2C
Bank Locator: BANK14
Manufacturer: Hyundai
Se= rial Number: 40F3C20F
Asset Tag:
Part Number: HMT151R7BFR4C-H9


--
Larry Rosenman=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2= =A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0http://www.lerctr.org/~lerPhone: +1 214-642-9640=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 = =C2=A0 =C2=A0E-Mail: ler@lerctr.org
US Mail: 5708 Sabbia Dr, Round Rock,= TX 78665-2106


--=C2=A0<= br>Larry Rosenman =C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2= =A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0ht= tp://www.lerctr.org/~ler
Phone: +1 214-642-9640 =C2=A0=C2=A0=C2=A0= =C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2= =A0E-Mail: ler@lerctr.org
US Mail: 5708 Sabbia Dr, Round Rock, TX 78665-= 2106


--=C2=A0<= br>Larry Rosenman =C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2= =A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0ht= tp://www.lerctr.org/~ler
Phone: +1 214-642-9640 =C2=A0=C2=A0=C2=A0= =C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2=A0=C2= =A0E-Mail: ler@lerctr.o= rg
US Mail: 5708 Sabbia Dr, Round Rock, TX 78665-2106
--000000000000aedc2c05e1eab3a7--