Re: Intermittent failure of routing/gateway with ix(4) (x86_64)
Date: Sat, 30 Aug 2025 17:44:30 UTC
On Sat, Aug 30, 2025 at 8:18 AM R Tyler Croy <rtyler@brokenco.de> wrote: > > (replies inelin) > > > On Saturday, August 23rd, 2025 at 10:32 PM, Kevin Bowling <kevin.bowling@kev009.com> wrote: > > > > > > ix is generally regarded as stable as others have stated. It would be > > helpful to see 'sysctl dev.ix' after a stall, and maybe a clean one > > prior to it. > > > > > On X540 I'd pay some attention to thermals, the PHY generates a real > > amount of heat. > > > I was feeling pretty good this week that it might be the thermals since it's been cooler and there were no issues... > > Then in the middle of the night, the issue popped up again! > > I have attached the diff on `sysctl dev.ix` before and after the stall. The ix1 is the LAN interface where gateway traffic disappears. I'm hoping more savvy eyes on this list might spot something useful in this You've got an assortment of MAC level errors going on in the 'after': +dev.ix.1.mac_stats.checksum_errs: 137 +dev.ix.1.mac_stats.rx_missed_packets: 930676 +dev.ix.1.mac_stats.rx_errs: 930676 +dev.ix.0.mac_stats.checksum_errs: 45 +dev.ix.0.mac_stats.local_faults: 563829 +dev.ix.0.mac_stats.short_discards: 6 +dev.ix.0.mac_stats.byte_errs: 6 +dev.ix.0.mac_stats.ill_errs: 6 I would be surprised if it is not a hardware issue given the MAC errors on both ports. It's been a minute since I looked at this but IIRC the thermal diode is somehow botched somewhere in the ix family so we don't get a notification in software if the PHY over temps. But it's also possible yours is already cooked or some other issue. A potentially useful hint, the X550 uses a lot less power and produces less heat. Regards, Kevin > > Cheers