Large machine test ideas

John Baldwin jhb at freebsd.org
Mon Aug 29 18:21:21 UTC 2011


On Monday, August 29, 2011 1:28:37 pm Ivan Voras wrote:
> On 29 August 2011 18:33,  <mdf at freebsd.org> wrote:
> > On Mon, Aug 29, 2011 at 7:46 AM, Ivan Voras <ivoras at freebsd.org> wrote:
> >> On 26/08/2011 19:44, Garrett Cooper wrote:
> >>> On Fri, Aug 26, 2011 at 10:36 AM, Ivan Voras <ivoras at freebsd.org> wrote:
> >>>
> >>> ...
> >>>
> >>>> I think that I'll need a 9-CURRENT snapshot on it to run all 128 CPUs,
> >>>> right?
> >>>
> >>> A 9.0-BETA1 snapshot, yes.
> >>
> >> Well, I'll leave it another half an hour but the 9.0-BETA1 snapshot
> >> froze on boot after showing a "SRAT: No CPU found for memory domain 4".
> >
> > This message implies the memory affinity information coming from ACPI
> > is either nonsensical, or you have an unexpected physical setup where
> > there really are CPUs with no memory in the local sockets.
> >
> > You should be able to boot with something like hint.srat.0="disabled"
> > at the boot loader prompt.
> 
> Unfortunately, neither the memtest nor the SRAT-disabling tunables
> worked (I also tried disabling srat.4).
> 
> My time with the machine is over, so I can't do more testing.

The hint to set would be 'hint.srat.0.disabled=1'.
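
As a sketch (untested on the machine in question), the hint could be set
for a single boot from the loader prompt, or made persistent by adding it
to /boot/device.hints. The "OK" below is the loader's prompt, not part of
the command, and the lines starting with "#" are annotations only:

```
# One-off, at the loader prompt:
OK set hint.srat.0.disabled=1
OK boot

# Persistent, as a line in /boot/device.hints:
hint.srat.0.disabled="1"
```

Once booted, "kenv hint.srat.0.disabled" should print the value if the
hint was picked up.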

However, the SRAT code just ignores the table when it encounters an issue
like this; it doesn't hang.  Something else later in the boot must have hung.

-- 
John Baldwin


More information about the freebsd-hackers mailing list