svn commit: r272800 - head/sys/x86/acpica
John Baldwin
jhb at freebsd.org
Thu Oct 9 21:21:24 UTC 2014
On Thursday, October 09, 2014 2:23:10 pm Konstantin Belousov wrote:
> On Thu, Oct 09, 2014 at 05:34:29AM +0000, Adrian Chadd wrote:
> > Author: adrian
> > Date: Thu Oct 9 05:34:28 2014
> > New Revision: 272800
> > URL: https://svnweb.freebsd.org/changeset/base/272800
> >
> > Log:
> > Missing from previous commit - keep the VM domain -> PXM mapping
> > array and use it to map PXM -> VM domain when needed.
> >
> > Differential Revision: D906
> > Reviewed by: jhb
> >
> > Modified:
> > head/sys/x86/acpica/srat.c
> >
> > Modified: head/sys/x86/acpica/srat.c
> >
==============================================================================
> > --- head/sys/x86/acpica/srat.c Thu Oct 9 05:33:25 2014 (r272799)
> > +++ head/sys/x86/acpica/srat.c Thu Oct 9 05:34:28 2014 (r272800)
> > @@ -62,6 +62,8 @@ int num_mem;
> > static ACPI_TABLE_SRAT *srat;
> > static vm_paddr_t srat_physaddr;
> >
> > +static int vm_domains[VM_PHYSSEG_MAX];
> > +
> > static void srat_walk_table(acpi_subtable_handler *handler, void
*arg);
> >
> > /*
> > @@ -247,7 +249,6 @@ check_phys_avail(void)
> > static int
> > renumber_domains(void)
> > {
> > - int domains[VM_PHYSSEG_MAX];
> > int i, j, slot;
> >
> > /* Enumerate all the domains. */
> > @@ -255,17 +256,17 @@ renumber_domains(void)
> > for (i = 0; i < num_mem; i++) {
> > /* See if this domain is already known. */
> > for (j = 0; j < vm_ndomains; j++) {
> > - if (domains[j] >= mem_info[i].domain)
> > + if (vm_domains[j] >= mem_info[i].domain)
> > break;
> > }
> > - if (j < vm_ndomains && domains[j] == mem_info[i].domain)
> > + if (j < vm_ndomains && vm_domains[j] == mem_info[i].domain)
> > continue;
> >
> > /* Insert the new domain at slot 'j'. */
> > slot = j;
> > for (j = vm_ndomains; j > slot; j--)
> > - domains[j] = domains[j - 1];
> > - domains[slot] = mem_info[i].domain;
> > + vm_domains[j] = vm_domains[j - 1];
> > + vm_domains[slot] = mem_info[i].domain;
> > vm_ndomains++;
> > if (vm_ndomains > MAXMEMDOM) {
> > vm_ndomains = 1;
> > @@ -280,15 +281,15 @@ renumber_domains(void)
> > * If the domain is already the right value, no need
> > * to renumber.
> > */
> > - if (domains[i] == i)
> > + if (vm_domains[i] == i)
> > continue;
> >
> > /* Walk the cpu[] and mem_info[] arrays to renumber. */
> > for (j = 0; j < num_mem; j++)
> > - if (mem_info[j].domain == domains[i])
> > + if (mem_info[j].domain == vm_domains[i])
> > mem_info[j].domain = i;
> > for (j = 0; j <= MAX_APIC_ID; j++)
> > - if (cpus[j].enabled && cpus[j].domain == domains[i])
> > + if (cpus[j].enabled && cpus[j].domain == vm_domains[i])
> > cpus[j].domain = i;
> > }
> > KASSERT(vm_ndomains > 0,
> > @@ -368,4 +369,23 @@ srat_set_cpus(void *dummy)
> > }
> > }
> > SYSINIT(srat_set_cpus, SI_SUB_CPU, SI_ORDER_ANY, srat_set_cpus, NULL);
> > +
> > +/*
> > + * Map a _PXM value to a VM domain ID.
> > + *
> > + * Returns the domain ID, or -1 if no domain ID was found.
> > + */
> > +int
> > +acpi_map_pxm_to_vm_domainid(int pxm)
> > +{
> > + int i;
> > +
> > + for (i = 0; i < vm_ndomains; i++) {
> > + if (vm_domains[i] == pxm)
> > + return (i);
> > + }
> > +
> > + return (-1);
> > +}
> > +
> > #endif /* MAXMEMDOM > 1 */
>
> I do not like it. Sorry for not looking at the web thing, I have very
> little time.
>
> It never was an intention that one proximity domain reported by ACPI
> was mapped to single VM domain. VM could split domains (in terms of
> vm_domains) further for other reasons. Main motivation is that there
> is 1:1 relations between domain/page queues/page queues locks/pagedaemons.
>
> I have patches in WIP stage which split firmware proximity domains
> further, to decrease congestion on the page queue locks. I wrote about
> this in the pgsql performance report.
>
> The short version is that there is/will be N:1 relation between VM domains
> and proximity domains (which is reported by ACPI for devices).
_PXM is also defined to be what SRAT reports for memory, and is what will be
used to do NUMA-aware memory allocations. While the VM system may decide to
split a given NUMA domain into multiple some-other-things, those some-other-
things won't be a NUMA domain anymore. At that point, you will need to
divorce them from 'domain' and use another term as the domain index into
mem_info[] will still be needed so that NUMA allocations do the correct thing.
--
John Baldwin
More information about the svn-src-head
mailing list