Looks like ap_letgo use needs platform specific code to allow avoiding the "sleep-gets-stuck" problem on PowerMac11,2's . . .

Sat Apr 20 04:42:40 UTC 2019

On 2019-Apr-19, at 20:48, Justin Hibbits <chmeeedalf at gmail.com> wrote:

> On Fri, Apr 19, 2019 at 10:36 PM Mark Millard <marklmi at yahoo.com> wrote:
>> 
>> [Note: My context is tied to getting usefdt mode operable on
>> old PowerMacs. The below is only tested for usefdt mode
>> so far.]
>> 
>> The following investigatory patch has so-far stopped my having
>> sleep-gets-stuck problems (only seen on 2-socket/1-core-each
>> 970 MP G5 Powermac11,2's as far as I know):
>> 
>> # svnlite diff  /usr/src/sys/powerpc/powerpc/mp_machdep.c | more                                                                       Index: /usr/src/sys/powerpc/powerpc/mp_machdep.c
>> ===================================================================
>> --- /usr/src/sys/powerpc/powerpc/mp_machdep.c   (revision 345758)
>> +++ /usr/src/sys/powerpc/powerpc/mp_machdep.c   (working copy)
>> @@ -77,9 +77,10 @@
>>        PCPU_SET(awake, 1);
>>        __asm __volatile("msync; isync");
>> 
>> +       powerpc_sync();
>>        while (ap_letgo == 0)
>> -               __asm __volatile("or 31,31,31");
>> -       __asm __volatile("or 6,6,6");
>> +               powerpc_sync();
>> +       isync();
>> 
>>        /*
>>         * Set timebase as soon as possible to meet an implicit rendezvous
>> @@ -262,8 +263,11 @@
>>        __asm __volatile("msync; isync");
>> 
>>        /* Let APs continue */
>> -       atomic_store_rel_int(&ap_letgo, 1);
>> +       ap_letgo= 1;    // depend on prior sync, no need to lwsync first
>> 
>> +       powerpc_sync(); // analogous to what the ap's do (more similar time frame?)
>> +       if (ap_letgo) isync();
>> +
>>        platform_smp_timebase_sync(ap_timebase, 0);
>> 
>>        while (ap_awake < smp_cpus)
>> 
>> Apparently, the use of "or 31,31,31" causes sizable
>> variations in the time frame when the platform_smp_timebase_sync
>> happens on the various cores across the two 970MPs.
>> 
>> It looks something like a platform_ap_letgo_wait is appropriate,
>> with a powermac_ap_letgo_wait specific one, say. (Or, possibly,
>> AIM specific but spanning powermac?)
>> 
>> The above patch has booted and operated the 2-socket PowerMac7,2
>> context fine as well so far. I'll check the G5's and a G4
>> dual-socket with a 32-bit powerpc build.
>> 
>> I've no clue if there are any time-mismatch issues across
>> sockets/cores/hw-threads for the "8-way SMT" contexts with
>> "dozens to hundreds of CPUs".
>> 
>> I've only been testing for part of today and I do not have
>> access to any non-PowerMac PowerPC contexts. So this is
>> preliminary but I do not expect "or 31,31,31" is going to
>> be appropriate to the PowerMac11,2 contexts that caused
>> my investigation of the issue.
> 
> Those nops are just that on the G5: nops.  They do absolutely nothing
> on any processor that's not multithreaded.  On multithreaded CPUs they
> are priority hints.

I thought PowerISA 2.03 was before multi-threaded (but spanning
multi-core). It lists 31,31,31 in a table with other values. Is
there a better match to the 970MP vintage of things for me to
reference?

I'll test:

        powerpc_sync();
        while (ap_letgo == 0)
        {
                __asm __volatile("or 31,31,31");
                powerpc_sync();
        }
        __asm __volatile("or 6,6,6");
        isync();

(or some other such if you want). Let me know if you
have some specific variation you prefer.

> What most likely 'fixed' the problem for you was the addition of the
> synchronization primitives in the code path.  You can add them without
> sacrificing the nops.  Does this patch fix the boot issue on the G5
> quad without the usefdt=1 setting, and without reverting the KVA
> change?

We already had an exchange about my forcing an slb entry as
needed for pcpup->pc_curpcb to be used. It massively changed
the frequency of hangups (rare now).

As a reminder, I had added

hack_into_slb_if_needed(pcpup->pc_curpcb);

in cpudep_ap_bootstrap. This was because other
activity from:

        SI_SUB_KTHREAD_INIT     = 0xe000000,    /* init process*/
        SI_SUB_KTHREAD_PAGE     = 0xe400000,    /* pageout daemon*/
        SI_SUB_KTHREAD_VM       = 0xe800000,    /* vm daemon*/
        SI_SUB_KTHREAD_BUF      = 0xea00000,    /* buffer daemon*/
        SI_SUB_KTHREAD_UPDATE   = 0xec00000,    /* update daemon*/
        SI_SUB_KTHREAD_IDLE     = 0xee00000,    /* idle procs*/
#ifndef EARLY_AP_STARTUP
        SI_SUB_SMP              = 0xf000000,    /* start the APs*/
#endif 

was competing for slb entries and doing slb entry replacements in
parallel with the ap startup activity so sometimes no slot covered
the pcpup->pc_curpcb relted address range. (Replacement slots are
picked based on mftb()%n_slbs .)

Are you asking me to disable that call and see what happens?

With the hack_into_slb_if_needed call and the other patches
reported in https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=233863
I'm booting and using historical and usefdt modes just fine
as far as I've tested. (usefdt mode needing vt.) The patches do
not involve reverting the KVA change. One of the test machines is 
a "G5 quad" and its is the  primary one I build on and test.

===
Mark Millard
marklmi at yahoo.com
( dsl-only.net went
away in early 2018-Mar)