4.7 vs 5.2.1 SMP/UP bridging performance

Bruce Evans bde at zeta.org.au
Thu May 6 10:19:01 PDT 2004


On Thu, 6 May 2004, Bruce M Simpson wrote:

> On Thu, May 06, 2004 at 10:15:44AM -0400, Andrew Gallatin wrote:
> > For what its worth, using those operations yeilds these results
> > on my 2.53GHz P4 (for UP)
> >
> > Mutex (atomic_store_rel_int) cycles per iteration: 208
> > Mutex (sfence) cycles per iteration: 85
> > Mutex (lfence) cycles per iteration: 63
> > Mutex (mfence) cycles per iteration: 169
> > Mutex (none) cycles per iteration: 18
> >
> > lfence looks like a winner..
>
> Please be aware, though, that the different FENCE instructions are acting
> as fences against different things. The NASM documentation has a good
> quick reference for what each of the instructions do, but the definitive
> reference is Intel's IA-32 programmer's reference manuals.

They are also documented in amd64 manuals.

Don't they all act as fences only on the same CPU, so they are no help
for SMP?  They are still almost twice as slow than full locks on Athlons,
so hopefully they do more.

Bruce


More information about the freebsd-current mailing list