> > Zeroing a 4096-byte page on an
> > SSE-enabled i386 should take no more than 35 SSE instructions
> The downside is that we need multiple implementations to take advantage
> of features available in different CPUs.

We already *have* multiple implementations for different CPUs in the
i386 family.  We stopped using all but a few of them early in the 5.0
cycle because they weren't SMP-safe, but the mechanism is still in
place (take a look at src/sys/i386/i386/{identcpu.c,support.s}).
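
For anyone who hasn't looked at those files, the general idea can be
sketched roughly as below. This is not the code from identcpu.c or
support.s, only a minimal userland illustration of picking one of
several routines once, based on a CPU feature flag, and then calling
it through a function pointer. Every name in it (CPU_FEATURE_SSE,
select_pagezero, wide_pagezero and so on) is made up for the example,
and the "fast" routine is a plain C loop standing in for a real
CPU-specific implementation.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

#define PAGE_SIZE 4096

/* Hypothetical feature bit; a real kernel would derive this from CPUID. */
#define CPU_FEATURE_SSE 0x1u

typedef void (*pagezero_fn)(void *page);

/* Portable fallback that is correct on every CPU. */
static void generic_pagezero(void *page)
{
    memset(page, 0, PAGE_SIZE);
}

/*
 * Stand-in for a CPU-specific routine.  It only stores word-sized
 * zeroes; a real version would use the wider stores the CPU offers.
 * The page must be suitably aligned for unsigned long.
 */
static void wide_pagezero(void *page)
{
    unsigned long *p = page;
    size_t i;

    for (i = 0; i < PAGE_SIZE / sizeof(*p); i++)
        p[i] = 0;
}

/* Routine currently in use; defaults to the safe fallback. */
static pagezero_fn pagezero_impl = generic_pagezero;

/* Called once at startup, after the CPU has been identified. */
void select_pagezero(unsigned int cpu_features)
{
    if (cpu_features & CPU_FEATURE_SSE)
        pagezero_impl = wide_pagezero;
    else
        pagezero_impl = generic_pagezero;
}

/* Callers never need to know which implementation was chosen. */
void pagezero(void *page)
{
    assert(page != NULL);
    pagezero_impl(page);
}
```

The point of the indirection is that adding another CPU-specific
routine only touches the selection step, not the callers.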

