svn commit: r334702 - head/sys/sys
Mateusz Guzik
mjguzik at gmail.com
Wed Jun 6 13:01:58 UTC 2018
On Wed, Jun 6, 2018 at 1:35 PM, Ravi Pokala <rpokala at freebsd.org> wrote:
> > + * Passing the flag down requires malloc to blindly zero the entire
> object.
> > + * In practice a lot of the zeroing can be avoided if most of the object
> > + * gets explicitly initialized after the allocation. Letting the
> compiler
> > + * zero in place gives it the opportunity to take advantage of this
> state.
>
> This part, I still don't understand. :-(
>
> The call to bzero() is still for the full length passed in, so how does
> this help?
>
>
bzero is:
#define bzero(buf, len) __builtin_memset((buf), 0, (len))
> ...
> > + * _malloc_item = malloc(_size, type, (flags) &~ M_ZERO);
> > + * if (((flags) & M_WAITOK) != 0 || _malloc_item != NULL)
> > + * bzero(_malloc_item, _size);
> > + *
> > + * If the flag is set, the compiler knows the left side is always true,
> > + * therefore the entire statement is true and the callsite is:
>
> I think you mean "... the *right* side is always true ...", since the left
> side is the check for the flag being set. "If the flag is set, compiler
> knows (the check for the flag being set) is always true" is tautological.
>
It explains how __builtin_constant_p(flags) being true allows the compiler
to optimize out the flags-based check.
I don't understand why this particular use runs into so much confusion.
Just above it there is a M_ZERO check relying on the same property and
receiving no attention.
> ...
> > + * If the flag is not set, the compiler knows the left size is always
> false
> > + * and the NULL check is needed, therefore the callsite is:
>
> Same issue here.
>
> > ...
> > #ifdef _KERNEL
> > #define malloc(size, type, flags) ({
> \
>
> Now that I'm taking another look at this, I'm confused as to why the
> entire macro expansion is inside parentheses? (The braces make sense, since
> this is a block with local variables which need to be contained.)
>
>
It is to return the value (the last expression).
> > void *_malloc_item; \
> > @@ -193,7 +228,8 @@ void *malloc(size_t size, struct malloc_type
> *type, in
> > if (__builtin_constant_p(size) && __builtin_constant_p(flags) &&\
> > ((flags) & M_ZERO) != 0) { \
> > _malloc_item = malloc(_size, type, (flags) &~ M_ZERO); \
> > - if (((flags) & M_WAITOK) != 0 || _malloc_item != NULL) \
> > + if (((flags) & M_WAITOK) != 0 || \
> > + __predict_true(_malloc_item != NULL)) \
> > bzero(_malloc_item, _size); \
> > } else { \
> > _malloc_item = malloc(_size, type, flags); \
>
> This confuses me too. If the constant-size/constant-flags/M_ZERO-is-set
> test fails, then it falls down to calling malloc(). Which we are in the
> middle of defining. So what does that expand to?
>
>
Expansion is not recursive, so this is an actual call to malloc.
--
Mateusz Guzik <mjguzik gmail.com>
More information about the svn-src-head
mailing list