kern/133980: panic: ffs_valloc: dup alloc

Fri May 8 02:54:03 UTC 2009

On Fri, 8 May 2009, John Kilburg wrote:

> Subject: Re: kern/133980: panic: ffs_valloc: dup alloc
>
> I am guessing now that this is a problem that I've seen mentioned
> in passing as a possibility in a few postings about the maximum
> number of inodes for UFS2.
>
> Using the default settings the filesystem ended up with more than
> 2^31 inodes.  I decreased the inode density to about half the
> default (~1.4M inodes) and now things seem to be working perfectly.
> I rsync'd with 3 sessions, did a make buildworld and otherwise
> loaded the thing with no problems at all.
>
> It would be nice if newfs (or something) warned people about the
> effective 2^31 inode limit for ufs2 until the problem is spotted.
> I guess the real fix is to switch to a filesystem that doesn't
> have static inode allocation. :)

ino_t is uint32_t, so the limit should be 2^32.

This might be easy to fix.  I think newfs just allocates space for
inode blocks without ever counting inodes (and thus without checking
that all the inode blocks are usable).  It could easily count.  Then
ffs should simply refuse to allocate inodes with numbers >= 2^32,
but it has the following bad code in ffs_alloc.c:

ffs_valloc():
% 	ino = (ino_t)ffs_hashalloc(pip, cg, ipref, mode, ffs_nodealloccg);

This blindly truncates any value >= 2^32 returned by ffs_hashalloc().
(ffs_hashalloc() returns an ffs2 (sic) block number (int64_t) but is
abused to to allocate and return an inode number here.)

% static ufs2_daddr_t
% ffs_nodealloccg(ip, cg, ipref, mode)

This used to return ino_t.  Then the function itself was invalidly cast to a
common type in the above use of it as a parameter to ffs_hashalloccg(),
so as to break the warning about this type mismatch.  Now a common return
type is used instead.  This is valid, but we should be careful not to
return an out-of-range value.

% 	struct inode *ip;
% 	int cg;
% 	ufs2_daddr_t ipref;
% 	int mode;
% {
% 	...
% 	return (cg * fs->fs_ipg + ipref);

This seems to be the main place where overflow may occur.  Although
the function can now return a 64-bit value exceeding 2^32-1, cg is
plain int and fs_ipg is int32_t, so the multiplication will overflow
at 2^31.  ipref has type ino_t, but it is only a small adjustment
within the inode block, so adding it normally won't overflow or get
near 2^32.  Thus overflow is most likely to occur only in the
multiplication here.

% /*VARARGS5*/

Historical garbage -- variadic args are neither supported or used here.

% static ufs2_daddr_t
% ffs_hashalloc(ip, cg, pref, size, allocator)
% 	struct inode *ip;
% 	int cg;
% 	ufs2_daddr_t pref;
% 	int size;	/* size for data blocks, mode for inodes */
% 	allocfcn_t *allocator;
% {
% 	struct fs *fs;
% 	ufs2_daddr_t result;
% ...
% 	 * 1: preferred cylinder group
% 	 */
% 	result = (*allocator)(ip, cg, pref, size);

Since (*allocator)() returns the common type ufs2_daddr_t and is always
assigned to 'ufs2_daddr_t result' and then returned as a ufs2_daddr_t,
no further overflows are possible before the cast in ffs_valloc().
(For uses of ffs_hashalloc() to allocate block numbers for ffs1, overflow
would occur on assignment to a ufs1_daddr_t without even a cast if
ffs_hashalloc() somehow generated an out-of-range value.  ffs1 uses the
same block allocator as ffs2 and depends on newfs only generating values
that can't cause overflow for this to work.  So does ffs2, but overflow
for 64-bit block numbers is far off.)

Bruce