youshi10 at u.washington.edu
Mon Jul 2 04:32:15 UTC 2007
Pardon me for being naive, but wouldn't it be wiser for all of the data
in the +CONTENTS file to be aggregated into sections instead of being
stored as line-by-line info?
[~100 lines of repetitive data...]
Could be aggregated into:
or something similar to XML.
This would reduce the file size from n bytes to
n - (9 + 4 - 1) * i_entries + 8 bytes, with i_entries being the number
of entries. In larger package files this would cut the amount of data
to parse considerably. Also, more powerful scripting languages like
Perl or Python, or smart parsers in C, could make short work of this
data and just extract the MD5 elements for comparison.
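As a rough illustration of the two points above, here is a minimal Python sketch. It assumes the existing layout in which each file path is followed by an "@comment MD5:<hex>" line; the function names and the sample data are made up for the example, and the size estimate simply applies the formula as written above.

```python
import re

# Assumption: an MD5 record is a line of the form "@comment MD5:<32 hex digits>"
# that follows the path of the file it describes.
MD5_LINE = re.compile(r"^@comment MD5:([0-9a-f]{32})$")


def extract_md5s(contents_text):
    """Return (path, md5) pairs found in +CONTENTS-style text."""
    pairs = []
    last_path = None
    for line in contents_text.splitlines():
        match = MD5_LINE.match(line)
        if match and last_path is not None:
            pairs.append((last_path, match.group(1)))
            last_path = None
        elif line and not line.startswith("@"):
            # Any non-directive line is treated as a file path.
            last_path = line
    return pairs


def estimated_size(n_bytes, i_entries):
    """Apply the estimate from this post: n - (9 + 4 - 1) * i_entries + 8."""
    return n_bytes - (9 + 4 - 1) * i_entries + 8


# Hypothetical sample data, not taken from a real package.
SAMPLE = """@name example-1.0
@cwd /usr/local
bin/example
@comment MD5:0123456789abcdef0123456789abcdef
man/man1/example.1.gz
@comment MD5:fedcba9876543210fedcba9876543210
@unexec rmdir %D/share/example 2>/dev/null || true
"""

if __name__ == "__main__":
    for path, md5 in extract_md5s(SAMPLE):
        print(md5, path)
    print(estimated_size(10000, 100))
```

With the formula as given, a 10000-byte file with 100 entries would come out at 10000 - 12 * 100 + 8 bytes.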
Also, with a little extra work at package creation time to group all
the sections together, I think the file size could be reduced
considerably.
Fields similar to @comment MD5 could be reduced too, I believe, though
probably with less benefit, apart from the @unexec rmdir lines and the like.
Either that, or the data should be organized into separate files, I
think (this increases the number of files, but reduces overall
processing time, IMO).