Request for comments: port-tags
Anton Berezin
tobez at FreeBSD.org
Mon Nov 7 09:54:10 PST 2005
[moving back to the mailing list since this might be of interest]
On Mon, Nov 07, 2005 at 12:19:49PM -0500, Jim Trigg wrote:
> On Mon, November 7, 2005 10:46 am, Anton Berezin wrote:
> > The idea is to make ports classification easier and more convenient.
> > Instead of using predefined and limited set of port categories,
> > port-tags uses short one-word descriptions called tags. A port can have
> > an arbitrary number of tags associated with it. One can use the web
> > interface (and maybe a command-line interface in the future) to view
> > only the ports that have particular tags associated with them. This
> > process is very efficient in narrowing down the number of sought ports.
>
> How do you add a tag to a port? For example, mail/dovecot does not
> currently have the tag maildir, even though it supports maildir.
Currently it basically takes the existing categories a port is in, plus
the words which constitute the port's COMMENT. Then it applies a number
of heuristics, most significantly stemming and filtering out the common
"stopwords" like "a", "the", and so on. And then there is of course a
cutoff for those resulting tags which are too rare (otherwise the number
of resulting tags in the tagcloud would explode; it is already pretty
bad as it is, with 815 tags).
Since mail/dovecot does not mention "maildir" in it's COMMENT, no tag
for it. I was thinking about parsing the pkg-descr file as well, but I
was afraid it will explode the number of tags even more. Possibly I was
mistaken, it would be good to experiment with this approach.
Failing taking into account pkg-descr, one can clearly see why a "social
collaboration" mode could produce higher quality results, since at least
one person (you) would make sure that mail/dovecot is tagged with
"maildir". :-)
\Anton.
--
An undefined problem has an infinite number of solutions.
-- Robert A. Humphrey
More information about the freebsd-ports
mailing list