ports/103571: [NEW PORT] textproc/p5-Text-Language-Guess: Trained module to guess a document's language

markun at onohara.to markun at onohara.to
Sun Sep 24 17:10:27 UTC 2006


>Number:         103571
>Category:       ports
>Synopsis:       [NEW PORT] textproc/p5-Text-Language-Guess: Trained module to guess a document's language
>Confidential:   no
>Severity:       non-critical
>Priority:       low
>Responsible:    freebsd-ports-bugs
>State:          open
>Quarter:        
>Keywords:       
>Date-Required:
>Class:          change-request
>Submitter-Id:   current-users
>Arrival-Date:   Sun Sep 24 17:10:25 GMT 2006
>Closed-Date:
>Last-Modified:
>Originator:     Masahiro Teramoto <markun at onohara.to>
>Release:        FreeBSD 4.11-RELEASE-p21 i386
>Organization:
>Environment:
System: FreeBSD gate.onohara.to 4.11-RELEASE-p21 FreeBSD 4.11-RELEASE-p21 #0: Thu Sep 7 22:37:27 JST 2006 root at gate.onohara.to:/usr/obj/usr/src/sys/RENTALv6 i386


	
>Description:
Text::Language::Guess guesses a document's language. Its implementation
is simple: Using "Text::ExtractWords" and "Lingua::StopWords" from CPAN,
it determines how many of the known stopwords the document contains for
each language supported by "Lingua::StopWords".

Each word in the document recognized as stopword of a particular
language scores one point for this language.

The "language_guess()" function takes a document as a parameter and
returns the abbreviation of the language that it is most likely written
in.

Author: Mike Schilli <cpan at perlmeister.com>
WWW:    http://search.cpan.org/~mschilli/Text-Language-Guess-0.02/

>How-To-Repeat:
	
>Fix:

# This is a shell archive.  Save it in a file, remove anything before
# this line, and then unpack it by entering "sh file".  Note, it may
# create directories; files and directories will be owned by you and
# have default permissions.
#
# This archive contains:
#
#	p5-Text-Language-Guess
#	p5-Text-Language-Guess/Makefile
#	p5-Text-Language-Guess/distinfo
#	p5-Text-Language-Guess/pkg-descr
#	p5-Text-Language-Guess/pkg-plist
#
echo c - p5-Text-Language-Guess
mkdir -p p5-Text-Language-Guess > /dev/null 2>&1
echo x - p5-Text-Language-Guess/Makefile
sed 's/^X//' >p5-Text-Language-Guess/Makefile << 'END-of-p5-Text-Language-Guess/Makefile'
X# New ports collection makefile for:	textproc/p5-Text-Language-Guess
X# Date created:		2006-09-25
X# Whom:			Masahiro Teramoto <markun at onohara.to>
X#
X# $FreeBSD$
X#
X
XPORTNAME=	Text-Language-Guess
XPORTVERSION=	0.02
XCATEGORIES=	textproc perl5
XMASTER_SITES=	${MASTER_SITE_PERL_CPAN}
XMASTER_SITE_SUBDIR=	../../authors/id/M/MS/MSCHILLI
XPKGNAMEPREFIX=	p5-
X
XMAINTAINER=	markun at onohara.to
XCOMMENT=	Trained module to guess a document's language
X
XBUILD_DEPENDS=	p5-Log-Log4perl>=1.0:${PORTSDIR}/devel/p5-Log-Log4perl \
X		p5-Text-ExtractWords>=0:${PORTSDIR}/textproc/p5-Text-ExtractWords \
X		p5-Lingua-StopWords>=0:${PORTSDIR}/textproc/p5-Lingua-StopWords
XRUN_DEPENDS=	${BUILD_DEPENDS}
X
XPERL_CONFIGURE=	yes
X
XMAN1=		language-guess.1
XMAN3=		Text::Language::Guess.3
X
X.include <bsd.port.pre.mk>
X.include <bsd.port.post.mk>
END-of-p5-Text-Language-Guess/Makefile
echo x - p5-Text-Language-Guess/distinfo
sed 's/^X//' >p5-Text-Language-Guess/distinfo << 'END-of-p5-Text-Language-Guess/distinfo'
XMD5 (Text-Language-Guess-0.02.tar.gz) = 66fbb68b17c3e62febbba633111f852e
XSHA256 (Text-Language-Guess-0.02.tar.gz) = 12ef612c1de0451367d403db73723446b836e2e10adeec5e9386b7baa8ede12f
XSIZE (Text-Language-Guess-0.02.tar.gz) = 5377
END-of-p5-Text-Language-Guess/distinfo
echo x - p5-Text-Language-Guess/pkg-descr
sed 's/^X//' >p5-Text-Language-Guess/pkg-descr << 'END-of-p5-Text-Language-Guess/pkg-descr'
XText::Language::Guess guesses a document's language. Its implementation
Xis simple: Using "Text::ExtractWords" and "Lingua::StopWords" from CPAN,
Xit determines how many of the known stopwords the document contains for
Xeach language supported by "Lingua::StopWords".
X
XEach word in the document recognized as stopword of a particular
Xlanguage scores one point for this language.
X
XThe "language_guess()" function takes a document as a parameter and
Xreturns the abbreviation of the language that it is most likely written
Xin.
X
XAuthor:	Mike Schilli <cpan at perlmeister.com>
XWWW:	http://search.cpan.org/~mschilli/Text-Language-Guess-0.02/
END-of-p5-Text-Language-Guess/pkg-descr
echo x - p5-Text-Language-Guess/pkg-plist
sed 's/^X//' >p5-Text-Language-Guess/pkg-plist << 'END-of-p5-Text-Language-Guess/pkg-plist'
X at comment $FreeBSD$
Xbin/language-guess
X%%SITE_PERL%%/%%PERL_ARCH%%/auto/Text/Language/Guess/.packlist
X%%SITE_PERL%%/Text/Language/Guess.pm
X at dirrmtry %%SITE_PERL%%/%%PERL_ARCH%%/auto/Text/Language/Guess
X at dirrmtry %%SITE_PERL%%/%%PERL_ARCH%%/auto/Text/Language
X at dirrmtry %%SITE_PERL%%/%%PERL_ARCH%%/auto/Text
X at dirrmtry %%SITE_PERL%%/Text/Language
X at dirrmtry %%SITE_PERL%%/Text
END-of-p5-Text-Language-Guess/pkg-plist
exit

>Release-Note:
>Audit-Trail:
>Unformatted:



More information about the freebsd-ports-bugs mailing list