[Bug 265768] [NEW PORT] textproc/py-textract: Extract text from any document

From: <bugzilla-noreply_at_freebsd.org>
Date: Wed, 10 Aug 2022 19:14:37 UTC
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=265768

            Bug ID: 265768
           Summary: [NEW PORT] textproc/py-textract: Extract text from any
                    document
           Product: Ports & Packages
           Version: Latest
          Hardware: Any
                OS: Any
            Status: New
          Severity: Affects Only Me
          Priority: ---
         Component: Individual Port(s)
          Assignee: ports-bugs@FreeBSD.org
          Reporter: DtxdF@riseup.net

Created attachment 235833
  --> https://bugs.freebsd.org/bugzilla/attachment.cgi?id=235833&action=edit
textproc-py-textract-1.6.5.patch

textract provides a single interface for extracting content from Word
documents, PowerPoint presentations, PDFs, and many other formats, so
that the text can be used for further analysis and visualization.

WWW: https://github.com/deanmalmgren/textract
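
For reviewers who want to smoke-test the port after installation, the
"single interface" mentioned above can be exercised roughly as follows.
This is only a minimal sketch: the helper name extract_text is
hypothetical and not part of the port or of textract, and it assumes
that textract.process() returns UTF-8 encoded bytes, which is the
upstream default.

```python
def extract_text(path, encoding="utf-8"):
    """Return the text content of the document at `path` as a str.

    Hypothetical helper for illustration; not part of textract itself.
    """
    # Imported lazily so this file can be loaded even on a machine
    # where the textract package is not installed yet.
    import textract

    # textract.process() selects a parser from the file extension and
    # returns the extracted text as bytes (UTF-8 by default, assumed here).
    raw = textract.process(path)
    return raw.decode(encoding)
```

A call such as extract_text("report.docx") (the file name is an
example only) should then return the document body as a string,
provided the parser for that format and its dependencies are installed.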

portlint: looks fine.
poudriere: testport passes with all options enabled, with no options
enabled, and with the default options (including option groups).

Requirements:

* audio/py-pocketsphinx [1]
* textproc/python-pptx [2]
* textproc/py-extract-msg [3]

[1] https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=265766
[2] https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=265763
[3] https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=265765
