From nobody Mon Sep 11 03:40:22 2023
Date: Mon, 11 Sep 2023 03:40:22 GMT
Message-Id: <202309110340.38B3eM8Z069809@gitrepo.freebsd.org>
To: ports-committers@FreeBSD.org, dev-commits-ports-all@FreeBSD.org,
    dev-commits-ports-main@FreeBSD.org
From: Yuri Victorovich
Subject: git: 5ef5e58a9051 - main - textproc/py-pdf2docx: New port: Library and applicaton to convert pdf to docx
List-Id: Commit messages for all branches of the ports repository
List-Archive: https://lists.freebsd.org/archives/dev-commits-ports-all
Sender: owner-dev-commits-ports-all@freebsd.org
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: 8bit
X-Git-Committer: yuri
X-Git-Repository: ports
X-Git-Refname: refs/heads/main
X-Git-Reftype: branch
X-Git-Commit: 5ef5e58a9051aad258aa948376badacc97ef130c
Auto-Submitted: auto-generated

The branch main has been updated by yuri:

URL: https://cgit.FreeBSD.org/ports/commit/?id=5ef5e58a9051aad258aa948376badacc97ef130c

commit 5ef5e58a9051aad258aa948376badacc97ef130c
Author:     Yuri Victorovich
AuthorDate: 2023-09-11 03:39:34 +0000
Commit:     Yuri Victorovich
CommitDate: 2023-09-11 03:40:15 +0000

    textproc/py-pdf2docx: New port: Library and applicaton to convert pdf to docx
---
 textproc/Makefile              |  1 +
 textproc/py-pdf2docx/Makefile  | 29 +++++++++++++++++++++++++++++
 textproc/py-pdf2docx/distinfo  |  3 +++
 textproc/py-pdf2docx/pkg-descr |  6 ++++++
 4 files changed, 39 insertions(+)

diff --git a/textproc/Makefile b/textproc/Makefile
index 4394d47e183c..08f9879c5656 100644
--- a/textproc/Makefile
+++ b/textproc/Makefile
@@ -1467,6 +1467,7 @@
     SUBDIR += py-parsimonious
     SUBDIR += py-parso
     SUBDIR += py-patiencediff
+    SUBDIR += py-pdf2docx
     SUBDIR += py-pdfminer.six
     SUBDIR += py-pdfminer3k
     SUBDIR += py-pdfrw
diff --git a/textproc/py-pdf2docx/Makefile b/textproc/py-pdf2docx/Makefile
new file mode 100644
index 000000000000..e7244113b971
--- /dev/null
+++ b/textproc/py-pdf2docx/Makefile
@@ -0,0 +1,29 @@
+PORTNAME=	pdf2docx
+DISTVERSION=	0.5.6
+CATEGORIES=	textproc python
+MASTER_SITES=	PYPI
+PKGNAMEPREFIX=	${PYTHON_PKGNAMEPREFIX}
+
+MAINTAINER=	yuri@FreeBSD.org
+COMMENT=	Library and applicaton to convert pdf to docx
+WWW=		https://github.com/dothinking/pdf2docx
+
+LICENSE=	GPLv3
+LICENSE_FILE=	${WRKSRC}/LICENSE
+
+PY_DEPENDS=	${PYTHON_PKGNAMEPREFIX}fire>=0.3.0:devel/py-fire@${PY_FLAVOR} \
+		${PYTHON_PKGNAMEPREFIX}PyMuPDF>=1.19.0:print/py-PyMuPDF@${PY_FLAVOR} \
+		${PYTHON_PKGNAMEPREFIX}fonttools>=4.24.0:print/py-fonttools@${PY_FLAVOR} \
+		${PYNUMPY} \
+		${PYTHON_PKGNAMEPREFIX}python-docx>=0.8.10:textproc/py-python-docx@${PY_FLAVOR} \
+		opencv>0:graphics/opencv
+BUILD_DEPENDS=	${PY_DEPENDS} \
+		${PYTHON_PKGNAMEPREFIX}pip>0:devel/py-pip@${PY_FLAVOR}
+RUN_DEPENDS=	${PY_DEPENDS}
+
+USES=		python
+USE_PYTHON=	distutils concurrent autoplist
+
+NO_ARCH=	yes
+
+.include
diff --git a/textproc/py-pdf2docx/distinfo b/textproc/py-pdf2docx/distinfo
new file mode 100644
index 000000000000..91cc3c3f364c
--- /dev/null
+++ b/textproc/py-pdf2docx/distinfo
@@ -0,0 +1,3 @@
+TIMESTAMP = 1694388825
+SHA256 (pdf2docx-0.5.6.tar.gz) = 364c8b78a6c1cfd96a1d2c659b06ba90fc116535c998e398641da80a8b5262c2
+SIZE (pdf2docx-0.5.6.tar.gz) = 3085029
diff --git a/textproc/py-pdf2docx/pkg-descr b/textproc/py-pdf2docx/pkg-descr
new file mode 100644
index 000000000000..b0d2d9e319a8
--- /dev/null
+++ b/textproc/py-pdf2docx/pkg-descr
@@ -0,0 +1,6 @@
+pdf2docx is a Python program and library to convert pdf to docx.
+
+pdf2docx allows to:
+* Extract data from PDF with PyMuPDF, e.g. text, images and drawings
+* Parse layout with rule, e.g. sections, paragraphs, images and tables
+* Generate docx with python-docx
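
For orientation only, and not part of the commit: the pkg-descr above lists extraction, layout parsing and docx generation as separate capabilities. As far as upstream's documentation describes it, the library's Converter class runs them in one call. The sketch below assumes the port is installed and that the Converter(pdf).convert(docx) and close() interface applies to 0.5.6. The helper names default_docx_path and convert_pdf are hypothetical and are not provided by pdf2docx.

```python
from pathlib import Path
from typing import Optional


def default_docx_path(pdf_path: str) -> str:
    """Return pdf_path with its final suffix replaced by .docx (hypothetical helper)."""
    return str(Path(pdf_path).with_suffix(".docx"))


def convert_pdf(pdf_path: str, docx_path: Optional[str] = None) -> str:
    """Convert one PDF to docx and return the output path (hypothetical helper)."""
    # Imported here rather than at module level so that this file can be
    # loaded on a machine where textproc/py-pdf2docx is not installed.
    from pdf2docx import Converter

    target = docx_path or default_docx_path(pdf_path)
    converter = Converter(pdf_path)
    try:
        # With no page arguments, convert() processes the whole document.
        converter.convert(target)
    finally:
        # Release the underlying PDF handle even if conversion fails.
        converter.close()
    return target
```

Calling convert_pdf("report.pdf") would write report.docx next to the input; pass a second argument to choose a different output path.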