svn commit: r446984 - in head/www: . py-html5-parser

Guido Falsi madpilot at FreeBSD.org
Mon Jul 31 16:22:21 UTC 2017


Author: madpilot
Date: Mon Jul 31 16:22:20 2017
New Revision: 446984
URL: https://svnweb.freebsd.org/changeset/ports/446984

Log:
  A fast implementation of the HTML 5 parsing spec for Python. Parsing
  is done in C using a variant of the gumbo parser. The gumbo parse
  tree is then transformed into an lxml tree, also in C, yielding
  parse times that can be a thirtieth of the html5lib parse times.
  That is a speedup of 30x. This differs, for instance, from the gumbo
  python bindings, where the initial parsing is done in C but the
  transformation into the final tree is done in python.
  
  WWW: https://html5-parser.readthedocs.io/

Added:
  head/www/py-html5-parser/
  head/www/py-html5-parser/Makefile   (contents, props changed)
  head/www/py-html5-parser/distinfo   (contents, props changed)
  head/www/py-html5-parser/pkg-descr   (contents, props changed)
Modified:
  head/www/Makefile

Modified: head/www/Makefile
==============================================================================
--- head/www/Makefile	Mon Jul 31 16:02:03 2017	(r446983)
+++ head/www/Makefile	Mon Jul 31 16:22:20 2017	(r446984)
@@ -1668,6 +1668,7 @@
     SUBDIR += py-horizon
     SUBDIR += py-hpack
     SUBDIR += py-html
+    SUBDIR += py-html5-parser
     SUBDIR += py-html5lib
     SUBDIR += py-http-parser
     SUBDIR += py-httpie

Added: head/www/py-html5-parser/Makefile
==============================================================================
--- /dev/null	00:00:00 1970	(empty, because file is newly added)
+++ head/www/py-html5-parser/Makefile	Mon Jul 31 16:22:20 2017	(r446984)
@@ -0,0 +1,19 @@
+# $FreeBSD$
+
+PORTNAME=	html5-parser
+PORTVERSION=	0.4.3
+CATEGORIES=	www python
+MASTER_SITES=	CHEESESHOP
+PKGNAMEPREFIX=	${PYTHON_PKGNAMEPREFIX}
+
+MAINTAINER=	madpilot at FreeBSD.org
+COMMENT=	Fast implementation of the HTML 5 parsing spec for Python
+
+LICENSE=	APACHE20
+
+BUILD_DEPENDS=	${PYTHON_PKGNAMEPREFIX}lxml>=3.8.0:devel/py-lxml
+
+USES=		pkgconfig python
+USE_PYTHON=	autoplist distutils
+
+.include <bsd.port.mk>

Added: head/www/py-html5-parser/distinfo
==============================================================================
--- /dev/null	00:00:00 1970	(empty, because file is newly added)
+++ head/www/py-html5-parser/distinfo	Mon Jul 31 16:22:20 2017	(r446984)
@@ -0,0 +1,3 @@
+TIMESTAMP = 1501237401
+SHA256 (html5-parser-0.4.3.tar.gz) = dd5e3647c5919439c41600172ef96b5fdbf278028bd4000476f87412c4fb7b9c
+SIZE (html5-parser-0.4.3.tar.gz) = 261906

Added: head/www/py-html5-parser/pkg-descr
==============================================================================
--- /dev/null	00:00:00 1970	(empty, because file is newly added)
+++ head/www/py-html5-parser/pkg-descr	Mon Jul 31 16:22:20 2017	(r446984)
@@ -0,0 +1,9 @@
+A fast implementation of the HTML 5 parsing spec for Python. Parsing
+is done in C using a variant of the gumbo parser. The gumbo parse
+tree is then transformed into an lxml tree, also in C, yielding
+parse times that can be a thirtieth of the html5lib parse times.
+That is a speedup of 30x. This differs, for instance, from the gumbo
+python bindings, where the initial parsing is done in C but the
+transformation into the final tree is done in python.
+
+WWW: https://html5-parser.readthedocs.io/


More information about the svn-ports-all mailing list