svn commit: r547859 - head/math/openblas
Piotr Kubaj
pkubaj at FreeBSD.org
Sun Sep 6 17:52:26 UTC 2020
Author: pkubaj
Date: Sun Sep 6 17:52:25 2020
New Revision: 547859
URL: https://svnweb.freebsd.org/changeset/ports/547859
Log:
math/openblas: update to 0.3.10, add POWER8 option
Changelog:
common:
Improved thread locking behaviour in blas_server and parallel getrf
Imported bugfix 394 from LAPACK (spurious reference to "XERBL"
due to overlong lines)
Imported bugfix 403 from LAPACK (compile option "recursive" required
for correctness with Intel and PGI)
Imported bugfix 408 from LAPACK (wrong scaling in ZHEEQUB)
Imported bugfix 411 from LAPACK (infinite loop in LARGV/LARTG/LARTGP)
Fixed mismatches between BUFFERSIZE and GEMM_UNROLL parameters that
could lead to crashes at large matrix sizes
Restored internal soname in dynamic libraries on FreeBSD and DragonFly
Added API (openblas_setaffinity) to set thread affinity
programmatically on Linux
Added initial infrastructure for half-precision floating point
(bfloat16) support with a generic implementation of SHGEMM
Added CMAKE build system support for building the cblas_Xgemm3m
functions
Fixed CMAKE support for building in a path with embedded spaces
Fixed CMAKE (non)handling of NO_EXPRECISION and MAX_STACK_ALLOC
Fixed GCC version detection in the Makefiles
Allowed overriding the names of AR, AS and LD in Makefile builds
POWER:
Fixed big-endian POWER8 ELFv2 builds on FreeBSD
Fixed GCC version checks and DYNAMIC_ARCH builds on POWER9
Fixed CMAKE build support for POWER9
Fixed a potential race condition in the thread buffer allocation
Worked around LAPACK test failures on PPC G4
MIPS:
Fixed a potential race condition in the thread buffer allocation
Added support for MIPS 24K/24KE family based on P5600 kernels
MIPS64:
Fixed a potential race condition in the thread buffer allocation
Added TARGET=GENERIC
ARMV7:
Fixed a race condition in the thread buffer allocation
ARMV8:
Fixed a race condition in the thread buffer allocation
Fixed zero initialisation in the assembly for SGEMM and DGEMM BETA
Improved performance of the ThunderX2 DAXPY kernel
Added an optimized SGEMM kernel for Cortex A53
Fixed Makefile support for INTERFACE64 (8-byte integer)
x86_64:
Fixed a syntax error in the CMAKE setup for SkylakeX
Improved performance of STRSM on Haswell, SkylakeX and Ryzen
Improved SGEMM performance for workloads with ldc a
multiple of 1024
Improved DGEMM performance on Skylake X
Fixed unwanted AVX512-dependency of SGEMM in DYNAMIC_ARCH
builds created on SkylakeX
Removed data alignment requirement in the SSE2 copy kernels
that could cause spurious crashes
Added a workaround for an optimizer bug in AppleClang 11.0.3
Fixed LAPACK-TEST failures with Intel Fortran
Fixed compilation and LAPACK test results with recent Flang
and AMD AOCC
Fixed DYNAMIC_ARCH builds with CMAKE on OS X
Fixed missing exports of cblas_i?amin, cblas_i?min, cblas_i?max,
cblas_?sum, cblas_?gemm3m in the shared library on OS X
Fixed reporting of cpu name in DYNAMIC_ARCH builds (would sometimes
show the name of an older generation chip supported by the same kernels)
IBM Z:
Improved performance of SGEMM/STRMM and DGEMM/DTRMM on Z14
PR: 249120
Approved by: phd_kimberlite at yahoo.co.jp (maintainer)
Modified:
head/math/openblas/Makefile
head/math/openblas/distinfo
Modified: head/math/openblas/Makefile
==============================================================================
--- head/math/openblas/Makefile Sun Sep 6 17:15:07 2020 (r547858)
+++ head/math/openblas/Makefile Sun Sep 6 17:52:25 2020 (r547859)
@@ -3,8 +3,7 @@
PORTNAME= openblas
DISTVERSIONPREFIX= v
-DISTVERSION= 0.3.9
-PORTREVISION= 2
+DISTVERSION= 0.3.10
PORTEPOCH= 1
CATEGORIES= math
MASTER_SITES= NL/lapack/timing/:lapack_tmg
@@ -12,10 +11,6 @@ DISTFILES= large.tgz:lapack_tmg \
timing.tgz:lapack_tmg
DIST_SUBDIR= openblas
-PATCH_SITES= https://github.com/${GH_ACCOUNT}/${GH_PROJECT}/commit/
-PATCHFILES= ee2e758278b5d82b7242f505ea694f082ef65879.patch:-p1 # fixes the regression: https://github.com/xianyi/OpenBLAS/pull/2512
-PATCHFILES+= 3eda3d34c3d9bfabb29e1d8a69860932334a815d.patch:-p1 # fixes build on powerpc (PPCG4)
-
MAINTAINER= phd_kimberlite at yahoo.co.jp
COMMENT= Optimized BLAS library based on GotoBLAS2
@@ -44,9 +39,11 @@ OPTIONS_DEFINE= DYNAMIC_ARCH INTERFACE64 OPENMP
OPTIONS_DEFAULT= OPENMP
OPTIONS_DEFINE_i386= AVX AVX2
OPTIONS_DEFINE_amd64= AVX AVX2
-OPTIONS_DEFINE_powerpc64= POWER6
OPTIONS_EXCLUDE_powerpc64= OPENMP
OPTIONS_EXCLUDE_powerpc= OPENMP
+OPTIONS_RADIO= ${OPTIONS_RADIO_${ARCH}}
+OPTIONS_RADIO_powerpc64= POWER
+OPTIONS_RADIO_POWER= POWER6 POWER8
OPTIONS_SUB= yes
.if defined(BATCH) || defined(PACKAGE_BUILDING)
@@ -60,6 +57,7 @@ OPENMP_DESC= Use OpenMP for threading
AVX_DESC= Support Advanced Vector Extensions (AVX)
AVX2_DESC= Support Advanced Vector Extensions 2 (AVX2)
POWER6_DESC= Optimize for POWER6, instead of the default PPC970
+POWER8_DESC= Optimize for POWER8, instead of the default PPC970
.include <bsd.port.options.mk>
@@ -67,6 +65,8 @@ POWER6_DESC= Optimize for POWER6, instead of the defa
USE_GCC= yes
. if ${PORT_OPTIONS:MPOWER6}
TARGET_CPU_ARCH= POWER6
+. elif ${PORT_OPTIONS:MPOWER8}
+TARGET_CPU_ARCH= POWER8
.else
TARGET_CPU_ARCH= PPC970
. endif
Modified: head/math/openblas/distinfo
==============================================================================
--- head/math/openblas/distinfo Sun Sep 6 17:15:07 2020 (r547858)
+++ head/math/openblas/distinfo Sun Sep 6 17:52:25 2020 (r547859)
@@ -1,11 +1,7 @@
-TIMESTAMP = 1591268760
+TIMESTAMP = 1592213495
SHA256 (openblas/large.tgz) = f328d88b7fa97722f271d7d0cfea1c220e0f8e5ed5ff01d8ef1eb51d6f4243a1
SIZE (openblas/large.tgz) = 2595
SHA256 (openblas/timing.tgz) = 999c65f8ea8bd4eac7f1c7f3463d4946917afd20a997807300fe35d70122f3af
SIZE (openblas/timing.tgz) = 1059485
-SHA256 (openblas/xianyi-OpenBLAS-v0.3.9_GH0.tar.gz) = 17d4677264dfbc4433e97076220adc79b050e4f8a083ea3f853a53af253bc380
-SIZE (openblas/xianyi-OpenBLAS-v0.3.9_GH0.tar.gz) = 12184323
-SHA256 (openblas/ee2e758278b5d82b7242f505ea694f082ef65879.patch) = 9210a94ae418b4ce6d9f7fab4e00dd3155451f5ef3769536a4d954a2149ca733
-SIZE (openblas/ee2e758278b5d82b7242f505ea694f082ef65879.patch) = 2028
-SHA256 (openblas/3eda3d34c3d9bfabb29e1d8a69860932334a815d.patch) = 849b91e8422b87ad58687435a921e295dbdd74eec9597331d10b7438c39f246b
-SIZE (openblas/3eda3d34c3d9bfabb29e1d8a69860932334a815d.patch) = 2988
+SHA256 (openblas/xianyi-OpenBLAS-v0.3.10_GH0.tar.gz) = 0484d275f87e9b8641ff2eecaa9df2830cbe276ac79ad80494822721de6e1693
+SIZE (openblas/xianyi-OpenBLAS-v0.3.10_GH0.tar.gz) = 12246979
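
The distinfo hunk above pairs each distfile with a SHA256 digest and a byte size. As a rough illustration of how those entries can be read and checked, here is a minimal Python sketch. It is not the ports framework's own checksum code. The helper names parse_distinfo and matches are hypothetical, and the line format is assumed from the entries shown in this commit only.

```python
import hashlib
import re

# Line shapes assumed from the distinfo hunk in this commit:
#   TIMESTAMP = 1592213495
#   SHA256 (openblas/large.tgz) = f328...
#   SIZE (openblas/large.tgz) = 2595
_ENTRY = re.compile(r"^(SHA256|SIZE) \((.+)\) = (\S+)$")


def parse_distinfo(text):
    """Return {path: {"sha256": hex, "size": int}} for each listed distfile."""
    entries = {}
    for line in text.splitlines():
        match = _ENTRY.match(line.strip())
        if match is None:
            continue  # TIMESTAMP lines and blank lines carry no per-file data
        kind, path, value = match.groups()
        record = entries.setdefault(path, {})
        if kind == "SHA256":
            record["sha256"] = value.lower()
        else:
            record["size"] = int(value)
    return entries


def matches(record, data):
    """Check raw distfile bytes against one parsed record (hypothetical helper)."""
    return (len(data) == record.get("size")
            and hashlib.sha256(data).hexdigest() == record.get("sha256"))
```

To use it, read the port's distinfo file as text, pass it to parse_distinfo, then call matches with the record for a given path and the bytes of the downloaded distfile. A False result means the size or the digest differs from the recorded value.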