Uppercase RE matching problems in FreeBSD 11

Stefan Bethke stb at lassitu.de
Sun Nov 6 21:30:18 UTC 2016


> Am 06.11.2016 um 22:14 schrieb Stefan Ehmann <shoesoft at gmx.net>:
> 
>> That is rather surprising.  Is there a normative reference for the
>> treatment of bracket expressions and character classes when using
>> locales other than C and/or encodings like UTF-8?
> 
> I found an interesting article about this issue in gawk:
> https://www.gnu.org/software/gawk/manual/html_node/Ranges-and-Locales.html

OK, I give up.  Back to jwz: "now you have two problems.“

Although with en_US.UTF-8 on other systems, I have not had that experience.  A quick check on stuff I have immediate access to:

macOS 10.12:
$ echo 'abcdABCD' | sed 's/[A-Z]/X/g’
abcdXXXX

Ubuntu 14.04.5
$ echo 'abcdABCD' | sed 's/[A-Z]/X/g’
abcdXXXX

FreeBSD 10-stable
$ echo 'abcdABCD' | sed 's/[A-Z]/X/g'
abcdXXXX


Stefan

-- 
Stefan Bethke <stb at lassitu.de>   Fon +49 151 14070811






More information about the freebsd-stable mailing list